Document OCR AI Model

Usage

The Zenphi Document OCR (Optical Character Recognition) AI Model is designed to Identify and extract text from documents in over 200 languages for printed text and 50 languages for handwritten text. This model leverages advanced machine learning techniques to accurately recognize and extract text from documents, enabling automation and digitization of document-heavy processes.

Features

  • Text Extraction: Recognizes and extracts text from various document formats.
  • Language Support: Supports multiple languages for text recognition.
  • Document Types: Handles diverse document types including invoices, receipts, forms, and more.
  • High Accuracy: Provides high accuracy in text recognition, even for complex layouts.
  • Integration: Easily integrates with Zenphi workflows for seamless automation.

Use Cases

  • Invoice Processing: Automate the extraction of invoice data for financial systems.
  • Document Archiving: Convert physical documents into digital formats for archiving and easy retrieval.
  • Form Processing: Extract data from forms for input into databases or other systems.
  • Receipt Tracking: Automate the capture of receipt data for expense management.

How to create Document OCR AI Model

  1. Choose the OCR Document and click on it's Create button.

  1. From the left column, add the source file(s) as shown below.
  1. Click on the "Select Tool" in the toolbar.
  2. To extract a value, click and hold the left mouse button, then drag to cover the area you want to select.

  1. If the source file has multiple pages, you can select the desired pages from this section.
  2. Click on the "Plus Sign" button to define the value you want to extract.

  1. In the box that appears, enter a desired name for the extracted field and click on "Add"

  1. Once the field is named, it will appear in the list.
  2. By hovering over the field, you can view the value extracted by the OCR AI Model.


Review Results

  • Once the process is complete, review the extracted text.
  • Make any necessary adjustments or corrections.
  • Save and Publish the extracted data for further use.

Integrate with flow

  • In the designer page of your flow, select the "Run AI Model" from "Artificial Intelligence" category

  • On the settings page of the action, click on the "Select AI Model" field and from the dropdown list select your desired AI Model.


Best Practices

  • Document Quality: Ensure documents are clear and legible for better OCR accuracy.
  • Preprocessing: Use preprocessing techniques like image enhancement to improve text recognition.
  • Validation: Always review the extracted data to ensure accuracy before automating further processes.
  • Security: Handle sensitive data with care and comply with relevant data protection regulations.

Troubleshooting

  • Common Issues
    • Poor Recognition Accuracy: Ensure the document is clear and the correct language is selected.
    • Incomplete Text Extraction: Check if the document layout is too complex for the default settings and adjust the configuration.

Support

For additional support, contact Zenphi customer service or refer to the Zenphi help center.


Conclusion

The Zenphi Document OCR AI Model is a powerful tool for automating the extraction of text from various document types, enhancing efficiency and accuracy in document processing tasks. By following the steps and best practices outlined in this documentation, users can effectively utilize this model to streamline their workflows.