Document Processor

Introduction

This tutorial walks you through building a Document Processor React application in Catalyst that uses AI/ML capabilities to detect, process, and extract specific data from scanned documents and images. The finished application will be able to do the following:

  1. Recognize handwritten or printed text in scanned digital documents or images, and convert it to a machine-readable format (a JSON response).

  2. Scan printed barcodes (1D and 2D formats), and extract the encoded information in them.

  3. Scan and process digital ID proofs such as PAN and AADHAAR cards, as well as other documents including passbooks and cheques, and extract the field records from those documents.

Note: PAN and AADHAAR are identity proof documents specific to the India region.

The client application will appear as follows:

[Screenshot: the locally hosted client application]

You can access a working application and test its functionality here: https://ziaservices.catalystserverlessapp.com

The Document Processor application leverages the following Catalyst services and their components:

  1. Catalyst Zia Services

    OCR: This component helps you detect and recognize textual characters in your input digital documents or images, and convert them to machine-encoded text. Catalyst OCR can recognize text in 10 major languages.

    Barcode Scanner: This component allows you to scan barcodes and extract the information encoded in them. You can learn more about the supported barcode formats from this page.

    Identity Scanner: This component enables you to scan and extract data from common ID proof documents such as AADHAAR cards, PAN cards, bank passbooks, and cheques. Refer to this page to learn more about the supported languages, file formats, and file sizes.

  2. Catalyst Serverless

    Functions: We will code an Advanced I/O function in Node.js that contains the logic to process and extract data from the input documents or images. The function uses the Catalyst Zia Services Node.js SDK package for this purpose.

  3. Catalyst Cloud Scale

    Web Client Hosting: We will create and initialize the web client as a React application using built-in Catalyst plugins. The client application will be hosted in Catalyst using Web Client Hosting.
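
To make the role of the Advanced I/O function described above more concrete, here is a minimal sketch of how such a function might route an input document to the matching Zia component. It is not the tutorial's actual function code. The helper `processDocument`, the `MODEL_TYPES` map, and the document type strings are hypothetical names introduced for illustration. The `zia` argument is assumed to be the Zia client obtained from the Catalyst Node.js SDK, and the method names and option keys used on it (`extractOpticalCharacters`, `scanBarcode`, `extractAadhaarCharacters`, `language`, `modelType`, `format`) are assumptions that should be checked against the SDK reference.

```javascript
// A minimal sketch, not the tutorial's actual function code.
// `zia` is assumed to be the Zia client from the Catalyst Node.js SDK.
// The method names and option keys used on it are assumptions and
// should be verified against the SDK reference before use.

// Hypothetical mapping from a request's document type to an OCR model type.
const MODEL_TYPES = new Map([
  ['ocr', 'OCR'],
  ['pan', 'PAN'],
  ['cheque', 'CHEQUE'],
  ['passbook', 'PASSBOOK'],
]);

// Hypothetical helper: routes one input document to the matching Zia call.
// `files.front` and `files.back` stand for readable streams of the uploaded images.
function processDocument(zia, docType, files, options = {}) {
  const language = options.language || 'eng';

  if (docType === 'barcode') {
    // Pass a barcode format only when the caller supplied one.
    const barcodeOptions = options.format ? { format: options.format } : {};
    return zia.scanBarcode(files.front, barcodeOptions);
  }

  if (docType === 'aadhaar') {
    // AADHAAR cards are assumed to need both the front and back images.
    return zia.extractAadhaarCharacters(files.front, files.back, language);
  }

  const modelType = MODEL_TYPES.get(docType);
  if (!modelType) {
    throw new Error(`Unsupported document type: ${docType}`);
  }
  return zia.extractOpticalCharacters(files.front, { language, modelType });
}
```

Passing the client in as an argument keeps the routing logic independent of the SDK, so it can be exercised with a stub object before it is wired into a real function.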

We will use the Catalyst web console and the Catalyst Command Line Interface (CLI) to build this application.

Note: If you use the Visual Studio Code IDE, you can install the Catalyst Tools extension and use your IDE in place of the CLI. You can find more details about the Catalyst VS Code extension in this help section.

You can fetch the tutorial’s code from this Git repository. You will have to replace the files in the project directory as directed in the function and client configuration sections of this tutorial.

Last Updated: 2024-06-18 12:08:24 +0530

Min Time to Complete: 20 mins

Difficulty Level: Beginner

Services Involved: Serverless, Cloud Scale, Zia Services