Home » AI and ML » Optical Character Recognition Document

Optical Character Recognition Document

Every day, a vast quantity of textual information is written or printed on tangible paper, such as study-related messages, invoices, periodicals, books, ads, and so on. Paper contamination is a major issue in the corporate world and has obvious environmental consequences. Aside from that, it will be difficult to keep a large quantity of information or conduct a quick look for information if we use physical paper in business. Both STS Software GmbH and the clients are affected by these issues.

INTRODUCTION

OUR APPROACHES

Our purpose is to convert text image data to text and then process the output text to extract some important information. To do that, we have applied some Deep Learning models in Computer Vision to detect the text location on the natural image and then recognize some specific words. We separate our system into multi parts from pre-processing input images to get the final meaning of the text.

As you could see, firstly our system will receive data from the input text image or printed image… This input data will be cleaned or pre-processed by some methods like enhancing the image quality, removing blur, noise, and normalization. Then, the system will run some Deep Learning models to detect the text region on the cleaned input image and recognize, classify each text to some specific word, and at this step, we will have the output text data. Finally, there is an NLP model to clean again this text data to make these text data meaningful and extract the necessary information from them.

USAGE

Step 01

Step 1: Access to the Optical Character Recognition site: https://experiment.saigontechnology.vn/invoice/ or https://experiment.saigontechnology.vn/cvparser Or you can access the main Saigon Technology AI Research Lab page here: https://experiment.saigontechnology.vn/ , select the Optical Character Recognition section, and click Try our demo button.

Step 02

On the Optical Character Recognition page, to start please click the Browse files button.

Step 03

Choose an image file (.png, .jpg or another image format…) you want to run.

Step 04

After the chosen image is uploaded, click the Run button to run the OCR model.

Step 05

The output of the OCR model will be drawn directly on the image like below.

Step 06

Scroll down to see the output text of the OCR model as below.

Next Case Studies

AI and ML

Natural Language Processing Toolkit

The Natural Language Processing Toolkit (NLTK) is a Python-based software application that offers a suite of tools for the purpose of processing natural language data.

AI and ML

Product Recognition

Utilizing AI-based Computer Vision techniques, the Product Recognition system autonomously detects and categorizes products present within images or videos.

Let’s Talk

Together with our developers and analysts, we begin by discussing and analysing our client’s needs, sketching the outline