A text recognition method and device based on tesseract engine

A text recognition and engine technology, applied in character recognition, character and pattern recognition, instruments, etc., can solve the problems that cannot meet the needs of users' text recognition, cannot meet the needs of text recognition, low recognition rate, etc., and achieve text recognition efficiency High, high update efficiency, high recognition efficiency

Active Publication Date: 2019-02-05
HANGZHOU CCRFID MICROELECTRONICS
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, it cannot meet the needs of current users to recognize the printed text
[0005] The current image and text recognition technology is either very expensive or has a low recognition rate, which cannot meet the needs of current users for text recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text recognition method and device based on tesseract engine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The invention provides an OCR character recognition method and device. The present invention uses the API interface of the cloud server to call the cloud server for image character recognition, and at the same time upgrades the local character library by means of the cloud server. After the upgrade, the local tesseract engine module uses the data of the local character library to correct the recognition results during recognition and improve the recognition rate of the tesseract engine module. The specific steps of identification are as follows:

[0026] Step 1, receiving the picture to be recognized by the server;

[0027] Step 2, connect the server to the cloud server, the server transmits the image to be recognized to the tesseract engine module and the cloud server at the same time, the tesseract engine module and the cloud server perform text recognition on the image to be recognized at the same time, and feed back the recognition results to the server respectivel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a character recognition method based on a tesseract engine and a corresponding device thereof. A cloud server is used to upgrade a local character library to correct the recognition result of the tesseract engine, thereby improving the accuracy of the tesseract engine in recognizing characters in an image. The character recognition device of the present invention includes a server, a tesseract engine module, a cloud server and a local character library. When upgrading the local text library, the server uses the text recognized by the cloud server to correct the recognition result of the tesseract engine module, and supplements the text that the tesseract engine module cannot correctly recognize into the local text library. In this way, when performing character recognition, the recognition accuracy can be improved by querying the local character library. The present invention uses the upgraded local character library to correct the recognition result of the tesseract engine module, which can achieve the same accuracy as directly using the cloud server for character recognition, and can also shorten the time for character recognition calculations, and is suitable for daily learning and image recognition at work. into words.

Description

technical field [0001] The invention relates to an image recognition method, which belongs to the technical field of OCR character recognition (Optical Character Recognition, optical character recognition). Background technique [0002] OCR text recognition refers to the process of electronic devices (such as scanners or digital cameras) checking characters printed on paper, determining their shapes by detecting dark and light patterns, and then using character recognition methods to translate the shapes into computer text; that is, for Printed characters, using optical methods to convert the text in the paper document into a black and white dot matrix image file, and converting the text in the image into a text format through the recognition software, which can be further edited and processed by the word processing software. Optical character recognition (OCR) is the process of converting images of printed text into machine-encoded text. It is widely used to convert data r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/32G06K9/72
CPCG06V20/62G06V10/768G06V30/10
Inventor 孙磊秦阳莫凌飞杜喆宁姚昕宇齐恒冯增涛
Owner HANGZHOU CCRFID MICROELECTRONICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products