Tesseract engine based character recognition method and device

A text recognition and engine technology, applied in character recognition, character and pattern recognition, instruments, etc. It addresses the problems of high cost and low recognition rate, which leave text recognition needs unmet, and achieves high update efficiency and high recognition efficiency.

Active Publication Date: 2016-08-03
HANGZHOU CCRFID MICROELECTRONICS

AI Technical Summary

Problems solved by technology

[0005] Current image text recognition technology is either very expensive or has a low recognition rate, so it cannot meet users' current needs for recognizing printed text.



Embodiment Construction

[0025] The invention provides an OCR character recognition method and device. The server calls the cloud server's API to perform image character recognition and, at the same time, uses the cloud server to upgrade the local character library. After the upgrade, the local Tesseract engine module uses the data in the local character library to correct its recognition results during recognition, which improves the recognition rate of the Tesseract engine module. The specific recognition steps are as follows:

[0026] Step 1: the server receives the image to be recognized;

[0027] Step 2: the server connects to the cloud server and transmits the image to be recognized to both the Tesseract engine module and the cloud server. The Tesseract engine module and the cloud server perform text recognition on the image at the same time, and feed back the recognition results to the server respectivel...
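Step 2 describes sending the same image to two recognizers at once and collecting both results. The following is a minimal sketch of that dispatch pattern, not the patent's implementation. The names `recognize_both`, `fake_local`, and `fake_cloud` are hypothetical; the two stand-in functions return fixed strings in place of a real Tesseract call and a real cloud API request.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Tuple

# A recognizer takes raw image bytes and returns the recognized text.
Recognizer = Callable[[bytes], str]


def recognize_both(image: bytes, local: Recognizer, cloud: Recognizer) -> Tuple[str, str]:
    """Submit the same image to both recognizers and wait for both results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        local_future = pool.submit(local, image)
        cloud_future = pool.submit(cloud, image)
        return local_future.result(), cloud_future.result()


# Hypothetical stand-ins: a real system would call the Tesseract engine
# module and the cloud server's API here.
def fake_local(image: bytes) -> str:
    return "he1lo"


def fake_cloud(image: bytes) -> str:
    return "hello"


local_text, cloud_text = recognize_both(b"<image bytes>", fake_local, fake_cloud)
```

Threads are used here because both calls would mostly wait on I/O or an external process; other concurrency mechanisms would serve equally well.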


Abstract

The invention discloses a Tesseract engine based character recognition method and device. A cloud server updates a local character library, which is used to correct the recognition results of the Tesseract engine and so improves the accuracy with which the Tesseract engine recognizes characters in images. The character recognition device comprises a server, a Tesseract engine module, the cloud server and the local character library. When the local character library is upgraded, the server uses the characters recognized by the cloud server to correct the recognition result of the Tesseract engine module, and the characters that the Tesseract engine module cannot recognize correctly are added to the local character library. Recognition accuracy can then be improved by querying the local character library during character recognition. According to the invention, using the upgraded local character library to correct the recognition result of the Tesseract engine module achieves the accuracy of using the cloud server directly for character recognition while shortening the time the recognition operation takes. The method and device are suitable for converting images encountered in daily study and life into text.
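The upgrade-then-correct cycle in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the local literal pool (character library) is modeled as a plain dictionary from a misrecognized token to its corrected form, the two results are compared token by token, and results with different token counts are skipped because their alignment is unclear. The function names `upgrade_library` and `correct` are hypothetical.

```python
from typing import Dict


def upgrade_library(library: Dict[str, str], local_text: str, cloud_text: str) -> None:
    """Record tokens that the local engine got wrong, using the cloud result as reference."""
    local_tokens = local_text.split()
    cloud_tokens = cloud_text.split()
    if len(local_tokens) != len(cloud_tokens):
        return  # assumption: skip results that cannot be aligned position by position
    for wrong, right in zip(local_tokens, cloud_tokens):
        if wrong != right:
            library[wrong] = right


def correct(library: Dict[str, str], local_text: str) -> str:
    """Replace any token found in the library with its stored correction."""
    return " ".join(library.get(token, token) for token in local_text.split())


library: Dict[str, str] = {}
# Upgrade phase: the cloud result corrects the local engine's output.
upgrade_library(library, "he1lo wor1d", "hello world")
# Recognition phase: a later local result is corrected without calling the cloud.
corrected = correct(library, "he1lo there")
```

After the upgrade, the later recognition needs only a dictionary lookup, which is where the shorter recognition time described in the abstract would come from.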

Description

Technical field

[0001] The invention relates to an image recognition method and belongs to the technical field of OCR (Optical Character Recognition).

Background

[0002] OCR text recognition refers to the process in which electronic devices (such as scanners or digital cameras) examine characters printed on paper, determine their shapes by detecting dark and light patterns, and then use character recognition methods to translate the shapes into computer text. That is, for printed characters, optical methods convert the text in a paper document into a black-and-white dot matrix image file, and recognition software converts the text in the image into a text format that word processing software can further edit and process. In short, optical character recognition (OCR) is the process of converting images of printed text into machine-encoded text. It is widely used to convert data r...
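The conversion of dark and light patterns into a black-and-white dot matrix can be illustrated with simple thresholding. This is a minimal sketch under assumptions not stated in the source: the image is a grid of 8-bit grayscale values, the threshold of 128 is arbitrary, and the name `binarize` is hypothetical. Real OCR pipelines generally use more adaptive methods.

```python
from typing import List


def binarize(gray: List[List[int]], threshold: int = 128) -> List[List[int]]:
    """Map each grayscale pixel to 1 (dark, ink) or 0 (light, paper)."""
    return [[1 if pixel < threshold else 0 for pixel in row] for row in gray]


# A 3x3 patch with a dark vertical stroke in the middle column.
gray = [
    [250, 30, 245],
    [240, 20, 250],
    [235, 25, 240],
]
bitmap = binarize(gray)
```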


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/32, G06K9/72
CPC: G06V20/62, G06V10/768, G06V30/10
Inventor: 孙磊秦阳莫凌飞杜喆宁姚昕宇齐恒冯增涛
Owner: HANGZHOU CCRFID MICROELECTRONICS