Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Quick text recognition method

A text recognition and fast technology, applied in the field of optical character recognition, can solve the problems of low English character recognition rate and English character recognition error, and achieve the effect of reducing the error recognition rate, improving the recognition speed, and efficient version

Active Publication Date: 2010-06-23
HANVON CORP
View PDF1 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the system is used on an embedded system, the speed problem will become particularly prominent
[0010] (2) The recognition rate of English characters is lower than that of only calling the English OCR recognition engine
Using the Chinese OCR recognition engine to recognize the English region may recognize some English characters as Chinese characters with high confidence. In this case, the English OCR recognition engine is no longer called to recognize the English region, resulting in the final English character recognition error.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quick text recognition method
  • Quick text recognition method
  • Quick text recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047]The fast text recognition method of the present invention will be described in detail below with reference to the accompanying drawings and taking mixed texts in Chinese and English as an example. However, those skilled in the art should know that the present invention is not limited to Chinese and English mixed texts, but can also be used in oriental languages ​​such as Japanese, Korean and other western languages ​​such as Russian, French, German , Italian, etc. bilingual mixed text.

[0048] The invention provides a method for recognizing text images mixed in two languages. Among them, the two languages ​​are respectively called the first language and the second language; the mixed text image is an image containing several lines of mixed language lines; the content of the mixed language lines can be: only contain the first language characters, or only Contains characters from the second language, or a mixture of characters from the first and second languages. The fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a quick text recognition method, belonging to the OCR technical field. In the method, an OCR recognition engine is used for recognizing the mixing character images of two languages; firstly, a text is segregated into text lines; next, the text lines are arrayed according to the number of the characters of a first language or a second language included in each text line; then a Chinese OCR recognition engine is used for recognition to extract an English doubt region, and an English OCR recognition engine is used for recognition. If the current-line recognition result is an English line, the OCR recognition strategy at the next line is that: firstly, the English OCR recognition engine is used for recognition to extract a Chinese doubt region; then, the Chinese OCR recognition engine is used for recognition; and finally, the recognition results are mixed. The method improves the recognition speed, reduces the character misrecognition rate and provides the high-efficient version for an embedded device.

Description

technical field [0001] The present invention relates to the technical field of Optical Character Recognition (OCR, Optical Character Recognition), in particular to a fast text recognition method for mixed text image recognition in two languages. Background technique [0002] A practical optical recognition system usually needs to recognize at least two languages. Taking Chinese text recognition as an example, there are usually part or large segments of English characters mixed in (hereinafter referred to as: mixed text). [0003] Two options are usually adopted at present. [0004] Solution 1: Use a Chinese OCR recognition engine that includes an English character set to segment and recognize Chinese characters and English characters at the same time. [0005] However, due to the characteristics of different language types (for example: character cohesion, quantity, topology, etc.), the Chinese OCR recognition engine is not ideal for English recognition. In order to impro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/20
Inventor 万鑫刘正珍朱军民
Owner HANVON CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products