Unlock instant, AI-driven research and patent intelligence for your innovation.

Multilingual natural scene text detection and recognition oriented system and method

A text detection and natural scene technology, applied in character and pattern recognition, instruments, biological neural network models, etc., can solve the problems of not being able to meet the needs of multiple languages, not being able to expand other languages, and numerous processing steps, and achieve cost savings , improve accuracy, and reduce the effect of the labeling process

Pending Publication Date: 2020-02-07
HARBIN INST OF TECH
View PDF8 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] 1. It is only applicable to the detection and recognition of Chinese and English, which has limitations and cannot be extended to other languages, and cannot meet the needs of multiple languages ​​​​in actual natural scenes
[0008] 2. There are many steps for image processing, and the detection speed is slow
[0009] 3. Random forest method is used for text classification, but over-fitting will occur when classification is performed in natural scenes with high noise

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multilingual natural scene text detection and recognition oriented system and method
  • Multilingual natural scene text detection and recognition oriented system and method
  • Multilingual natural scene text detection and recognition oriented system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0086] The natural scene images processed in this embodiment are as figure 2 Shown in the figure are examples of natural scene images in multiple languages. The development platform of the recognition program is the Linux operating system CentOS7.2, and the GPU is two NVIDIA GeForce GTXTITAN X GPUs. The recognition program is written in python3.5 and uses the PyTorch0.4.1 framework.

[0087] The images in the collected natural scenes need to have the following characteristics:

[0088] (1) Color images with an image resolution above 96dpi;

[0089] (2) The image should contain the complete text area.

[0090] If the input natural scene image does not meet the above standards, the recognition rate may be reduced.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multilingual natural scene text detection and recognition system and method. The system comprises a text detection module, a character recognition module and a language judgment module, the text detection module is responsible for the detection function of a text area, the character recognition module is responsible for the recognition function of various characters, andthe language judgment module is responsible for the judgment function of characters and languages. The method comprises the steps of text detection, character recognition and language judgment. According to the method, detection and recognition of text images of various different languages can be realized, and the used network model can be expanded to a new language only by modifying a part. According to the method, an end-to-end method is adopted, so that the data labeling process in the middle step and frequent data input and output can be reduced, and the cost is greatly saved. According tothe method, the improved FPN algorithm is adopted, good robustness is achieved, the candidate boxes are predicted through the method that the space conversion layer is combined with the LNMS, and theaccuracy of the prediction boxes can be effectively improved.

Description

technical field [0001] The invention relates to a system and method capable of detecting and recognizing text in natural scenes, in particular to a system and method for detecting and recognizing multilingual printed text in natural scenes. Background technique [0002] OCR refers to processing photos scanned by various scanning devices to obtain text information contained in the photos. Strictly speaking, OCR refers to text recognition for scanned documents, and text recognition for natural scenes is called STR (Scene Text Recognition), most of which are pictures of door signs, traffic signs, and advertisements. [0003] The difficulty of STR text recognition is much greater than that of OCR text recognition. There are three reasons: First, the diversity of text forms in natural scenes. In natural scenes, the size, font, color, brightness, etc. of characters are uncertain. Text Lines may be horizontal, vertical, slanted, or distorted, and multiple languages ​​may be mixed,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/40G06K9/32G06K9/62G06N3/04
CPCG06V30/1478G06V10/30G06N3/045G06F18/24G06F18/214
Inventor 苏统华杨超杰王忠杰涂志莹徐晓飞
Owner HARBIN INST OF TECH