Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-language scene text detection and recognition method

A technology for text detection and recognition methods, applied in character recognition, character and pattern recognition, instruments, etc., can solve problems such as low efficiency, inability to apply multi-language scenarios, and failure to capture long-range dependencies, and achieve the effect of improving the effect.

Active Publication Date: 2019-09-24
UNIV OF SCI & TECH OF CHINA
View PDF15 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, many research methods only do text detection or text recognition alone. There are some methods that can perform text detection and recognition at the same time, but they are mainly for text in one language (for example, English or Chinese), that is, these methods cannot be applied. multilingual scene
[0003] Furthermore, these methods only use local operations such as convolutional neural networks and recurrent neural networks, which do not capture long-range dependencies
Furthermore, these methods generally use online hard case mining algorithms to reduce the false positive rate of the network, but their efficiency is low.
Finally, existing methods only use Connectionist Temporal Classification (CTC) or attention-based decoders to decode input sequences into text, making text recognition performance low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language scene text detection and recognition method
  • Multi-language scene text detection and recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0014] Embodiments of the present invention provide a multilingual scene text detection and recognition method, such as figure 1 As shown, it mainly includes:

[0015] 1. Process the input image through the text detector to obtain a series of text candidate boxes; it mainly includes two modules, namely feature selection and long-range dependency module and feature enhancement module. Through feature selection and long-range dependency extraction m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-language scene text detection and recognition method. The method comprises: processing an input image through a text detector to obtain a series of text candidate boxes; adaptively generating a one-dimensional weight vector and a two-dimensional weight matrix through a feature selection and long-range dependency extraction module so as to pay more attention to channels from which text information is extracted and areas containing texts, and acquiring global information by capturing long-range dependency; and by the operation of the feature enhancement module, enabling the network to have better distinguishing performance on the text / non-text, thereby reducing false alarms; finally, predicting a series of text candidate boxes by using a plurality of convolution respectively; and carrying out text recognition and text category recognition on the text candidate boxes subjected to threshold processing and zooming through a text recognizer and a text category recognizer to obtain text contents and text categories. The method has high text detection and recognition performance and is suitable for multi-language application scenarios.

Description

technical field [0001] The invention relates to the technical field of text detection and recognition, in particular to a multilingual scene text detection and recognition method. Background technique [0002] Scene text reading refers to the detection and recognition of all text contained in natural scene images. It has many applications in image retrieval, scene understanding, automatic driving and text translation. At present, many research methods only do text detection or text recognition alone. There are some methods that can perform text detection and recognition at the same time, but they are mainly for text in one language (for example, English or Chinese), that is, these methods cannot be applied. in multilingual scenarios. [0003] Furthermore, these methods only use local operations such as convolutional neural networks and recurrent neural networks, which do not capture long-range dependencies. Furthermore, these methods generally use online hard case mining a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/32G06K9/62
CPCG06V20/63G06V30/10G06F18/241G06F18/214
Inventor 张勇东周宇谢洪涛
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products