Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Multi-Oriented Text Detection Method in Natural Images Based on Linked Text Fields

A technology of connection detection and text detection, which is applied in the field of computer vision, can solve problems such as difficult to generate bounding boxes, does not contain spaces, and cannot provide visual information for dividing different words

Active Publication Date: 2019-10-08
HUAZHONG UNIV OF SCI & TECH
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although existing technologies have achieved great success in applying object detection methods to text detection, there are still several obvious deficiencies in object detection methods in locating text areas.
First, the aspect ratio of words or text strips is usually much larger than that of general objects, and it is difficult for previous methods to generate bounding boxes of this ratio; second, some non-Latin texts have no gaps between adjacent words. Contains spaces, such as Chinese characters
Existing technologies can only detect words, which are not applicable when detecting this kind of text, because this kind of text without spaces cannot provide visual information to divide different words
Third, in large natural scene pictures, the text may be in any direction, but most of the existing technologies can only detect horizontal text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Multi-Oriented Text Detection Method in Natural Images Based on Linked Text Fields
  • A Multi-Oriented Text Detection Method in Natural Images Based on Linked Text Fields
  • A Multi-Oriented Text Detection Method in Natural Images Based on Linked Text Fields

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not conflict with each other.

[0064] The following first explains and illustrates the technical terms of the present invention:

[0065] Convolutional Neural Network (CNN): A neural network that can be used for tasks such as image classification, regression, etc. The network usually consists of convolutional layers, downsampling layers and fully connected layers. The convolutional and downsampling layers are responsible for ex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for detecting multi-directional text in natural pictures based on connecting text fields. Text fields and connections are two key steps in the detection method, which are defined as follows: text fields refer to the division of many individual text fields on a picture Multi-directional bounding box regions, which surround a text strip or part of a word; concatenation refers to connecting adjacent fields, meaning that they belong to the same word or sentence. Text fields and connections are combined to detect equidistantly at multiple scales using a fully convolutional neural network trained end-to-end. The final detection result is obtained by first connecting multiple text fields to form new regions, and then combining these new regions. Compared with the existing technology, the detection method proposed by the present invention has achieved excellent results in terms of accuracy, speed, and model simplicity. It has high efficiency and strong robustness, can overcome complex picture backgrounds, and can also detect images in images. Long text in non-Latin scripts.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more particularly, relates to a multi-directional text detection method in natural pictures based on connected text fields. Background technique [0002] Reading text in natural images is a challenging and popular task with many practical applications in photo optical recognition, geolocation, and image retrieval. In text reading systems, text detection is to locate text regions with bounding boxes at the word level or text strip level, which is usually a very critical first step. In a sense, text detection can also be regarded as a special object detection, which takes words, characters or text strips as the detection target. [0003] Although existing techniques have achieved great success in applying object detection methods to text detection, object detection methods still have several significant deficiencies in locating text regions. First, the aspect ratio of words or text st...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06N3/08G06K9/20
CPCG06N3/08G06V10/225G06V30/10G06F18/214
Inventor 白翔石葆光
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products