Method and apparatus for text detection and positioning in natural scene based on focus loss function

A loss function and text detection technology, applied in the field of computer vision, can solve problems such as different picture quality, long text area, poor effect, etc., achieve high recall rate, improve accuracy, and good sensitivity

Active Publication Date: 2019-01-01
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF5 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The third is that the picture quality of the processing objects is different
However, there is a big difference between the text in the natural scene and the object in the object detection, mainly reflected in the fact that the text area may be longer, and the direct use of the object detection method is not targeted and the effect is not good
Therefore, it is still a big challenge to design a reasonable and efficient text detection method according to the characteristics of text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for text detection and positioning in natural scene based on focus loss function
  • Method and apparatus for text detection and positioning in natural scene based on focus loss function
  • Method and apparatus for text detection and positioning in natural scene based on focus loss function

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be described in further detail below through specific embodiments and accompanying drawings.

[0046] The text detection and positioning method in natural scenes based on the focus loss function of the present invention is mainly divided into a training phase (corresponding to the training module) and a testing phase (corresponding to the testing module).

[0047] The steps in the training phase are as follows:

[0048] 1) Preprocess the labeled data set to construct a text / background binary classification truth map and a five-dimensional truth map of the corresponding relationship between text pixels and their text boxes.

[0049] The label conversion of step 1) is as follows figure 1 As shown in , the pixels in the marked text box are marked as 1, and the background pixels are marked as 0, and the text / background binary classification truth map is constructed. If the text box is marked as any quadrilateral box, it needs to be uniformly exp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text detection and positioning method and a device under a natural scene based on a focus loss function. Firstly, the annotated data is preprocessed, then a text detection and location network is constructed, and then the focus loss function is used as a part of the training process loss function, and then the natural scene images to be detected are detected. The method adjusts the existing tagging to make the tagging more suitable for the designed character detection network. FCN-based network combine multiple convolution layers to make them more consistent with thattask of text detection; by introducing the focus loss function to balance the positive and negative samples in the training process, the detection accuracy is improved. The invention can obtain the effect of high precision and high recall on character detection and positioning.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a method and a device capable of accurately locating text regions in natural scene pictures. Background technique [0002] There are many ways for human beings to disseminate information. As the carrier of information dissemination, text itself directly contains rich semantic information. In natural scenes, text is ubiquitous. Whether it is shop signs, traffic signs, or even street advertisements, posters, etc., words are used to convey information. Accurately locating and recognizing text areas from natural scenes can help machines better understand the semantic content of scenes, and is helpful in many fields. For example, in the field of street view recognition, recognizing the text on building plaques will help us better understand street view information; in the field of assisted driving, recognizing the text on traffic signs will help us better assist ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32
CPCG06V10/25
Inventor 操晓春田晓玮伍蹈代朋纹
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products