Natural scene character detection method based on attention mechanism

A natural scene, text detection technology, applied in the field of computer analysis, can solve the problem of inability to return direction information, not considering the importance of different features, and text line sticking.

Pending Publication Date: 2022-03-25
XINJIANG NORMAL UNIVERSITY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The direction of curved text is not easy to regress: the scene text detection method based on box regression needs to regress the direction information when solving the direction of multi-directional scene text examples; however, for text examples of arbitrary shapes, such as curved shapes, the direction information cannot be make regression;
[0006] Glue between different text lines: The scene text detection method based on semantic segmentation has achieved good performance in solving the scene text problem of any shape and any direction; however, if different text lines are relatively close to each other, it is easy to Causes glue between lines of text;
[0007] Multi-level feature integration produces information redundancy: the semantic segmentation-based scene text detection method uses shallow and deep multi-level features when predicting text region information; however, text objects are mainly concentrated in deep features; in addition, feature integration In the process, the importance of different features is not considered,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Natural scene character detection method based on attention mechanism
  • Natural scene character detection method based on attention mechanism
  • Natural scene character detection method based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0056] In this embodiment, a comparative experiment is carried out to verify the technical effect of the scene text detection method described in Embodiment 1. The experimental environment and experimental results are as follows:

[0057] (1) Experimental environment

[0058] System environment: Ubuntu 16.04;

[0059] Hardware environment: GPU: GTX 1080Ti, memory: 512G.

[0060] (2) Experimental data set

[0061] Training data: First, use the 7200 training data of MLT2017 to pre-train the text center block model 4×10 5 times; then, on the Total-test (1255 training sets), the text center block model and the word stroke area model are fine-tuned 4 × 10 5 Second-rate.

[0062] Test data: Total-test (300 test sets).

[0063] (3) Evaluation method

[0064] Curved Shaped Text: A Pascal Evaluation Method.

[0065] In order to demonstrate the effectiveness of the present invention, four sets of experiments were set up using the same training set to train the model, and were eva...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a natural scene character detection method based on an attention mechanism, and the method comprises the steps: designing a convolutional neural network model for extracting a text target according to the feature information of a text center block and a stroke region, and training the model through employing the text center block and stroke information as supervision data; in a model test stage, respectively inputting a test image into the text center block model and the stroke model to obtain a probability graph of a text center block and a word stroke area; a final text area is obtained through reasoning and marked, and a scene image text detection task is completed; according to the method, the problems that in the prior art, in scene character detection, direction information of a curved text is not prone to regression, adhesion between adjacent different text lines and information redundancy generated by multi-level feature integration occur are solved.

Description

technical field [0001] The invention relates to the field of computer analysis, in particular to a method for detecting characters in natural scenes based on an attention mechanism. Background technique [0002] As the carrier of human knowledge and information, text widely exists in real daily life scenes. Extracting the text information embedded in the scene image is very valuable and beneficial in many applications based on image content information. The text extraction technology in the scene image is used in blind navigation, blind reading, image retrieval and labeling, human-computer interaction, It has broad application prospects in unmanned driving and other scenarios. Scene text detection is to determine the specific position of the text in the image, and text recognition is to recognize the text information in the bounding box into scale text. Scene text detection plays an important role in extracting and understanding text information in scene images, and its per...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V20/62G06V10/40G06V10/774G06V10/764G06V10/80G06V10/82G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/044G06N3/045G06F18/253G06F18/24G06F18/214
Inventor 刘占东张海军
Owner XINJIANG NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products