Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semantic enhanced scene text recognition method and device

A text recognition and enhancement technology, applied in the computer field, can solve the problems of high complexity and low accuracy of scene text recognition, and achieve the effect of high accuracy and enhanced correlation

Pending Publication Date: 2021-11-02
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a semantically enhanced scene text recognition method and device to solve the defects of low accuracy and high complexity of scene text recognition in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic enhanced scene text recognition method and device
  • Semantic enhanced scene text recognition method and device
  • Semantic enhanced scene text recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] To make the objectives, technical solutions, and advantages of the present invention clearer, the following will be incorporated in the present invention, the accompanying drawings, the present invention is the technical solution will be clearly and completely described, obviously, the described embodiments are part of the embodiments of the present invention rather than all embodiments. Based on the embodiments in the present invention, all other embodiments obtained without creative labor are not made in the premise of creative labor.

[0047] Scene mainly conventional text recognition using a convolutional neural network to extract information or a visual cycle neural network context information extracted text recognition, however, the encoder in the method of extraction of a single feature, will cause loss of information. For text morphology varied, complex background character image data, attention additive point mechanism or the conventional algorithm employed by the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a semantic enhanced scene text recognition method and device, and the method comprises the steps of extracting a visual feature map and a context feature sequence of a scene text image through an encoder of a scene text recognition model; and determining enhanced feature expression based on the visual feature map, the context feature sequence and the position code of the feature map; obtaining global visual information and semantic information of a scene text image; adopting a specially designed recurrent neural network unit for decoding by a decoder, wherein the unit can balance independence and correlation of context information; and performing multi-head attention operation on the implicit state vector and the expanded enhanced feature expression to obtain a local apparent feature vector. The local apparent feature vector and the hidden layer output of the recurrent neural network unit jointly participate in character prediction at the current moment, so that the correlation between semantic information and visual information is enhanced. The multi-head attention mechanism design can capture saliency information and auxiliary information of the features, so that the accuracy of a scene text recognition result is relatively high.

Description

Technical field [0001] The present invention relates to the field of computer technology, and more particularly to a semantic enhanced scene text identification method and apparatus. Background technique [0002] Compared with the traditional optical character identification (OCR) of high quality document images, natural scene text recognition can be applied in a broader field, such as photo analysis, license plate identification, picture advertising filtering, scene understanding, commodity identification, street view, bill Identification, etc. Due to the complexity of the text text, it is more difficult to identify, the main difficulties include: picture background is extremely rich, often facing low-brightness, low contrast, illumination, perspective deformation and incident shielding, etc.; text layout There may be problems such as distortion, pleats, commutation; the text may also be diverse, the word is different color. [0003] Conventional scene text recognition mainly us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06N3/04
CPCG06N3/044
Inventor 崔萌萌王威王亮
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products