Character recognition method and system based on attention mechanism
A text recognition, attention technology, applied in character recognition, character and pattern recognition, computer parts and other directions, can solve the problem of forming noise area, limited attention area, attention drift and so on
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0070] like figure 1 As shown, a text recognition method based on attention mechanism includes the following steps:
[0071] S1: Build a character recognition model for recognizing characters in images; the character recognition model consists of the following modules:
[0072] Convolutional neural network for extracting feature maps of input images;
[0073] An attention mechanism module, including a sequence encoder, a forward sequence decoder and a reverse sequence decoder, is used to encode and decode the feature map, and output the feature vector of the predicted character;
[0074] The character decoding layer is used to compile the feature vector of the predicted character into a text recognition result, and at the same time compile the feature map into a feature map character probability vector;
[0075] S2: build a training sample set, the training sample set includes a training image and an image label corresponding to the training image, wherein the image label is...
Embodiment 2
[0084] like figure 1 As shown, a text recognition method based on attention mechanism includes the following steps:
[0085] S1: Construct a character recognition model for recognizing characters in images; the character recognition model consists of a convolutional neural network, an attention mechanism module, and a character decoding layer, wherein the attention mechanism module includes a sequence encoder, a regular Towards sequence decoder and reverse sequence decoder.
[0086] In the S1 step, the convolutional neural network includes a multi-layer convolutional filter bank and a pooling sub-module, the convolutional filter bank adopts a residual structure, and the character decoding layer is fully connected by a multi-layer neural network layer structure, wherein the multi-layer convolutional filter bank extracts image features, the pooling sub-module changes the resolution of the feature map, and the output of the convolutional neural network is a feature map with a ce...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com