Text Recognition Method Fused with Multi-layer Feature Enhanced Attention Mechanism
A text recognition and feature enhancement technique applied in the field of optical character recognition. It addresses problems such as inconsistent results, complexity, and slow recognition speed, and it increases relevance and performance.
Embodiment
[0082] For convenience of description, the relevant technical terms that appear in this embodiment are explained first:
[0083] reshape: converts a matrix to a new shape without changing its elements;
[0084] LSTM (Long short-term memory): long short-term memory, a special kind of recurrent neural network;
[0085] CTCLoss (Connectionist Temporal Classification loss): a loss function that aligns the output sequence with the label sequence in text recognition;
[0086] argmax: a function that returns the argument (or set of arguments) at which a function attains its maximum value;
[0087] softmax: a mapping function that maps the outputs of multiple neurons into the interval (0, 1) so that they sum to 1;
[0088] synthtext: a synthetic dataset for text recognition;
[0089] mjsynth: a synthetic dataset for text recognition;
[0090] ICDAR2013: a public real-scene text recognition dataset;
[0091] ICDAR2015: a public real-scene text recognition dataset;
[0092] IIIT: a public real-scene text recognition dataset;
[0093] SVT: a public real-scene text recognition dataset.
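The terms reshape, softmax, and argmax defined above can be illustrated with a minimal sketch in plain Python. The final helper, `ctc_greedy_decode`, is an assumption added for illustration: it shows the greedy decoding rule commonly paired with a CTC-trained model (take the best class per time step, merge repeats, drop blanks), and the text above does not confirm that the embodiment decodes this way. The alphabet, the scores, and all function names are hypothetical.

```python
import math


def reshape(flat, rows, cols):
    """Rearrange a flat list into rows x cols without changing element order."""
    if len(flat) != rows * cols:
        raise ValueError("element count must equal rows * cols")
    return [flat[r * cols:(r + 1) * cols] for r in range(rows)]


def softmax(scores):
    """Map raw scores to values in (0, 1) that sum to 1."""
    m = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def argmax(values):
    """Return the index of the largest value (the first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def ctc_greedy_decode(step_scores, alphabet, blank=0):
    """Assumed CTC-style greedy decoding: best class per step,
    merge consecutive repeats, then drop the blank symbol."""
    best = [argmax(softmax(s)) for s in step_scores]
    chars = []
    prev = None
    for idx in best:
        if idx != prev and idx != blank:
            chars.append(alphabet[idx])
        prev = idx
    return "".join(chars)


# Hypothetical data: 5 time steps, 3 classes (index 0 is the blank symbol).
ALPHABET = ["-", "a", "b"]
flat_scores = [
    0.1, 2.0, 0.3,
    0.2, 1.5, 0.1,
    3.0, 0.1, 0.2,
    0.1, 0.2, 2.5,
    0.0, 0.1, 1.9,
]
step_scores = reshape(flat_scores, 5, 3)  # one row of class scores per time step
decoded = ctc_greedy_decode(step_scores, ALPHABET)
print(decoded)
```

Because softmax preserves the ordering of the scores, applying argmax to the raw scores would select the same classes; the softmax call is kept only to show how the two functions relate.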
[0094] see Figure ...