Coding and decoding-based mathematical formula identification method and device, and readable storage medium
A technology of mathematical formulas and recognition methods, applied in the field of image recognition, to achieve the effect of enhancing feature extraction, simplifying network structure, and enhancing the ability to distinguish
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0039] A kind of mathematical formula recognition method based on encoding and decoding described in the present invention, such as figure 1 As shown, it specifically includes the following steps:
[0040] Step 1. Image preprocessing: crop and adjust the size of the picture containing only handwritten mathematical formulas. The picture only contains all the formula parts and as few blank areas as possible, and the size is cut to 256*256 pixels;
[0041] Step 2. Image feature encoding: pass the processed image through the encoding network fused by the improved ResNet convolutional network and the position encoding module to obtain the input of the decoding network.
[0042] The traditional convolutional neural network will face the problem of gradient disappearance / gradient explosion after the depth of the network is deepened. Therefore, ResNet introduces a residual network structure, that is, a feed-forward shortcut connection is introduced between the input and output, so tha...
Embodiment 2
[0070] A device applied to the codec-based mathematical formula recognition method, the device comprising:
[0071] The image processing module is used to crop and grayscale the pictures containing only formulas;
[0072] The feature encoding module connected with the module is used to complete the extraction of image feature information, and by position encoding, calculate and add position information;
[0073] The feature decoding module connected with the module is used for image feature sequence decoding and character prediction. The encoding network is connected by sub-networks, each sub-network contains a multi-head self-attention network and a forward network; the decoding network is used to calculate the positional relationship of the feature sequence and output a predictive sequence, using the L-softmax function for prediction The relationship between sequences imposes stronger constraints, and through these predicted sequences, the best character path is selected, a...
Embodiment 3
[0076] Based on the off-line handwritten mathematical formula recognition method based on the encoding and decoding model in the first embodiment, the present invention also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor , to realize the steps of the above method.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com