Ticket recognition training sample synthesis method and computer storage medium
A training sample synthesis method, applied in the field of text recognition, that addresses the uncontrollable number of real samples and the unbalanced character coverage they provide.
Examples
Embodiment 1
[0059] Referring to Figure 1, this embodiment proposes a method for synthesizing training samples for ticket recognition. The method can be applied to training text recognition models for various tickets, such as real estate certificates and land certificates, and addresses the problems that arise when models are trained on real samples, such as an uncontrollable number of samples and unbalanced character coverage. The method is described in detail below.
[0060] Step S100: perform character sampling from the corpus according to preset rules to obtain a character sampling set, read characters from the character sampling set to generate sample character strings of a predetermined length, and form the plurality of sample character strings into a sample character string set.
[0061] In this embodiment, in order to replace real ticket samples with artificially synthesized training samples, the text characters needed for the training samples to be synthesized are obtained first. Among them, the ...
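Step S100 can be illustrated with a minimal Python sketch. The excerpt does not state what the preset rules are, so the rule used here (every distinct non-whitespace character of the corpus appears the same number of times in the sampling set) is an assumption chosen only to show how sampling can balance character coverage. The function names `build_sampling_set` and `make_sample_strings` and the parameters `repeats` and `length` are hypothetical and do not come from the patent.

```python
import random
from typing import List


def build_sampling_set(corpus: str, repeats: int, rng: random.Random) -> List[str]:
    """Build a character sampling set from the corpus.

    Assumed rule (not from the patent): each distinct non-whitespace
    character is included exactly `repeats` times, then the set is shuffled.
    """
    charset = sorted({ch for ch in corpus if not ch.isspace()})
    sampling_set = [ch for ch in charset for _ in range(repeats)]
    rng.shuffle(sampling_set)
    return sampling_set


def make_sample_strings(sampling_set: List[str], length: int) -> List[str]:
    """Read characters in order and cut them into strings of a fixed length.

    Leftover characters that do not fill a whole string are dropped.
    """
    usable = len(sampling_set) - len(sampling_set) % length
    return ["".join(sampling_set[i:i + length]) for i in range(0, usable, length)]


# Example: a toy corpus, four copies of each character, strings of length 5.
rng = random.Random(0)
sampling_set = build_sampling_set("real estate certificate", repeats=4, rng=rng)
sample_strings = make_sample_strings(sampling_set, length=5)
```

Because the number of copies per character and the string length are both parameters, the size of the resulting sample string set is under the caller's control, which is the property the method is aiming for.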
Embodiment 2
[0113] Referring to Figure 8, and based on the ticket recognition training sample synthesis method of the above embodiment, this embodiment proposes a ticket recognition training sample synthesis device 10, comprising:
[0114] The sample character string acquisition module 100 is configured to perform character sampling from the corpus according to preset rules to obtain a character sampling set, read characters from the character sampling set to generate sample character strings of a predetermined length, and form the plurality of sample character strings into a sample character string set.
[0115] The foreground text mask image generating module 200 is configured to perform text mask preprocessing on each sample character string and generate a corresponding foreground text mask image.
[0116] The secondary image fusion module 300 is configured to perform secondary image fusion on each foreground text mask image and the corresponding selected ticket background image to obtain a synthetic training sample se...
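The division of device 10 into three modules can be sketched as a small Python pipeline. This is only a structural illustration: the excerpt does not specify how text mask preprocessing, background selection, or secondary image fusion are carried out, so each module is passed in as a callable. The class name `SynthesisDevice`, its field names, and the separate `select_background` step are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Generic, List, TypeVar

Mask = TypeVar("Mask")    # foreground text mask image type
Image = TypeVar("Image")  # background / synthesized image type


@dataclass
class SynthesisDevice(Generic[Mask, Image]):
    """Hypothetical wiring of modules 100, 200 and 300 of device 10."""

    acquire_strings: Callable[[], List[str]]    # stands in for module 100
    make_mask: Callable[[str], Mask]            # stands in for module 200
    select_background: Callable[[str], Image]   # assumed helper, not a named module
    fuse: Callable[[Mask, Image], Image]        # stands in for module 300

    def run(self) -> List[Image]:
        """Produce one synthesized sample per sample character string."""
        samples: List[Image] = []
        for text in self.acquire_strings():
            mask = self.make_mask(text)
            background = self.select_background(text)
            samples.append(self.fuse(mask, background))
        return samples
```

Keeping the modules as injected callables means a real mask renderer or fusion routine can replace the stand-ins without changing the pipeline itself.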