OCR training sample generation method, device and system
A technique in the field of computer vision for generating OCR training samples within designated regions of an image. It addresses problems such as expensive and time-consuming labeling work, large differences between OCR image features and text features, and the inability to obtain template images, and it eliminates the need for answer labeling work.
Embodiment
[0101] The invention relates to a method for generating OCR training samples. The method adaptively generates a large number of high-quality OCR samples, addressing the shortage of OCR training samples.
[0102] Suppose Figure 1 is the original image and a large number of samples with this type of layout are to be generated. For convenience of display, each added text box serves both as the "erasing area", where text is to be erased, and as the "generation area", where text is to be generated (in general, the erasing area and the generation area do not need to overlap).
[0103] In the first step, the image enters the text contour extraction module, which extracts all text contours that fall within the "erasing area" of the picture, as shown in Figure 2.
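The contour extraction step can be illustrated with a minimal sketch. The excerpt does not say how the module finds text pixels, so this example assumes dark text on a light grayscale background and uses a plain intensity threshold; the function name `extract_text_mask`, the `box` layout and the `threshold` value are all hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]          # grayscale rows, 0 (black) .. 255 (white)
Box = Tuple[int, int, int, int]  # (top, left, bottom, right); bottom/right exclusive


def extract_text_mask(image: Image, box: Box, threshold: int = 128) -> Image:
    """Return a mask the size of `image`: 255 where a pixel inside `box`
    is darker than `threshold` (assumed to be text), 0 everywhere else.

    Assumption: simple thresholding stands in for the patent's contour
    extraction module, whose actual algorithm is not given in the excerpt.
    """
    top, left, bottom, right = box
    height, width = len(image), len(image[0])
    mask = [[0] * width for _ in range(height)]
    # Only pixels inside the erasing area are considered, clipped to the image.
    for y in range(max(top, 0), min(bottom, height)):
        for x in range(max(left, 0), min(right, width)):
            if image[y][x] < threshold:
                mask[y][x] = 255
    return mask


# Tiny 4x6 image: light background (200) with a few dark "text" pixels (20).
sample = [
    [200, 200, 200, 200, 200, 20],
    [200, 20, 20, 200, 200, 200],
    [200, 200, 20, 200, 200, 200],
    [200, 200, 200, 200, 200, 200],
]
mask = extract_text_mask(sample, box=(1, 1, 3, 4))
```

The dark pixel in the top-right corner lies outside the erasing area, so it stays unmarked in the mask; only text inside the box is selected for erasure.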
[0104] In the second step, the image enters the image repair module. The text contours (the white part in Figure 2) are treated as the damaged part of the original image, and the image is repaired and filled according to the pixel information around the da...
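The idea of repairing a damaged region from its surrounding pixels can be sketched as follows. This is a naive neighbor-averaging fill chosen for illustration, not the repair algorithm of the patent, which the excerpt does not specify; the function name `fill_from_neighbors` and the list-of-lists image representation are assumptions.

```python
from typing import List

Image = List[List[int]]  # grayscale rows, 0 (black) .. 255 (white)


def fill_from_neighbors(image: Image, mask: Image) -> Image:
    """Fill pixels where mask != 0 with the mean of their already-known
    4-neighbors, working inward from the border of the damaged region.

    Assumption: a stand-in for a real inpainting method; it only
    propagates nearby intensities and does not reconstruct texture.
    """
    height, width = len(image), len(image[0])
    result = [row[:] for row in image]  # leave the input untouched
    unknown = {(y, x) for y in range(height) for x in range(width) if mask[y][x]}
    while unknown:
        updates = {}
        for y, x in unknown:
            known = [
                result[ny][nx]
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1))
                if 0 <= ny < height and 0 <= nx < width and (ny, nx) not in unknown
            ]
            if known:
                updates[(y, x)] = round(sum(known) / len(known))
        if not updates:  # everything is masked, so there is nothing to copy from
            break
        for (y, x), value in updates.items():
            result[y][x] = value
        unknown.difference_update(updates)
    return result


# Light background (200) with two dark text pixels (20) marked as damaged.
damaged = [
    [200, 200, 200, 200],
    [200, 20, 20, 200],
    [200, 200, 200, 200],
]
damage_mask = [
    [0, 0, 0, 0],
    [0, 255, 255, 0],
    [0, 0, 0, 0],
]
repaired = fill_from_neighbors(damaged, damage_mask)
```

On a uniform background the masked text pixels take on the background value, which is the behavior wanted before new text is drawn into the generation area.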