Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

OCR training sample generation method, device and system

A technology for training samples and generating regions, applied in the field of computer vision, can solve problems such as expensive and time-consuming labeling work, large differences between OCR image features and text features, and inability to obtain template images, so as to eliminate the need for answer labeling work.

Pending Publication Date: 2022-04-29
BEIJING YIDAO BOSHI TECH
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method has two main defects: one is that we cannot obtain the template image in most cases; the other is that the template image is often a clean and undistorted ideal image, which is quite different from the real image mixed with various noises. Sample authenticity is poor
This type of method has four main defects: The first defect is that the generalization ability of the model is poor. Since the OCR image features and text features of different formats are very different, for image formats or text styles that have not appeared in the model training process, The model often performs poorly; the second defect is that it is difficult to obtain training data. Due to the limited generalization ability of the model, if you want to train an image editing model for a certain format, you first need a large amount of training data similar to the format, and the current scene is originally It is because of the lack of data that image editing technology is needed; the third defect is that the samples for training image editing models need pixel-level annotation, which is very expensive and time-consuming; the fourth defect is that the model reasoning speed is relatively slow, and the operation often A deep learning reasoning framework is required, which greatly increases engineering complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • OCR training sample generation method, device and system
  • OCR training sample generation method, device and system
  • OCR training sample generation method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0101] The invention relates to a method for generating OCR training samples. This method can adaptively generate a large number of high-quality OCR samples to solve the problem of lack of OCR training samples.

[0102] If you want to figure 1 For the original image, generate a large number of samples of this type of layout. For the convenience of display, the added text box is not only the "erasing area" where we want to erase the text, but also the "generated area" where we want to generate text (the erasing area and the generated area do not need to overlap).

[0103] The first step is to enter the text contour extraction module to extract all text contours in the "erased area" on the entire picture, such as figure 2 shown.

[0104] The second step is to enter the image repair module, and the text outline ( figure 2 The white part in ) is used as the damaged part of the original image, and the image is repaired and filled according to the pixel information around the da...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an OCR training sample generation method, device and system, and relates to the field of computer vision. The method comprises the following steps: a character contour extraction step: extracting all character contours based on an original image, determining an erasure area mask by combining an erasure area coordinate, and obtaining a repair area mask; an image repairing and filling step: performing image repairing and filling according to the mask of the repairing area and the pixel information around the repairing area to obtain a background template after the characters are erased; and a random text generation step: generating a random text in each generation area so as to obtain a new sample picture and a labeling information file corresponding to the new sample picture. According to the method, a character contour extraction algorithm, an image restoration technology and other technologies are combined, background information of an original picture is fully utilized, a high-quality training picture is generated, meanwhile, an annotation file (including character content and position information) corresponding to the picture is generated, the burdensome and labor-consuming annotation work is omitted, and the method can be directly used for OCR model training.

Description

technical field [0001] The present invention relates to the field of computer vision, in particular to a method, device and system for generating OCR training samples. Background technique [0002] OCR (Optical Character Recognition, Optical Character Recognition) tasks widely exist in life and business scenarios. The current best method for this task is to use deep learning technology for text positioning and recognition. However, in real scenarios, there is often a shortage of training samples for a certain format, and deep learning relies on huge training samples. Therefore, a method for automatically generating OCR training samples emerges at the historic moment. [0003] The current methods for generating OCR samples are roughly divided into two categories: [0004] The first category is template-based methods. That is, input a clean template picture, and then write random text in the specified position of the template to generate an OCR sample of the template style...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06V30/40G06V30/148G06T5/30G06T7/12G06T7/90G06K9/62
CPCG06T5/30G06T7/12G06T7/90G06F18/214
Inventor 沈达伟王勇朱军民康铁钢
Owner BEIJING YIDAO BOSHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products