Sample set construction method and device, computer equipment and storage medium

A sample set construction method and device, computer equipment and a storage medium, applied in the field of computer technology. The technique addresses the unbalanced distribution of certificate pictures used as sample images, the resulting inaccuracy of the character recognition model's parameters, and the difficulty of obtaining real certificate pictures. It aims to improve sample diversity, sample balance, and recognition accuracy.

Pending Publication Date: 2019-07-26
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

However, training a character recognition model requires a very large number of pictures, and it is usually impossible to obtain that many real ID pictures.
[0003] At present, most of the ID pictures used to train character recognition models are electronic ID pictures batch-generated from templates. Although pictures obtained in this way are relatively clear, they have little randomness, so the ID pictures used as sample images are unevenly distributed.
When a large number of unbalanced certificate pictures are used directly to train a character recognition model, the resulting model parameters are not accurate enough, and when the trained model is used to recognize certificate information, the recognition results are not accurate enough either.
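
The imbalance described above can be made measurable before training. The following is a minimal sketch, assuming that "unbalanced" refers to how often each character appears across the label texts of the sample pictures; the source does not define the term precisely, and the function names `character_counts` and `imbalance_ratio` are hypothetical.

```python
from collections import Counter


def character_counts(labels):
    """Count how often each character occurs across all label texts."""
    counts = Counter()
    for text in labels:
        counts.update(text)
    return counts


def imbalance_ratio(labels, alphabet):
    """Ratio of the most frequent to the least frequent character in `alphabet`.

    Assumption: a ratio near 1.0 is treated as balanced; a character that
    never appears yields infinity.
    """
    counts = character_counts(labels)
    freqs = [counts.get(ch, 0) for ch in alphabet]
    least = min(freqs)
    return float("inf") if least == 0 else max(freqs) / least
```

A check of this kind could be run on the label texts of a template-generated batch to see whether some characters are rare or missing before the batch is used for training.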



Examples


Embodiment Construction

[0048] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0049] The sample set construction method provided by this application can be applied in the application environment shown in Figure 1, in which the terminal 102 communicates with the server 104 through a network. The server 104 can obtain, through the network, a certificate template picture that is generated from a certificate picture and does not include certificate information; generate multiple sets of virtual certificate information according to the styles of the various kinds of certificate information in the certificate picture; The location of the information is written i...



Abstract

The invention relates to a sample set construction method and device, computer equipment and a storage medium, and concerns sample generation for training a model. The sample set construction method comprises the steps of: obtaining a certificate template picture that is generated from a certificate picture and does not include certificate information; generating multiple sets of virtual certificate information according to the styles of the various kinds of certificate information in the certificate picture; writing the virtual certificate information into the certificate template picture at the position of each kind of certificate information in the certificate picture, to generate an electronic certificate picture; capturing images of the physical certificate corresponding to the electronic certificate picture to obtain a captured certificate picture; and constructing a picture sample set from the electronic certificate picture and the captured certificate picture, the picture sample set being used to train a character recognition model. The method can improve the balance of the samples used to train the character recognition model.
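
The steps in the abstract can be sketched in a few lines of Python. This is an illustrative outline under stated assumptions, not the applicant's implementation: the field names, coordinates, value styles, and the functions `generate_virtual_info`, `write_into_template`, and `build_sample_set` are all hypothetical, pictures are represented as lists of text placements rather than real images, and the capture of the physical certificate is only a placeholder record, since it is a camera step that code cannot reproduce.

```python
import random
from dataclasses import dataclass

# Hypothetical layout: where each kind of certificate information sits
# in the certificate picture, as (x, y) pixel coordinates.
FIELD_POSITIONS = {
    "name": (120, 60),
    "birth_date": (120, 130),
    "id_number": (120, 200),
}


@dataclass(frozen=True)
class Sample:
    kind: str          # "electronic" or "captured"
    placements: tuple  # ((field, (x, y), text), ...)


def generate_virtual_info(rng):
    """Generate one set of virtual information, following each field's style."""
    surname = rng.choice(["Zhang", "Li", "Wang", "Zhao"])
    given = rng.choice(["Wei", "Fang", "Min", "Jie"])
    year, month, day = rng.randint(1950, 2005), rng.randint(1, 12), rng.randint(1, 28)
    return {
        "name": f"{surname} {given}",
        "birth_date": f"{year:04d}-{month:02d}-{day:02d}",
        "id_number": "".join(rng.choice("0123456789") for _ in range(18)),
    }


def write_into_template(info, positions):
    """Place each value at the position its field occupies in the original picture."""
    return tuple((field, positions[field], info[field]) for field in sorted(positions))


def build_sample_set(n, seed=0):
    """Build a sample set containing electronic and captured pictures."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        placements = write_into_template(generate_virtual_info(rng), FIELD_POSITIONS)
        samples.append(Sample("electronic", placements))
        # Placeholder for photographing the corresponding physical certificate.
        samples.append(Sample("captured", placements))
    return samples
```

Calling `build_sample_set(3)` would yield six records, three electronic and three captured, each carrying the same label text as its counterpart so that both can serve as training samples for the same ground truth.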

Description

Technical field

[0001] The present application relates to the field of computer technology, in particular to a sample set construction method, device, computer equipment and storage medium.

Background art

[0002] In the technical field of automatic identification of document information, a large number of document pictures are required to train the character recognition model, which can improve the accuracy of the character recognition model in identifying document information. However, the number of pictures required for training character recognition models is very large, and it is usually impossible to obtain a large number of real document pictures.

[0003] At present, most of the ID images used in training character recognition models are obtained by batch-generating electronic ID images from templates. Although the electronic ID images obtained in this way are relatively clear, their randomness is not strong, which will cause ID images to exist as sample images...


Application Information

IPC(8): G06K9/34, G06K9/62
CPC: G06V30/153, G06F18/214
Inventor: 高梁梁
Owner: PING AN TECH (SHENZHEN) CO LTD