Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Document key information extraction method based on zero sample learning

A key information and sample learning technology, applied in the field of computer vision, can solve the problems of increased parameter capacity, large amount of data, insufficient resource time, etc., to achieve the effect of reducing resources and time, improving prediction speed, and strong generalization ability.

Pending Publication Date: 2021-06-15
BEIJING YIDAO BOSHI TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, no matter how finely the template is designed, it is difficult to fully consider the problems that may arise in reality
Due to the huge amount of data, and the accuracy of subsequent training and model prediction has great requirements on the high precision of sample labeling, this process is time-consuming and laborious.
Third, training a large number of samples requires a lot of resources and time, but in reality, there may be insufficient resources or high time requirements, so that it is impossible to obtain a higher-precision model
Fourth, in order to solve complex tasks and obtain higher accuracy, the complexity of the model will become higher and higher, the capacity of parameters will increase accordingly, and the training time and prediction time will also increase
This makes it difficult to directly apply it to actual production scenarios even if a higher-precision model is trained

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document key information extraction method based on zero sample learning
  • Document key information extraction method based on zero sample learning
  • Document key information extraction method based on zero sample learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0069] Before preprocessing, a sample is selected as a meta-sample, which is the benchmark for subsequent similarity measurement. Since we are dealing with samples of fixed templates, we can randomly select one of the labeled samples of the same type of template as the meta-sample of this type of template. Such as Figures 1 to 2 As shown, it specifically includes the following steps:

[0070] Step 1: Input Preprocessing

[0071] This step performs preprocessing operations on the input, which includes images, text block boxes, and text.

[0072] For the input image, the most important thing is to normalize the size of the aspect ratio and fill the boundary with 0, so that the size of the image can support the convolution and downsampling operations required by the neural network in the encoding module, and maximize the Both global and local feature information are preserved. During training, the image preprocessing stage also needs to complete the necessary data enhancemen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a document key information extraction method based on zero sample learning, and belongs to the field of computer vision. The method comprises the following steps: randomly selecting a element sample; respectively carrying out feature coding on the image corresponding to each text block in the document image, the text block content and the coordinate value of the text block box; fusing the features after feature coding to obtain a plurality of first fusion coding features; performing feature coding and fusion on the plurality of text blocks in the meta sample to obtain a plurality of second fusion coding features; and similarity comparison: selecting the entity category of the text block corresponding to the second fusion coding feature with the highest similarity as the entity category of the text block in the document image. According to the technical scheme of the invention, the method greatly reduces the energy required for collecting and labeling data, greatly reduces the size of the model, and greatly reduces the time required for training and prediction, so that the method can be better applied to an actual production scene.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to a method for extracting document key information based on zero-sample learning. Background technique [0002] The information extraction process is the process of automatically extracting and converting unstructured information embedded in documents into structured data. A traditional key information extraction method is based on template matching, which has obvious limitations. First, depending on the complexity of the document layout, it takes a lot of energy and time to build a high-precision template. In addition, manpower needs to be invested in the continuous maintenance of all customized templates in the later stage to ensure that the functions of existing templates will not be affected when new templates are added. Second, no matter how finely the template is designed, it is difficult to fully consider the problems that may arise in real situations. For example, the docu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/20G06K9/62G06N3/04G06N3/08G06T3/40
CPCG06T3/4038G06N3/08G06V10/22G06V10/751G06N3/045G06F18/253Y02D10/00
Inventor 宋佳奇朱军民王勇
Owner BEIJING YIDAO BOSHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products