Document key information extraction method based on zero sample learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A key information and sample learning technology, applied in the field of computer vision, can solve the problems of increased parameter capacity, large amount of data, insufficient resource time, etc., to achieve the effect of reducing resources and time, improving prediction speed, and strong generalization ability.

Pending Publication Date: 2021-06-15

BEIJING YIDAO BOSHI TECH

View PDF0 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Second, no matter how finely the template is designed, it is difficult to fully consider the problems that may arise in reality

Due to the huge amount of data, and the accuracy of subsequent training and model prediction has great requirements on the high precision of sample labeling, this process is time-consuming and laborious.

Third, training a large number of samples requires a lot of resources and time, but in reality, there may be insufficient resources or high time requirements, so that it is impossible to obtain a higher-precision model

Fourth, in order to solve complex tasks and obtain higher accuracy, the complexity of the model will become higher and higher, the capacity of parameters will increase accordingly, and the training time and prediction time will also increase

This makes it difficult to directly apply it to actual production scenarios even if a higher-precision model is trained

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0069] Before preprocessing, a sample is selected as a meta-sample, which is the benchmark for subsequent similarity measurement. Since we are dealing with samples of fixed templates, we can randomly select one of the labeled samples of the same type of template as the meta-sample of this type of template. Such as Figures 1 to 2 As shown, it specifically includes the following steps:

[0070] Step 1: Input Preprocessing

[0071] This step performs preprocessing operations on the input, which includes images, text block boxes, and text.

[0072] For the input image, the most important thing is to normalize the size of the aspect ratio and fill the boundary with 0, so that the size of the image can support the convolution and downsampling operations required by the neural network in the encoding module, and maximize the Both global and local feature information are preserved. During training, the image preprocessing stage also needs to complete the necessary data enhancemen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a document key information extraction method based on zero sample learning, and belongs to the field of computer vision. The method comprises the following steps: randomly selecting a element sample; respectively carrying out feature coding on the image corresponding to each text block in the document image, the text block content and the coordinate value of the text block box; fusing the features after feature coding to obtain a plurality of first fusion coding features; performing feature coding and fusion on the plurality of text blocks in the meta sample to obtain a plurality of second fusion coding features; and similarity comparison: selecting the entity category of the text block corresponding to the second fusion coding feature with the highest similarity as the entity category of the text block in the document image. According to the technical scheme of the invention, the method greatly reduces the energy required for collecting and labeling data, greatly reduces the size of the model, and greatly reduces the time required for training and prediction, so that the method can be better applied to an actual production scene.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to a method for extracting document key information based on zero-sample learning. Background technique [0002] The information extraction process is the process of automatically extracting and converting unstructured information embedded in documents into structured data. A traditional key information extraction method is based on template matching, which has obvious limitations. First, depending on the complexity of the document layout, it takes a lot of energy and time to build a high-precision template. In addition, manpower needs to be invested in the continuous maintenance of all customized templates in the later stage to ensure that the functions of existing templates will not be affected when new templates are added. Second, no matter how finely the template is designed, it is difficult to fully consider the problems that may arise in real situations. For example, the docu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/20G06K9/62G06N3/04G06N3/08G06T3/40

CPCG06T3/4038G06N3/08G06V10/22G06V10/751G06N3/045G06F18/253Y02D10/00

Inventor宋佳奇朱军民王勇

OwnerBEIJING YIDAO BOSHI TECH

Document key information extraction method based on zero sample learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology