Few-sample document layout analysis method based on metric learning

A technology of metric learning and layout analysis, applied in neural learning methods, instruments, biological neural network models, etc., can solve the problem of time-consuming and labor-intensive acquisition of training data

Active Publication Date: 2020-12-11
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF8 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Although based on the above two methods and derived many new deep semantic segmentation networks with remarkable effects, these networks all have the same serious problem: these networks require a large amount of finely labeled data for training
However, the training data requires pixel-by-pixel marking, which makes the acquisition of training data a very time-consuming and laborious task, especially in the case of complex document content.
One way to delay is to use weakly supervised learning for training, but still requires a lot of weakly labeled training data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Few-sample document layout analysis method based on metric learning
  • Few-sample document layout analysis method based on metric learning
  • Few-sample document layout analysis method based on metric learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0075] For the convenience of description, the relevant technical terms appearing in the specific implementation are explained first:

[0076] Support image: support image

[0077] Query image: query image

[0078] COS (cosine): cosine distance

[0079] VGG-16 (Visual Geometry Group Network-16): Visual Geometry Group Network

[0080] RGB(Red-Green-Blue): RGB color mode

[0081] Maxpool: maximum pooling

[0082] Conv (convolution): Convolution

[0083] Reshape: Reshape

[0084] Transpose: Transpose

[0085] Softmax: logarithmic function

[0086] Argmax: parameter maximum function

[0087] k-shot: k pictures

[0088] DSSE-200 (Document semantic structure extraction): document semantic structure extraction dataset

[0089] Layout Analysis Dataset: layout analysis dataset

[0090] PASCAL-5i: PASCAL-5i dataset

[0091] SG-One (Similarity guidance network for one-shot semanticsegmentation): a similarity guidance network for semantic segmentation

[0092] figure 1 It is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a few-sample document layout analysis method based on metric learning, and the method comprises the steps: constructing position attention features and channel attention features of different regions through extracted original feature graphs of a document image, and enabling the representation features of different regions in the feature graphs to be fully utilized throughthe fusion of a convolution network; carrying out the prototype construction by using representation features obtained by fusion, and carrying out the alignment operation by using a result obtained bysegmentation, so document content segmentation is carried out by using a small amount of data, and meanwhile, the segmentation precision is improved.

Description

technical field [0001] The invention belongs to the technical field of semantic segmentation in computer vision, and more specifically, relates to a method for analyzing the layout of a few-sample document based on metric learning. Background technique [0002] In the era of mobile Internet, the acquisition and sharing of electronic documents has become very convenient, and the analysis of document layout can effectively extract valuable information. As the number of documents continues to increase and the contents of documents become more diverse, analyzing the layout content of documents has become a new trend of semantic segmentation. The goal of document layout analysis is to classify different regions in the document image and obtain segmentation results with different label information. [0003] The existing method for analyzing document layout is better based on semantic segmentation based on deep network. The semantic segmentation method based on deep network mainly...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V30/40G06V10/40G06N3/045G06F18/241G06F18/253G06F18/214
Inventor 徐行赖逸张鹏飞邵杰陈李江
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products