Document information extraction method and device

A technology of document information and extraction method, which is applied in the field of image processing and can solve problems such as low error tolerance rate and large amount of computation

Pending Publication Date: 2021-12-07
BEIJING MATARNET TECH
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the existing technology, when calculating the matching relationship between line segments and line segments, it is necessary to perform matching line segment by line segment, which requires a large amount of calculation and low error tolerance rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document information extraction method and device
  • Document information extraction method and device
  • Document information extraction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0041] In order to overcome the above-mentioned problems in the prior art, the inventive idea of ​​the embodiment of the present invention is: by defining a template image, and defining the anchor point and the relative position between the region of interest and the anchor point in the template image, the anchor point represents an indicator ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a document information extraction method and a device. The method comprises the steps of determining an area serving as an anchor point area in a to-be-extracted document image, adjusting an image with a to-be-extracted contour to be in the size of a preset template image, obtaining a target image, adjusting the area to be in the size of a preset template anchor point, and obtaining a target anchor point; determining the position of the target anchor point in the target image, and obtaining the position of the region of interest in the target image by combining the pre-defined region of interest of the template image with the relative position of the template anchor point; according to the size of a pre-defined region of interest in the template image, in combination with the position of the region of interest, extracting the region of interest from the target image; wherein the template image is defined according to a document image to be extracted. The embodiment of the invention is particularly suitable for extracting document information in a large number of document images with fixed formats.

Description

technical field [0001] The present invention relates to the technical field of image processing, and more specifically, to a method and device for extracting document information. Background technique [0002] In the digital age, a large number of paper documents need to be digitally archived. Document photography is a simple and effective way; but it is a problem to detect the content and quality of massive photos. [0003] Existing document identification usually includes steps such as customizing templates, type distinction, matching and positioning, area identification, and post-identification processing. First, customize the identification template for the document to be identified, including classification features and identification element information. During the identification process, the identification Feature extraction is performed on the form image, and the matching templates are screened in the template library according to the extracted features, and the best...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06T7/13G06T7/73
CPCG06T7/13G06T7/75G06T2207/30176
Inventor 邱效辉
Owner BEIJING MATARNET TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products