Form image layout analysis method and system

A layout analysis and image technology, applied in the field of information processing, can solve the problems of difficult to identify models, form image layout analysis without table lines, and acquisition of table images without table lines, etc., to achieve the effect of text recognition

Active Publication Date: 2019-09-06
AGRICULTURAL BANK OF CHINA +1
View PDF6 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the large number of types, it is difficult to have a unified recognition model
For example, the existing recognition model cannot perform layout analysis on table images without table lines, so that text recognition technology cannot be used to obtain relevant information for table images without table lines

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Form image layout analysis method and system
  • Form image layout analysis method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0047] The terms "first" and "second" in the specification and claims of the present invention and the above drawings are used to distinguish different objects, rather than to describe a specific order. Furthermore, the terms "comprising" and "having", and any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or apparatus comprising a series of steps or units is not defined by lis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a form image layout analysis method and system, and the method comprises the steps: carrying out the image processing of a first image, obtaining a second image, enabling the first image to represent that at least a part of areas of the image have no form lines, and enabling the second image to represent a plain text image; performing image projection processing on the second image to obtain a projection result; analyzing the projection result based on a preset threshold to obtain table information; converting the first image based on the table information to obtain a target image, the target image representing a table image with a table line; and performing character recognition on the target image to obtain character information of the target image. Due to the fact that the table information is the row and column related data and the coordinate condition, the table image without the table lines in the partial area can be converted into the target image with the table lines, layout analysis can be conducted on the target image through the character recognition technology, and then character recognition of the table image without the table lines is achieved.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to a form image layout analysis method and system. Background technique [0002] As the important reference data of the enterprise, the financial statement data of the enterprise is usually entered manually, which will bring problems of low efficiency and high error rate. Therefore, the existing technology uses character recognition technology to solve the problems caused by manual entry. [0003] In terms of application scenarios, text recognition technology is generally divided into general recognition and layout recognition. General recognition is to simply extract all the text information in the image; layout recognition is to extract text information for images with a specific format, and to structure the data, that is, to clarify the meaning of the data in the target area. Table images, as a typical layout recognition scenario, have a large number of applicatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00
CPCG06V30/412G06V30/43G06V30/413
Inventor 王佳赵焕芳杨声钢高峰田瑞云赵思远张愉婧
Owner AGRICULTURAL BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products