A form image layout analysis method and system

A table image layout analysis technology, applied in the field of information processing, that addresses the difficulty of building a unified recognition model and the inability to perform layout analysis on, or obtain information from, table images without table lines, thereby enabling text recognition for such images.

Active Publication Date: 2021-04-27
AGRICULTURAL BANK OF CHINA +1
6 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

However, because table images come in a large number of types, it is difficult to build a unified recognition model. For example, existing recognition models cannot perform layout analysis on table images without table lines, so text recognition technology cannot be used to obtain the relevant information from such images.

Embodiment Construction

[0046] The technical solutions in the embodiments of the present invention are described clearly below. It should be understood that the described embodiments are merely some of the embodiments of the present invention, not all of them. All other embodiments obtained on the basis of these embodiments without creative effort are also covered by the present invention.

[0047] The terms "first", "second", and the like in the description and claims of the present invention are used to distinguish different objects rather than to describe a particular order. Moreover, the terms "including" and "having", and any variants of them, are intended to cover non-exclusive inclusion. For example, something comprising a series of steps or units is not limited to the listed steps or units, but may include steps or units that are not listed.

[0048]In the embodiment of the present invention, a table ...

Abstract

The invention discloses a table image layout analysis method and system. The method includes: performing image processing on a first image to obtain a second image, where the first image is a table image in which at least part of the area has no table lines, and the second image is a plain-text image; performing image projection processing on the second image to obtain a projection result; analyzing the projection result based on a preset threshold to obtain table information; converting the first image based on the table information to obtain a target image, where the target image is a table image with table lines; and performing text recognition on the target image to obtain the text information of the target image. Because the table information consists of row- and column-related data and coordinates, a table image that lacks table lines in some areas can be converted into a target image with table lines. Text recognition technology can then perform layout analysis on the target image, which enables text recognition for table images without table lines.
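The projection and threshold steps in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: it assumes the second image is already a binarized plain-text image (1 for text pixels, 0 for background), treats the preset threshold as a maximum ink count per row or column, and places one table line in the middle of each blank band. The function names (`projection`, `find_gaps`, `separator_positions`, `draw_table_lines`) and the `min_gap` parameter are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumption: 1 = text (ink) pixel, 0 = background


def projection(img: Image, axis: str) -> List[int]:
    """Count ink pixels per row (axis='rows') or per column (axis='cols')."""
    if axis == "rows":
        return [sum(row) for row in img]
    return [sum(col) for col in zip(*img)]


def find_gaps(profile: List[int], threshold: int, min_gap: int) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) ranges where the profile stays at or
    below `threshold` for at least `min_gap` consecutive positions."""
    gaps = []
    start = None
    # A sentinel value above the threshold closes a gap that runs to the edge.
    for i, value in enumerate(profile + [threshold + 1]):
        if value <= threshold:
            if start is None:
                start = i
        elif start is not None:
            if i - start >= min_gap:
                gaps.append((start, i - 1))
            start = None
    return gaps


def separator_positions(profile: List[int], threshold: int = 0, min_gap: int = 1) -> List[int]:
    """Place one candidate table line in the middle of each blank band."""
    return [(s + e) // 2 for s, e in find_gaps(profile, threshold, min_gap)]


def draw_table_lines(img: Image, row_lines: List[int], col_lines: List[int]) -> Image:
    """Return a copy of the image with full-width and full-height lines drawn."""
    out = [row[:] for row in img]
    for r in row_lines:
        out[r] = [1] * len(out[r])
    for c in col_lines:
        for row in out:
            row[c] = 1
    return out


# Toy 7x9 "plain-text image": two text rows, two text columns, no table lines.
example = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]

row_lines = separator_positions(projection(example, "rows"))
col_lines = separator_positions(projection(example, "cols"))
target = draw_table_lines(example, row_lines, col_lines)
```

For the toy image, the blank bands yield row lines at indices 0, 3, and 6 and column lines at indices 0, 4, and 8. On a real scan, the threshold and minimum gap width would likely need tuning so that the spaces between characters within a cell are not mistaken for column boundaries.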

Description

Technical field

[0001] The present invention relates to the field of information processing, and in particular to a table image layout analysis method and system.

Background technique

[0002] Important enterprise reference data is often entered manually, which is inefficient and error-prone, so text recognition techniques are used to solve the problems caused by manual entry.

[0003] By application scenario, text recognition technology is generally divided into general recognition and layout recognition. General recognition simply extracts the information in an image; layout recognition targets images with a specific format, extracting the text information and structuring the data, that is, determining the meaning of the data in each target area. Table images are a typical layout recognition scenario with a large number of applications and urgent text recognition requirements, where corporate financial statements ar...

Application Information

Patent Type & Authority: Patents (China)
IPC (8): G06K9/00
CPC: G06V30/412, G06V30/43, G06V30/413
Inventor: 王佳赵焕芳杨声钢高峰田瑞云赵思远张愉婧
Owner: AGRICULTURAL BANK OF CHINA