An image classification method for tabular documents based on frame features and pixel distribution

A table document image classification technology, applied in character and pattern recognition, instruments, and computing. It addresses the low efficiency and unreliable accuracy of manual classification, and achieves an enhanced table frame structure, improved image quality, and improved classification accuracy.

Active Publication Date: 2022-07-08
FUZHOU UNIV
Cites: 4. Cited by: 0.

AI Technical Summary

Problems solved by technology

[0002] With the continuous development and progress of the economy and society, various industries and departments of the country will produce a large number of form documents in daily production and life. Manually classifying these form documents is not only inefficient, but also the accuracy of the classification cannot be guaranteed.

Examples

Detailed Description of the Embodiments

[0048] The present invention will be further described below with reference to the accompanying drawings and embodiments.

[0049] As shown in Figure 1, this embodiment provides a table document image classification method based on frame line features and pixel distribution, which specifically includes the following steps:

[0050] Step S1: Acquire and read the table document image to be classified (the image to be classified), then perform grayscale conversion, binarization, and frame-line structure enhancement based on connected-domain analysis;

[0051] Step S2: Denoise the enhanced image using a deep learning method based on a multi-layer perceptron, which completes the preprocessing of the image to be classified;

[0052] Step S3: Using a morphology-based straight-line detection method, detect and extract the horizontal and vertical frame lines of the image to be classified separately, refine them, and use the NPca...
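The binarization in Step S1 and the morphology-based frame line extraction in Step S3 can be illustrated with a minimal sketch. It is not the patented implementation: the fixed threshold, the function names, and the minimum run length `k` are assumptions made for illustration, and the connected-domain enhancement, the denoising of Step S2, and the line refinement are omitted. The sketch relies on the fact that a morphological opening with a 1 x k line element keeps exactly those ink pixels that lie in horizontal runs of at least k pixels.

```python
def binarize(gray, threshold=128):
    """Return 1 for dark (ink) pixels and 0 for background.

    The fixed threshold is an assumption; the source does not name one.
    """
    return [[1 if value < threshold else 0 for value in row] for row in gray]


def open_horizontal(binary, k):
    """Opening with a 1 x k line element: keep horizontal ink runs of length >= k."""
    out = [[0] * len(row) for row in binary]
    for y, row in enumerate(binary):
        x = 0
        while x < len(row):
            if row[x]:
                start = x
                while x < len(row) and row[x]:
                    x += 1
                if x - start >= k:  # run is long enough to be a frame line
                    for i in range(start, x):
                        out[y][i] = 1
            else:
                x += 1
    return out


def transpose(img):
    """Swap rows and columns of a rectangular image."""
    return [list(col) for col in zip(*img)]


def open_vertical(binary, k):
    """Opening with a k x 1 line element, done by transposing the image."""
    return transpose(open_horizontal(transpose(binary), k))


# A tiny hypothetical table cell: '#' is ink, '.' is background.
rows = [
    "#######",
    "#.....#",
    "#..##.#",  # the short '##' run stands in for text inside the cell
    "#.....#",
    "#######",
]
gray = [[0 if ch == "#" else 255 for ch in row] for row in rows]
binary = binarize(gray)
horizontal = open_horizontal(binary, k=4)
vertical = open_vertical(binary, k=4)

for row in horizontal:
    print("".join("#" if v else "." for v in row))
```

With `k=4`, the two-pixel text run is discarded while the top and bottom borders survive in `horizontal` and the left and right borders survive in `vertical`. In practice `k` would be chosen relative to the image size.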

Abstract

The invention relates to a table document image classification method based on frame line features and pixel distribution. First, grayscale conversion and binarization are performed on the image to be classified, and a frame line enhancement operation based on connected domain analysis is applied to the resulting binary image. The enhanced image is then denoised using a deep learning method. For the preprocessed image, a morphology-based line detection method detects and extracts the horizontal and vertical frame lines separately and refines them, and an Npcanny-based line detection method is then used to obtain the number of frame lines. The horizontal frame line image is projected in the horizontal direction and the vertical frame line image in the vertical direction, and the position and pixel length of each projection are recorded. This information is matched against the standard template information already entered in the template library, the template image with the highest similarity to the image to be classified is selected, and the classification result is output. The present invention can effectively classify table document images.
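The projection and template matching stage described in the abstract can be sketched as follows. This is an illustrative sketch only: the abstract does not state how similarity is computed, so the `similarity` score, the template names, and the layout of the template library are all assumptions.

```python
def project_rows(line_img):
    """Horizontal projection: (row index, ink pixel count) for each non-empty row."""
    return [(y, sum(row)) for y, row in enumerate(line_img) if sum(row) > 0]


def project_cols(line_img):
    """Vertical projection: (column index, ink pixel count) for each non-empty column."""
    columns = [list(col) for col in zip(*line_img)]
    return [(x, sum(col)) for x, col in enumerate(columns) if sum(col) > 0]


def similarity(profile_a, profile_b):
    """Hypothetical score in [0, 1] (an assumption, not the source's measure).

    Returns 0 when the line counts differ, otherwise
    1 / (1 + mean absolute difference of positions and lengths).
    """
    if len(profile_a) != len(profile_b):
        return 0.0
    if not profile_a:
        return 1.0
    total = sum(
        abs(pos_a - pos_b) + abs(len_a - len_b)
        for (pos_a, len_a), (pos_b, len_b) in zip(profile_a, profile_b)
    )
    return 1.0 / (1.0 + total / len(profile_a))


def classify(h_profile, v_profile, templates):
    """Return the name of the template with the highest combined similarity."""
    return max(
        templates,
        key=lambda name: similarity(h_profile, templates[name][0])
        + similarity(v_profile, templates[name][1]),
    )


# Hypothetical frame line images: 1 marks a frame line pixel.
h_lines = [[1] * 7, [0] * 7, [0] * 7, [0] * 7, [1] * 7]
v_lines = [[1, 0, 0, 0, 0, 0, 1] for _ in range(5)]

h_profile = project_rows(h_lines)
v_profile = project_cols(v_lines)

# Hypothetical template library: name -> (horizontal profile, vertical profile).
templates = {
    "template_a": ([(0, 7), (4, 7)], [(0, 5), (6, 5)]),
    "template_b": ([(0, 7), (2, 7), (4, 7)], [(0, 5), (3, 5), (6, 5)]),
}
label = classify(h_profile, v_profile, templates)
print(label)
```

Comparing line counts first mirrors the role of the frame line number information in the abstract; a real system would likely normalize positions and lengths by image size so that scans at different resolutions remain comparable.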

Description

Technical field

[0001] The invention relates to the fields of morphology and computer vision, and in particular to a table document image classification method based on frame line features and pixel distribution.

Background

[0002] As the economy and society develop, industries and government departments generate large numbers of table documents in daily production and life. Classifying these documents manually is inefficient, and its accuracy cannot be guaranteed. The classification features of table documents generally include titles, frame lines, and special characters. Because classification based on frame lines is more versatile, frame line features are used here as the classification features of table documents.

[0003] The table document image classification detects and extracts the frame line features of the table document image read into t...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06V30/412, G06V30/413, G06V30/18, G06V30/164, G06V30/168, G06V30/19, G06K9/62
CPC: G06V30/412, G06V30/413, G06V10/30, G06V10/34, G06V10/44, G06F18/24
Inventors: 柯逍 (Ke Xiao), 王俊强 (Wang Junqiang)
Owner: FUZHOU UNIV