A method and device for identifying tables in digital format documents
A format file and table technology, applied in the field of identifying tables in digital format files, can solve problems such as unrecognizable and incorrect recognition of complex tables, and achieve the effect of saving data processing costs and improving work efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] An embodiment of the present invention provides a method for identifying tables in a digital format file, including: extracting straight lines in the layout, and dividing the extracted straight lines into horizontal straight lines and vertical straight lines; The vertical straight lines in the class intersect, if they intersect, the straight lines intersecting in the horizontal straight line class and the vertical straight line class are determined as intersecting straight line groups; whether the quantity of the intersecting straight line groups is detected is greater than the first threshold, if so, then determine the The first area where the intersecting line group is located is a table area; otherwise, perform a vertical projection operation on the text in the first area, and determine whether the first area is a table area according to the vertical projection result.
[0044] Such as figure 1 As shown, the embodiment of the present invention provides a method for i...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 