Text paragraph structure restoration method, device and equipment and computer storage medium

A paragraph and text technology, applied in the field of computer readable storage medium, text paragraph structure restoration

Active Publication Date: 2020-12-11
ONE CONNECT SMART TECH CO LTD SHENZHEN
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to provide a text paragraph structure restoration method, device, equipment and c

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text paragraph structure restoration method, device and equipment and computer storage medium
  • Text paragraph structure restoration method, device and equipment and computer storage medium
  • Text paragraph structure restoration method, device and equipment and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] Such as figure 1 as shown, figure 1 It is a structural schematic diagram of a text paragraph structure restoration device of the hardware operating environment involved in the solution of the embodiment of the present invention.

[0042] Such as figure 1 As shown, the device for restoring text paragraph structure may include: a processor 1001 , such as a CPU, a network interface 1004 , a user interface 1003 , a memory 1005 , and a communication bus 1002 . Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. Optionally, the network interfac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of image processing, and discloses a text paragraph structure restoration method, device and equipment and a computer storage medium, and the method comprises the steps: carrying out the recognition of a target image, and determining all textboxes in the target image and the textbox positions of all textboxes based on a recognition result of recognition; sorting the textboxes according to the textbox positions, and inputting text features of the textboxes into a preset deep learning model for training based on a sorting result of sorting; and combining the textboxes based on the training result of the training to obtain all text paragraphs corresponding to the target picture. According to the method, the text paragraph structure restoration accuracy is improved.

Description

technical field [0001] The present invention relates to the technical field of image processing, in particular to a text paragraph structure restoration method, device, equipment and computer-readable storage medium. Background technique [0002] In the process of digitizing paper documents, it is necessary to enter the documents and retain the original format. Currently, text line-based detection and recognition methods cannot directly obtain text paragraph information. Currently, there are two methods, that is, top-down, that is, the layout analysis of the entire page is performed first, paragraphs are segmented, and then text lines in the paragraph area are detected and recognized. This type of method cannot capture local text detail features when doing layout analysis, and only uses image information without text content information, and the accuracy rate is not high. Or bottom-up, that is, first detect the text lines, and then merge the text lines to obtain paragraphs....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06F40/166
CPCG06F40/166G06V30/412G06V30/10
Inventor 高超徐国强
Owner ONE CONNECT SMART TECH CO LTD SHENZHEN
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products