Publication PDF layout analysis and recognition method based on mixing of multiple neural networks

A technology of layout analysis and identification method, applied in the field of PDF layout analysis, can solve the problems of inability to provide physical and logical structure analysis, inability to distinguish text line headers and other structures, to improve recognition accuracy, alleviate model overfitting, and reduce impact Effect

Inactive Publication Date: 2020-03-06
重庆华龙网海数科技有限公司
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of this, the object of the present invention is to provide a published PDF layout analysis and recognition method based on a mixture of multiple neural networks, which solves th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Publication PDF layout analysis and recognition method based on mixing of multiple neural networks
  • Publication PDF layout analysis and recognition method based on mixing of multiple neural networks

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0033] The following describes the implementation of the present invention through specific specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific embodiments, and various details in this specification can also be modified or changed based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the illustrations provided in the following embodiments only illustrate the basic concept of the present invention in a schematic manner. In the case of no conflict, the following embodiments and the features in the embodiments can be combined with each other.

[0034] Among them, the drawings are only used for exemplary description, and they are only schematic diagrams rather than physical drawings, and cannot be und...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a publication PDF layout analysis and recognition method based on mixing of multiple neural networks, belongs to the technical field of image recognition and PDF layout analysis. The method comprises: using a multi-task training mode to recognize, segmenting and marking PDF layouts including paragraphs, titles and illustrations, locating text lines and then recognizing texts. According to the method, on the aspect of layout recognition, through a multi-task training mode, row and structure recognition annotation is completed at the same time, manual participation is not needed in the whole process, and PDF text structure information is effectively reserved. According to data with PDF text structure information obtained through layout analysis, a common Chinese dictionary for version data is constructed, and a text recognition model is trained in a targeted manner, so that the recognition precision of the model in a PDF printed text recognition task is greatly improved. The recognized text also has structural information, an original PDF layout structure is restored, and subsequent secondary editing, electronic book manufacturing and book content knowledge mining are also facilitated.

Description

technical field [0001] The invention belongs to the technical field of PDF layout analysis, and relates to a publishing PDF layout analysis and recognition method based on a mixture of multiple neural networks. Background technique [0002] With the rise of big data and artificial intelligence technology, many traditional industries have brought opportunities for digital and intelligent transformation, including the publishing industry. [0003] One of the biggest challenges in the transformation of the publishing industry is that it has a large amount of unstructured data resources that are difficult to process, such as: books, papers, etc. In order to meet the needs of publishing and printing, such resources are mostly contained in PDF format files. Due to the complexity of the PDF format itself, the diversity of publishing and printing requirements, and the lack of PDF editing specifications, the existing PDF books and paper data cannot be edited again, and the text info...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/205G06F40/279G06N3/04G06N3/08
CPCG06N3/049G06N3/08G06N3/045
Inventor 李翀卢云龙蒋路曦
Owner 重庆华龙网海数科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products