Document hierarchy division method, document hierarchy division device and readable storage medium

A document hierarchy division technology, applied to document hierarchy division methods, document hierarchy division devices, and computer-readable storage media. It addresses the problem that documents in the publishing workflow rarely contain a book outline structure, or contain a confused one, and therefore cannot meet the requirements of automatic typesetting or other digital publishing.

Active Publication Date: 2021-10-26
NEW FOUNDER HLDG DEV LLC +1

AI Technical Summary

Problems solved by technology

[0002] In the current field of book publishing, the traditional publishing process and technology focus mainly on the content of the book. As a result, the documents that circulate among authors, editors, typesetters, and printers consist mainly of that content; the outline structure of the book is rarely included, or is chaotic. Such documents cannot meet the requirements of automatic typesetting or other digital publishing.

Examples

Embodiment 1

[0046] As shown in Figure 1, one embodiment of the present invention provides a document hierarchy division method, including:

[0047] Step S102, obtaining the titles in the document and extracting the text features of the titles;

[0048] Step S104, classifying the titles according to the text features to determine the title categories;

[0049] Step S106, determining the level of each title according to its category and the arrangement order;

[0050] The text features include keyword information, word-meaning information, and font information.

[0051] In this embodiment, before the document is typeset and laid out, all the titles in the document are obtained and their text features are extracted. The titles are classified according to the extracted text features to determine the category of each title, the level of each title is determined according to its category and arrangement order, and the do...
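
Steps S102 to S106 can be illustrated with a minimal sketch. It is not the patented implementation: the `Title` class, the keyword patterns, and the rule that a category receives its level in order of first appearance are all assumptions made for illustration, and word-meaning information is not modeled at all.

```python
import re
from dataclasses import dataclass


@dataclass
class Title:
    text: str
    font_size: float  # stands in for "font information" (hypothetical field)


# Hypothetical keyword patterns; the source does not list the real keywords.
CATEGORY_PATTERNS = [
    ("chapter", re.compile(r"^Chapter\s+\d+")),
    ("section", re.compile(r"^\d+\.\d+\s")),
    ("numbered", re.compile(r"^\(?\d+\)\s")),
]


def classify(title: Title) -> str:
    """S104: pick a category from keyword features, falling back to font size."""
    for name, pattern in CATEGORY_PATTERNS:
        if pattern.search(title.text):
            return name
    return f"font-{title.font_size:g}"


def assign_levels(titles):
    """S106 (assumed rule): a category gets its level the first time it appears."""
    level_of = {}
    result = []
    for t in titles:
        category = classify(t)
        if category not in level_of:
            level_of[category] = len(level_of) + 1
        result.append((level_of[category], t.text))
    return result


titles = [
    Title("Chapter 1 Intro", 18),
    Title("1.1 Background", 14),
    Title("(1) Scope", 12),
    Title("Chapter 2 Method", 18),
]
for level, text in assign_levels(titles):
    print(level, text)
```

The first-appearance rule is only one way to read "category and arrangement order"; it works when the document opens with its top-level title style.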

Embodiment 2

[0063] As shown in Figure 4, one embodiment of the present invention provides a document hierarchy division method, including:

[0064] Step S402, obtaining the titles in the document and extracting the text features of the titles;

[0065] Step S404, when the word-meaning information of a title does not conform to the preset word meaning, determining that the title is an ungraded title;

[0066] Step S406, classifying the ungraded titles according to the keyword information into item titles and numeric titles;

[0067] Step S408, dividing the item titles into levels according to the arrangement order, so that the item titles become graded titles, and determining the level of the graded titles;

[0068] Step S410, searching, according to the arrangement order, for the graded titles located before a numeric title, and determining the level of the graded title closest to the numeric title;

[0069] Step S412, determining the level of the numeric title acco...
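
The flow of steps S406 to S412 can be sketched as follows. Because the source text is cut off at step S412, the rule used here (a numeric title sits one level below the nearest preceding graded title) is an assumption, as are the keyword list, the regular expressions, and the function name `grade`. For simplicity only item titles count as graded titles in this sketch.

```python
import re

# Hypothetical item keywords; the source does not give the actual list.
ITEM_RE = re.compile(r"^(Module|Project|Task)\s+\w+")
# Numeric titles such as "1. ", "(1) " or "1) ".
NUMERIC_RE = re.compile(r"^\(?\d+[).]?\s")


def grade(titles):
    """Return one level per title (None if the title matches neither kind)."""
    keyword_level = {}  # item keyword -> level, in order of first appearance
    last_graded = 0     # level of the nearest preceding graded (item) title
    levels = []
    for text in titles:
        item = ITEM_RE.match(text)
        if item:
            keyword = item.group(1)
            if keyword not in keyword_level:
                keyword_level[keyword] = len(keyword_level) + 1
            level = keyword_level[keyword]
            last_graded = level
        elif NUMERIC_RE.match(text):
            # ASSUMPTION: one level below the closest preceding graded title.
            level = last_graded + 1
        else:
            level = None
        levels.append(level)
    return levels


sample = [
    "Module 1 Basics",
    "Task 1 Setup",
    "1. Install",
    "2. Configure",
    "Task 2 Run",
    "Module 2 Advanced",
    "1) Overview",
]
print(grade(sample))
```

Note how "1) Overview" lands at level 2 because its nearest graded title is a module, while "1. Install" lands at level 3 under a task: the same numbering style can sit at different depths depending on context.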

Embodiment 3

[0082] As shown in Figure 6, a specific embodiment of the present invention provides a document hierarchy division method, including:

[0083] Step S602, identifying and classifying the different types of titles contained in the document;

[0084] Step S604, determining the levels of the various titles;

[0085] Step S606, optimizing the overall title levels, clearing the overall empty levels, and setting the title levels.

[0086] In this embodiment, the document hierarchy division method addresses a weakness of existing methods, which do not handle well specific types of titles containing "module one", "task one", "project one", or "knowledge point one", nor title sequence numbers in forms such as "one", "(one)", "1", "(1)", and "1)". The method involves a title identification and classification device, a title level determination device, and a title level optimization device. The title identification and classification device identifies the different types of titles contained in the document con...
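
One plausible reading of "clearing the overall empty levels" in step S606 is renumbering the levels so that none is left unused. The sketch below shows that reading only; the function name `clear_empty_levels` and the interpretation itself are assumptions, not details confirmed by the source.

```python
def clear_empty_levels(levels):
    """Renumber levels to 1..k, preserving their relative order.

    Example: if only levels 1, 3 and 5 are in use, they become 1, 2 and 3.
    """
    used = sorted(set(levels))
    remap = {old: new for new, old in enumerate(used, start=1)}
    return [remap[level] for level in levels]


print(clear_empty_levels([1, 3, 3, 5, 1, 3]))
```

The mapping is global (one table for the whole document), which matches the word "overall" in the step description; a per-branch compaction would be a different design.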

Abstract

The invention provides a document hierarchy division method, a document hierarchy division device, and a readable storage medium. The method includes: obtaining the titles in the document and extracting the text features of the titles; classifying the titles according to the text features to determine the title categories; and determining the level of each title according to its category and the arrangement order. The text features include keyword information, word-meaning information, and font information. The method automatically recognizes the hierarchical relationship of the titles in the document, so that the outline structure of the document can be quickly extracted from that relationship. This meets the needs of editors and publishers for rapid checking of the logic of book content, automatic typesetting, and structured processing.
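
As an illustration of how an outline could be extracted once every title has a level, the following sketch nests titles with a stack. The input format (a list of `(level, title)` pairs with levels starting at 1) and the dictionary layout are assumptions for this example; the source does not describe the data structure used.

```python
def build_outline(leveled_titles):
    """Nest (level, title) pairs into a tree; levels are assumed to be >= 1."""
    root = {"title": None, "level": 0, "children": []}
    stack = [root]  # path from the root to the most recently added title
    for level, title in leveled_titles:
        node = {"title": title, "level": level, "children": []}
        # Pop until the top of the stack is a strictly shallower title.
        while stack[-1]["level"] >= level:
            stack.pop()
        stack[-1]["children"].append(node)
        stack.append(node)
    return root["children"]


outline = build_outline([(1, "A"), (2, "A.1"), (2, "A.2"), (1, "B")])
for top in outline:
    print(top["title"], [child["title"] for child in top["children"]])
```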

Description

Technical field

[0001] The present invention relates to the technical field of document typesetting, in particular to a document hierarchy division method, a document hierarchy division device, and a computer-readable storage medium.

Background

[0002] In the current field of book publishing, the traditional publishing process and technology focus mainly on the content of books. As a result, the documents that circulate among authors, editors, typesetters, and printers consist mainly of that content; the outline structure of the book is rarely included, or is chaotic. Such documents cannot meet the requirements of automatic typesetting or other digital publishing. How to meet the needs of editors and publishers for rapid logic checking, automatic typesetting, and structured processing of book content has become a technical problem that urgently needs to be solved.

Summary of the invention

[0003] The present invention aims to solve at least one of the technical pr...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/189, G06F40/137, G06F16/35
CPC: G06F16/353
Inventors: 魏超鹏, 黄媞
Owner: NEW FOUNDER HLDG DEV LLC