Document hierarchy division method, document hierarchy division device and readable storage medium
A hierarchical and document technology, which is applied in computer-readable storage media, document hierarchical division devices, and document hierarchical division fields, can solve problems such as confusing structures, inability to meet the requirements of automatic typesetting or other digital publishing, and few book outline structures
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0046] Such as figure 1 As shown, in one embodiment of the present invention, a method for dividing document levels is provided, including:
[0047] Step S102, obtaining the title in the document, and extracting the text features of the title;
[0048] Step S104, classify the titles according to the text features to determine the title categories;
[0049] Step S106, determining the level of the title according to the category of the title and the arrangement order;
[0050] Among them, the text features include: keyword information, word meaning information and font information.
[0051] In this embodiment, before the document needs to be typeset and laid out, all the titles in the document are obtained, and the text features of the titles are extracted, and the titles obtained are classified according to the extracted text features to determine the category of each title, according to The category and sorting order of the title determine the level of the title, and the do...
Embodiment 2
[0063] Such as Figure 4 As shown, in one embodiment of the present invention, a method for dividing document levels is provided, including:
[0064] Step S402, obtaining the title in the document, and extracting the text features of the title;
[0065] Step S404, the word meaning information of the title does not conform to the preset word meaning and determine that the title is an unrated title;
[0066] Step S406, classifying undecided titles according to the keyword information to determine item titles and digital titles;
[0067] Step S408, classifying the item titles into levels according to the arrangement order, so that the item titles are graded titles, and determine the level of the rated titles;
[0068] Step S410, searching for the grading titles located before the numerical titles according to the arrangement order, and determining the level of the grading title closest to the numerical titles;
[0069] Step S412, determining the level of the digital title acco...
Embodiment 3
[0082] Such as Figure 6 As shown, in a specific embodiment of the present invention, a method for dividing document levels is provided, including:
[0083] Step S602, identifying and classifying different types of titles contained in the document;
[0084] Step S604, determining various title levels;
[0085] Step S606, optimize the overall title level, clear the overall empty level, and set the title level.
[0086] In this embodiment, the document hierarchical division method solves the problem of specific types of titles containing "module one", "task one", "project one", "knowledge point one" and "one", "(one)" and "1" in the existing method "(1)" "1)" and other forms as the title sequence number are not handled well, including: title identification and classification device, title level determination device and title level optimization device, through the title identification and classification device, identify the document Different types of titles contained in the con...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


