Complex document separating and organizing method and complex document automatic generating method

A technology for automatic generation and documentation, applied in natural language data processing, special data processing applications, instruments, etc., can solve problems such as unfavorable OfficeOpenXml standard promotion and cross-platform use

Active Publication Date: 2015-05-06
JIANGNAN INST OF COMPUTING TECH
View PDF8 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] With the establishment of the Office OpenXml standard, for complex documents such as Word, Excel, and PowerPoint, whether it is the content information of the document, or the format and style of the document template, it can be described uniformly using the Xml language, which gives the content of complex documents The separation of style and style br

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Complex document separating and organizing method and complex document automatic generating method
  • Complex document separating and organizing method and complex document automatic generating method
  • Complex document separating and organizing method and complex document automatic generating method

Examples

Experimental program
Comparison scheme
Effect test

specific example 1

[0053] Taking the organization and description of word processing document templates as an example, the document outline, data and style template composition and their interrelationships are as follows: image 3 As shown, the document template is divided into nine categories: outline, data group, metadata, style group, section, style (font, table, paragraph), header, footer, numbering.

[0054] The outline template is used to describe the outline of a complex document, which is a macroscopic description of the document structure. In the process of generating a complex document, dozens of document templates may be used. Most document templates can be shared in the process of generating multiple different documents, but for a certain document, only the outline template is necessary. . In the outline template, through the multi-level hierarchical design of Layer→Group[Layer], the layer-by-layer refinement of the document structure and the macro-structure description of the docum...

specific example 2

[0063] Taking the automatic generation process of word processing documents as an example, the descriptions of the outline template, data group template, style group template, data template, and section template of the example refer to Figure 4 , Figure 5 , Image 6 , Figure 7 , Figure 8 .

[0064] The specific steps of the automatic generation process of the document are as follows:

[0065] 1) Obtain such as from the data source Figure 4 The outline template shown contains child nodes such as Layer, Properties, and Parts.

[0066] 2) Analyze the nodes and their attributes at all levels of the outline template. First parse the Properties node to obtain the public property information of the document. Then parse the Parts node to obtain the relevant data of the document and the definition of the style template. Finally, analyze the Layer node and its Group child nodes, and analyze the structure of the document layer by layer.

[0067] 3) Obtain and load the data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a complex document separating and organizing method and a complex document automatic-generating method. The complex document separating and organizing method comprises the following step: decomposing a complex document into a document outline, document data and document styles, wherein the document outline of the complex document is defined as macroscopic description of a document structure, is used for hierarchical decomposition, definition and management of the document according to document content, and is a uniform organization of the document data and a document style template, the document data of the complex document is used for organization and description of document metadata, and the document style of the complex document is used for organization and description of the document styles.

Description

technical field [0001] The present invention relates to the field of document generation, and more specifically, the present invention relates to a complex document separation and organization method and a complex document automatic generation method based on XML (Extensible Markup Language, Extensible Markup Language) description. Background technique [0002] Documents have always been one of the important tools for carrying information and an important means for people to exchange information. Due to the different content of the information to be described, the types of documents are colorful, including text files (TXT), rich text files (RTF, DOC) mainly based on text information, and spreadsheet files (Excel) mainly based on chart data. There are presentation files (PPT) based on image presentations, and drawing files (Visio) based on graphic drawing. Moreover, due to the differences in various storage methods and various tool analysis methods, the formats of documents ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/21
CPCG06F40/151G06F40/186
Inventor 董国良吴利董超群黄东海
Owner JIANGNAN INST OF COMPUTING TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products