Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for splitting documents

A document and new document technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of low efficiency and long time of manual splitting of documents

Inactive Publication Date: 2015-09-16
PEKING UNIV FOUNDER GRP CO LTD +1
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention aims to provide a method and device for splitting documents to solve the above-mentioned problems of low efficiency and long time for manually splitting documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for splitting documents
  • Method and device for splitting documents
  • Method and device for splitting documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention will be described in detail below with reference to the accompanying drawings and in combination with embodiments. see figure 1 , embodiment one includes the following steps:

[0019] Step S11: Parsing out the content file and paragraph style file of the original document in xml format.

[0020] The original document is composed of multiple files, including at least a content file in xml format that records word count data stored in the original document, and a paragraph structure style that specifies the display of character data is stored in a paragraph style file in xml format. For example, documents in word format can extract content files and paragraph style files in xml format through compression / decompression algorithms.

[0021] Step S12: In the content file, find the paragraph position to which each paragraph style in the paragraph style file is applied.

[0022] Step S13: output the found paragraph content at each paragraph position in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method and a device for splitting a document. The method comprises the following steps of: analyzing a content file with an xml format of an original document and a paragraph style file; searching a paragraph position of each paragraph style applied to the paragraph style file in the content file, and respectively outputting the searched paragraph content at each paragraph position to different new documents. The invention provides the device for splitting the document. According to the embodiment of the invention, content in the content file is split through a paragraph style in the paragraph style file by analyzing the content file of the original document and the paragraph style file, and the extracted content is stored into a new document, so that the problem of lower efficiency of directly extracting the content from the document manually and splitting into the new document is overcome, and an efficient and quick effect is achieved.

Description

technical field [0001] The present invention relates to the field of printing, in particular to a method and device for splitting documents. Background technique [0002] A book usually consists of several parts: the main title page, the pre-text supplementary text, the main text, the text supplementary text, and the post-text supplementary text. Among them, the text is composed of articles, chapters and sections. By digitizing books, books can be saved in the form of electronic documents. [0003] Before a book can be published, it needs to be edited. Since a book is made up of multiple parts, it is possible for each part to be compiled by a different author during the compilation process. For example, a document containing the content of a book is split into three documents, and then the three documents are handed over to three different authors for processing, such as reviewing, revising, or typesetting. [0004] In the current process of splitting the document, manua...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/25G06F17/30G06F40/189
Inventor 岳永强
Owner PEKING UNIV FOUNDER GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products