Structured digital content extraction and reorganization method

A digital content and unstructured data technology, applied in the information field, can solve problems such as not being suitable for the publishing industry, and achieve the effect of reducing information redundancy and improving efficiency

Inactive Publication Date: 2012-08-22
CHINA NAT INST OF STANDARDIZATION
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the face of the wave of digital publishing, traditional content organization and release methods are no longer suitable for the publishing industry under the new situation. The development of the digital publishing industry needs to introduce new content organization methods and technical standards

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Structured digital content extraction and reorganization method
  • Structured digital content extraction and reorganization method
  • Structured digital content extraction and reorganization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The method of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments of the present invention.

[0029] The core idea of ​​the present invention is to adopt the characteristics and advantages of structured content extraction, reorganization and mapping to adapt to the characteristics of the multi-terminal, multi-form and multi-channel publishing mode in the digital publishing era, so as to realize the maximization of information production and dissemination effects .

[0030] figure 1 It is a flow chart of the method for extracting and recombining structured digital content of the present invention, such as figure 1 As shown, the method mainly includes the following steps:

[0031] Step 11: Store the candidate content for digital publishing in an unstructured data storage container represented by eXtensible Markup Language (XML) format.

[0032] During the content circulation process of digital pu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a structured digital content extraction and reorganization method which comprises the following steps: storing digitally-published alternative contents into an unstructured data memory with an extensible markup language (XML) format as the representative; formatting an information unit in the data memory according to a tag of an extracted information unit of standard definitions of structured digital content extraction and reorganization so as to form subject blocks of an information agent; carrying out correlation between the subject blocks through mapping implemented by taking an XML as a carrier, and under the action of the mapping, reorganizing the dispersed subject blocks into a structured document having a logical relationship; and carrying out style rendering on the structured document through extensible style language (XSL) and extensible style language transformation (XSLT) according to the needs of publication, thereby generating various target publication formats which can be converted and formed by the XML. By using the method disclosed by the invention, a characteristic that future publications are diversified in content bearing form, display form and terminal can be adapted.

Description

technical field [0001] The present invention relates to the field of information technology, in particular to a method for extracting and reorganizing structured digital content, using digital publishing technology and database document management technology to solve the problem of unfavorable storage of document content and information redundancy in traditional digital publishing production. remaining questions. Background technique [0002] As a new publishing industry, digital content publishing has gradually spread to various reading terminals with the development of the Internet and mobile communications. At present, the display terminals of digital content publications are becoming more and more abundant, the industrial service chain is becoming more and more perfect, and the technology is constantly innovating. It has become a new growth point in the publishing industry, and has attracted extensive attention and active participation from publishing practitioners and r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 高昂邢立强孙广芝程越
Owner CHINA NAT INST OF STANDARDIZATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products