Document structuration organizing method and device

A document structure and document technology, applied in the direction of unstructured text data retrieval, text database clustering/classification, special data processing applications, etc., can solve problems such as inability to reflect and understand difficulties, and achieve the effect of convenient reading

Active Publication Date: 2014-03-26
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

That is to say, there may be a sequential or hierarchical relationship between document contents, but these relationships cannot be reflected only by the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document structuration organizing method and device
  • Document structuration organizing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The ideal document organization method should have a relatively clear hierarchical division. Taking the "Guidelines for Patent Examination" as an example, the document organization structure is as follows:

[0065] Part I Preliminary Review

[0066] Chapter 1 Preliminary Examination of Invention Patents

[0067] 1 Introduction

[0068] 2. Review Principles

[0069] 3. Review procedure

[0070] 3.1 Passed the preliminary examination

[0071] 3.2 Supplement and Correction of Application Documents

[0072] 3.3 Handling of obvious substantive defects

[0073] ...

[0074] 4. Formal review of application documents

[0075] ...

[0076] Chapter II Preliminary Examination of Utility Model Patents

[0077] ...

[0078] Part II Substantive Examination

[0079] ...

[0080] Part III Examination of International Applications Entering the National Phase

[0081] ...

[0082] In some UGC platforms, users often upload some of their own documen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a document structuration organizing method and device. The document structuration organizing method includes the steps of obtaining a theme framework of a hierarchical structure, forming a searching condition through a theme text in the theme framework, carrying out searching in a preset document set with the searching condition, and adding a document into a corresponding theme document set in the theme framework according to the matching condition of the searching result and the searching condition. Compared with the prior art, the technical scheme of the document structuration organizing method and device can be used for automatically building proper classification systems according to different knowledge fields; as the theme framework is built with mature expert knowledge, inner links of classifications can be well reflected, and a user can conveniently read a large number of texts in a systematized mode.

Description

technical field [0001] The invention relates to the field of computer application technology, in particular to a method and device for structured document organization. Background technique [0002] With the development of Internet technology, the amount of information on the Internet has exploded. In order to apply these information better, it is necessary to manage these information data effectively. Among them, document classification (document classification) is currently a widely used management technology. Document classification refers to determining a category for each document in the document collection according to the content or certain attributes of the document. In this way, users can not only browse documents in a specific category conveniently, but also make finding documents easier by limiting the search scope. [0003] However, for massive document resources, even after a certain classification process, there will still be a large number of documents unde...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 徐兴军
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products