Unlock instant, AI-driven research and patent intelligence for your innovation.

Generation method and system of structured document

A structured document and unstructured technology, which is applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of unable to process and retrieve documents, and difficult to relate to the description content of unstructured documents.

Inactive Publication Date: 2015-07-01
HUADI COMP GROUP
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, most of the processing methods for unstructured documents in the existing technology can only perform structural processing on peripheral information such as the version and number of unstructured documents, and realize the conversion from unstructured documents to structured documents, which is difficult to involve to the description content of the unstructured document itself, therefore, the existing technology cannot really realize the content processing and retrieval of the document

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generation method and system of structured document
  • Generation method and system of structured document
  • Generation method and system of structured document

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the description of the present invention refers to the presence of said features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Additionally, "connected" or "coupled" as used herein may include wirelessly connected or coupled. As used herein, the term "and / or" includes any and all combinations of one or more of the associated listed items.

[0033] Those skilled in the ar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a generation method and system of a structured document. The method comprises the steps of collecting a non-structured document, extracting properties of the non-structured document, setting and extracting keywords of the non-structured document and constructing the structured document corresponding to the non-structured document by using the properties and the keywords of the non-structured document. According to the generation method and system of the structured document, by extracting the properties such as the name, the number of pages, the release date, the format, the author, the release unit, the authorization unit and the version of the non-structured document and the keywords extracted based on custom rules and using the extracted properties and keywords to construct the structured document corresponding to the non-structured document, a complete set of structured documents is formed, the defects that a traditional non-structured document generally exists in a text mode, and the actual operation and application are not facilitated are overcome, and the content management and application of an original non-structured document are achieved through the structured document.

Description

technical field [0001] The invention belongs to the technical field of information processing and retrieval, and in particular relates to a method and system for generating structured documents. Background technique [0002] With the popularity of the network, information has become an indispensable part of life and work. Huge amount of information requires more effective information processing technology, and the utilization of huge amount of information requires efficient information retrieval technology. Documents, as a traditional information storage method, carry a large amount and variety of information. There are a large number of documents and materials in governments at all levels and industries, but the documents and materials of most institutions or organizations still exist in the form of unstructured texts. way to save. This form is not conducive to the understanding and publicity of the content of the document, and it is not conducive to the long-term and st...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 支俊辉贾楠余洁玮
Owner HUADI COMP GROUP