Method and system for validating the content of technical documents

A content and document technology, applied in the field of confirming the content of technical documents and systems, and can solve complex problems

Inactive Publication Date: 2007-01-31
AGENCY FOR SCI TECH & RES
View PDF9 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0039] Therefore, the existing technology cannot meet the needs of the above problems, and more complex technology is needed here to automatically confirm the content of the document to meet the requirements of the above problem domain analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for validating the content of technical documents
  • Method and system for validating the content of technical documents
  • Method and system for validating the content of technical documents

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] This document describes a method, apparatus and computer program product for validating the content of technical documentation. However, it will be understood by those skilled in the art that the invention may be practiced without some of these details. In some instances, well known features have not been described in detail so as not to obscure the invention.

[0053] Described herein is a method and system for automatic content validation of electronic free-text documents in various formats, specifying qualitative, quantitative, relational, or logical attributes of entities referenced in the document . The system and method identify and extract semi-structured representations from the document, such as domain-specific entities referenced in a document and attributes associated with those entities in the document. Artificial rules generated by domain experts are applied to these entities and their linguistically associated meanings derived from each document. Rules-...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An automatic document validation system that can be trained to extract domain-specific entities and their linguistically-associated physical, abstract or relational properties, as described within an electronic document. Training of the system can be achieved through the provision of a set of example documents representative of the domain and that have been manually tagged by a domain expert in such a way as to identify the various types of entities and their associated set of recordable properties. Together with a domain-specific vocabulary (e.g.. a dictionary), the trained system is then able to automatically process new documents belonging to the same domain and to test the extracted information on any number of content-conditional rules that have been specified by the domain expert as necessary to confirm the completeness and validity of the new documents.

Description

technical field [0001] The invention relates to a method and system for validating the content of documents, especially technical documents, by extracting information and then comparing the extracted information with a set of rules. Background technique [0002] Currently, most information is transferred from one person to another or from one place to another in the form of electronic documents or files, which are mainly represented in the form of text. There are many forms of text-based electronic documents. These include shorter-form e-mail messages, bulletin messages, news, legal documents, scientific research papers, full-length news magazines or periodicals, and entire tomes or encyclopedias. Within these documents, we can define one of the categories and classify them as technical documents. [0003] Technical documents are defined here as those conforming to a set of generally accepted rules or even rules about specific forms. Simply put, this type of rule can desc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/28
CPCG06F17/2725G06F40/226G06F40/20
Inventor 赖鸿麟陈亚辉
Owner AGENCY FOR SCI TECH & RES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products