Entity relationship rapid extraction method based on automaton

An entity-relationship and automaton technology, applied in relational databases, structured data retrieval, semantic analysis, etc., can solve the problems of dependencies, slow model decoding, difficult extraction methods, fast and high portability requirements, etc.

Inactive Publication Date: 2016-08-03
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Among the existing entity relationship extraction methods, the rule-based method has good accuracy, but the quality of the rules is directly related to the quality of the entity relationship extraction results, so writing the rules requires expert experience, and the rules generally depend on the field. Therefore rule portability is also a major challenge
Statistics-based

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity relationship rapid extraction method based on automaton
  • Entity relationship rapid extraction method based on automaton
  • Entity relationship rapid extraction method based on automaton

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to make the technical problems, technical solutions and beneficial effects solved by the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0071] The purpose of the present invention is to innovatively implement a customizable open-domain entity relationship fast extraction method based on automata for the problem that the existing technology cannot satisfy the fast and high portability of the entity relationship extraction method. The method has high customizability, and can realize the extraction of the newly customized entity relationship by simply modifying the configuration file, and the method can quickly process data to meet the actual needs of obtaining the entity relationship from a large amount of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an entity relationship rapid extraction method based on an automaton. The entity relationship rapid extraction method comprises the following steps: step 1, customizing a rule file; step 2, performing grammar checking on each rule in the rule file, and detecting whether the rules in the rule file meet grammar requirements or not; if so, executing step 3; step 3, performing semantic interpretation on each rule in the rule file through grammar checking; step 4, analyzing and compiling each rule in the rule file subjected to the semantic interpretation to finish conversion from the rule to a stacked finite-state automaton, so as to obtain the finite-state automaton; step 5, extracting an entity attribute and an entity relationship from input text data by utilizing the finite-state automaton, so as to obtain a final entity attribute and entity relationship. The entity relationship rapid extraction method based on the automaton has the advantages that the rapid extraction of the entity attribute and the entity relationship of open domain texts can be carried out. Meanwhile, the entity relationship of a specific field can be subjected to customized extraction.

Description

technical field [0001] The invention belongs to the technical field of entity relationship extraction in open domain texts, and in particular relates to an automata-based rapid entity relationship extraction method. Background technique [0002] With the explosive development of the Internet, the data on the Internet is increasing exponentially. There is a wealth of information in the huge mass of data. However, the original unstructured or semi-structured web pages, documents, Weibo, multimedia and other formats of data cannot directly provide us with the precise information we want. Therefore, information extraction techniques for open domain texts are becoming more and more important. Named entities are often the carrier of important information contained in text, so the extraction of named entities and entity relationships is an important information extraction technology. For example, using the extracted named entities and entity relationships, a person file system c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/288G06F40/30
Inventor 程工刘春阳庞琳王卿李雄张旭马宏远石瑾毕涛刘玮贺敏陈磊
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products