Chinese structured event extraction method

A technology of event extraction and structuring, which is applied in the fields of instruments, digital data processing, and computing, to achieve the effects of high precision and recall, clear layers, and strong practicability.

Pending Publication Date: 2021-01-05
万齐智 +3
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In order to overcome the shortcomings of the current Chinese structured event extraction model, the present invention proposes a Chinese structured event extraction method based on syntax and semantic dependency analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese structured event extraction method
  • Chinese structured event extraction method
  • Chinese structured event extraction method

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0068] Example 1. "Shougang Holdings purchased about 40.78 shares." Its syntactic dependency analysis results and syntactic dependency analysis tree are as follows: figure 2 shown. The relationship between "purchase" and the parent node Root is HED, which is the core word of this sentence, and the edges between the nodes represent the syntactic dependency.

[0069] Semantic Dependency Parsing (Semantic Dependency Parsing) is used to describe the semantic dependency relationship between words, and there is a certain relationship with semantic role labeling. Semantic role labeling only focuses on the relationship between sentence predicates and their main arguments, while semantic dependency analysis not only focuses on predicates and arguments, but also on predicates and predicates, arguments and arguments, and semantic relationships within arguments. The description of is more complete and comprehensive, which belongs to deep semantic analysis.

example 2

[0070] Example 2. "The prices of fruit sources are highly differentiated." Its SDP tree such as image 3 shown. Among them, the semantic dependency relationship between the Root node and the "differentiation" node is Root.

[0071] 2. Recognition scheme for events contained in sentences

[0072] In Chinese linguistics, parallel predicates should have the same status or properties in the theory of syntactic structure. Therefore, when analyzing the syntactic dependency structure of sentences, they should be related through a certain agreed parallel symbol. LTP tools can realize The COO symbol is used in the process of syntactic dependency analysis.

example 3

[0073] Example 3. "The prices of fruit sources are highly differentiated, and Apple futures have increased positions." Figure 4 Take the DP tree of Example 3. Among them, there are 3 events ET 1 (fruit source price, differentiated,), ET 2 (Apple futures, Masukura,) and ET 3 (Apple futures, up, ). For example 3, Figure 4 Only one statement is given in which the core verbs "Differentiation" and "Masukura" are used as ET 2 The core verb of ET is 1 The child node of the core verb "differentiate", and the syntactic dependency is COO, and ET 3 The core verb "rising" is also used as ET 2 "Massacura" child node.

[0074] Through the analysis of the syntactic dependency analysis tree, three clues are found: ①The predicate of the event is generally acted by a verb; ②The event predicate in a sentence is a parent-child node and remains continuous, such as "differentiation"→"Masukura"→"rise" ; ③ The edge of the parent-child node of the event predicate is COO. According to these...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Chinese structured event extraction method, and belongs to the technical field of information extraction. The Chinese structured event extraction method comprises the following steps of: performing syntactic dependency structure analysis on an unstructured text statement by using a syntactic dependency analysis tool to obtain a syntactic dependency analysis tree; analyzing the characteristics of the Chinese linguistic and syntactic dependency analysis tree, constructing a core verb chain, and identifying all events existing in the statement; adding a semantic dependency relationship to the syntactic dependency analysis tree by means of a semantic dependency analysis tool, and constructing a syntactic semantic dependency analysis tree; adjusting dependency structures of event core verbs, prepositions and passive language states in the syntactic semantic dependency analysis tree, and constructing a syntactic semantic dependency analysis event graph. According to the method, data does not need to be manually annotated, the structured event can be well extracted, and the extraction accuracy and recall rate are high.

Description

technical field [0001] The invention belongs to the technical field of information extraction, in particular to the technical field of event extraction, and relates to a Chinese structured event extraction method. Background technique [0002] With the rapid development of the network, a large amount of unstructured text data is generated every day. How to extract valuable and meaningful structured information from unstructured text data according to specific application requirements is of great significance. As a subtask of information extraction, event extraction has great application prospects. Taking the field of finance and economics as an example, investors and listed companies are more interested in stock market trends. Trend forecasting can provide strong support for market analysis and decision-making, and the extracted events can help forecasting. Event extraction is mainly to extract all the events contained in this article. In the field of finance and economics...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/211G06F40/30
CPCG06F40/211G06F40/30
Inventor 万齐智万常选胡蓉刘德喜
Owner 万齐智
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products