Event extraction method and system fusing dependency information and pre-trained language model

An event extraction and language model technology, applied in the Internet field, that addresses the limitations of existing methods, the inadequate modeling of dependency features, and insufficient extraction performance.

Active Publication Date: 2020-11-06
INST OF COMPUTING TECH CHINESE ACAD OF SCI
4 cites; 30 cited by

AI Technical Summary

Problems solved by technology

Some existing methods use external knowledge bases or corpus resources as a supplement and apply weak supervision to expand the training data, but such methods are constrained by hand-crafted rules and assumptions. Although the expanded data is large in scale, it improves the performance of the extraction model only marginally.
[0007] To sum up, the main defect of the existing technology is that dependency features and labeled data cannot be modeled well, which leads to insufficient extraction performance.

Examples

Embodiment Construction

[0064] A Chinese event extraction method that integrates dependency information and a pre-trained language model comprises the following steps: 1) preprocessing the training corpus; 2) pre-encoding with the BERT pre-trained language model; 3) learning dependency syntax features with a graph convolutional neural network; 4) dependency relation prediction; 5) trigger word extraction; 6) argument extraction. The steps of the method proposed by the present invention are described below:

[0065] 1) Training corpus preprocessing. The training corpus used in the present invention is taken from the ACE 2005 Chinese data set. Processing includes sentence segmentation, word segmentation, extraction of the annotated entities, and sentence-level dependency parsing, after which the trigger words are converted into the BIO labeling format;

[0066] 2) Pre-encoding with the BERT pre-trained language model. This step takes the word sequence of the sentence as input. After using the ...
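The conversion of trigger words into the BIO labeling format mentioned in step 1) can be illustrated with a minimal sketch. The function name `triggers_to_bio`, the span convention (token indices, end exclusive), the example sentence, and the event type label are assumptions made for illustration; the source does not describe the actual implementation.

```python
from typing import List, Tuple


def triggers_to_bio(
    tokens: List[str],
    triggers: List[Tuple[int, int, str]],
) -> List[str]:
    """Convert trigger spans into one BIO label per token.

    Each trigger is (start, end, event_type), where start/end are token
    indices and end is exclusive. This span convention is an assumption.
    """
    labels = ["O"] * len(tokens)
    for start, end, event_type in triggers:
        if not (0 <= start < end <= len(tokens)):
            raise ValueError(f"invalid trigger span: ({start}, {end})")
        labels[start] = f"B-{event_type}"      # first token of the trigger
        for i in range(start + 1, end):
            labels[i] = f"I-{event_type}"      # remaining trigger tokens
    return labels


# Illustrative segmented sentence: "The company acquired a factory yesterday."
tokens = ["公司", "昨天", "收购", "了", "一家", "工厂"]
labels = triggers_to_bio(tokens, [(2, 3, "Transfer-Ownership")])
```

Tokens outside any trigger span keep the label `O`, so trigger word extraction can then be treated as a sequence labeling task over these labels.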

Abstract

The invention provides an event extraction method and system fusing dependency information and a pre-trained language model. The method takes the dependency syntax tree of a sentence as input, learns dependency syntax features with a graph convolutional neural network, adds a dependency relation prediction task so that the more important dependency relations are captured through multi-task learning, and finally enhances the underlying syntactic representation with a BERT pre-trained language model to complete event extraction for Chinese sentences. The performance of trigger word extraction and argument extraction in the event extraction task is thereby improved.
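To make the idea of learning dependency syntax features with a graph convolutional network concrete, the following is a minimal sketch of a single graph convolution layer over a dependency tree, written in plain Python. The undirected edges, self-loops, row normalization, and ReLU activation are common choices assumed here, not details confirmed by the source; all function names and the toy inputs are hypothetical. In the described method the node features would come from BERT rather than from hand-written vectors.

```python
from typing import List

Matrix = List[List[float]]


def dependency_adjacency(heads: List[int]) -> Matrix:
    """Build an adjacency matrix from head indices (-1 marks the root).

    Edges are treated as undirected and self-loops are added (assumptions).
    """
    n = len(heads)
    adj = [[0.0] * n for _ in range(n)]
    for i, head in enumerate(heads):
        adj[i][i] = 1.0                 # self-loop keeps the token's own features
        if head >= 0:
            adj[i][head] = 1.0
            adj[head][i] = 1.0
    return adj


def row_normalize(adj: Matrix) -> Matrix:
    """Divide each row by its sum so that neighbours are averaged."""
    return [[v / sum(row) for v in row] for row in adj]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gcn_layer(features: Matrix, adj_norm: Matrix, weight: Matrix) -> Matrix:
    """One layer: ReLU(A_norm @ H @ W)."""
    projected = matmul(matmul(adj_norm, features), weight)
    return [[max(0.0, v) for v in row] for row in projected]


# Toy sentence of three tokens; token 1 is the root, tokens 0 and 2 depend on it.
heads = [1, -1, 1]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # stand-in for BERT outputs
weight = [[1.0, 0.0], [0.0, 1.0]]                 # identity, for readability
hidden = gcn_layer(features, row_normalize(dependency_adjacency(heads)), weight)
```

After one layer, each token's vector is the average of its own features and those of its syntactic neighbours, which is how dependency structure enters the representation.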

Description

Technical field

[0001] The invention relates to the field of Internet technology, in particular to a method and system for extracting Chinese events that can be used in the fields of knowledge graphs and information extraction.

Background

[0002] Events, as a structured representation of information, refer to actual happenings involving certain participants. As a special class of information extraction tasks, the goal of event extraction is to extract instances of predefined event types from a given text. An event generally consists of two parts: a trigger word (Trigger) and an argument (Argument). The trigger word is the word in the text that most clearly expresses the occurrence of the event, and is generally the core verb of the sentence in which the event occurs; an argument is an entity that is related to the event and plays a role in it. Generally speaking, event extraction can be divided into two tasks: trigger word extraction and argument extract...
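The structure of an event described above, a trigger word plus arguments that play roles, can be sketched as a small data structure. The class and field names, the example sentence, and the type and role labels are illustrative assumptions and do not come from the source.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Argument:
    text: str    # entity mention, e.g. "公司"
    role: str    # role the entity plays in the event
    start: int   # token index where the mention starts
    end: int     # token index where the mention ends (exclusive)


@dataclass
class Event:
    event_type: str       # one of the predefined event types
    trigger: str          # word that most clearly expresses the event
    trigger_start: int
    trigger_end: int
    arguments: List[Argument] = field(default_factory=list)


# Illustrative segmented sentence: "The company acquired a factory yesterday."
tokens = ["公司", "昨天", "收购", "了", "一家", "工厂"]
event = Event(
    event_type="Transfer-Ownership",
    trigger=tokens[2],
    trigger_start=2,
    trigger_end=3,
    arguments=[
        Argument(text=tokens[0], role="Buyer", start=0, end=1),
        Argument(text=tokens[5], role="Artifact", start=5, end=6),
    ],
)
```

Under this view, trigger word extraction fills in `trigger` and `event_type`, and argument extraction fills in the `arguments` list.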

Application Information

IPC: IPC(8): G06F16/31, G06F16/35, G06F16/36, G06F40/211, G06F40/289, G06F40/295, G06N3/04
CPC: G06F16/313, G06F16/353, G06F16/367, G06F40/289, G06F40/295, G06F40/211, G06N3/045
Inventor: 靳小龙, 郭嘉丰, 程学旗, 延浩然, 官赛萍, 范意兴, 席鹏弼
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI