Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Document-level event argument extraction method based on sequence labeling

A sequence tagging and document technology, applied in the field of information extraction of natural language processing, can solve problems such as not considering the influence of semantic information between words, and achieve the effect of improving the effect

Pending Publication Date: 2021-11-02
CHONGQING UNIV OF POSTS & TELECOMM
View PDF1 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the traditional event extraction tasks are based on the sentence level, which has obvious defects: an event will involve trigger words and multiple arguments. Usually, very few event trigger words and their related arguments appear in the ideal situation in the same sentence
In addition, previous studies have shown that capturing the semantics of long-sequence text requires an understanding of sentence semantics and inter-sentence semantics, but previous studies have not considered the impact of inter-word semantic information on document-level event argument extraction tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document-level event argument extraction method based on sequence labeling
  • Document-level event argument extraction method based on sequence labeling
  • Document-level event argument extraction method based on sequence labeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0066] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.

[0067] The technical scheme that the present invention solves the problems of the technologies described above is:

[0068] The document-level event argument extraction task is to identify the event arguments and the corresponding event roles in the document according to the predefined event types and their corresponding role types.

[0069] The document-level event argument extraction method based on sequence labeling proposed by the present invention adopts the commonly used document-level event argument extraction data set MUC-4 in the implementation stage. The present invention adopts the same data division standard, that is, 1300 documents are used as the training set, 200 documents (TST1+TST2) are u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention requests to protect a document-level event argument extraction method based on sequence labeling, which comprises the following steps of obtaining Wikipedia priori knowledge related to a corpus entity, and generating a word span entity semantic enhancement embedding representation; splicing the word span entity semantic enhancement embedded representation with a context representation obtained by a pre-training language model to obtain word vector input of an embedded layer; inputting the word representation into a multi-span bidirectional recurrent neural network to obtain a multi-span context feature representation of the word; inputting the multi-span context feature representation into a context attention mechanism module and a gated attention mechanism module, and obtaining a context semantic fusion feature representation of the word; and finally, carrying out event argument extraction on the output feature representation by adopting sequence labeling, and carrying out event argument extraction on an unknown document by utilizing an optimal model obtained by training. According to the method, the extraction effect of the document-level event argument is effectively improved by integrating priori knowledge and multi-span upper and lower semantic feature representation.

Description

technical field [0001] The invention belongs to the field of information extraction of natural language processing, and proposes a document-level event argument extraction model based on sequence annotation. The model can provide basic services for tasks such as knowledge graph construction, relation extraction, information retrieval, and automatic question answering. Background technique [0002] The constituent elements of an event include trigger words, event types, arguments, and argument roles. The goal of event extraction is to automatically obtain the above information from unstructured information and display it in a structured manner. However, most of the traditional event extraction tasks are based on the sentence level, which has obvious defects: an event will involve trigger words and multiple arguments. Usually, very few event trigger words and their related arguments appear in the Ideal situation in the same sentence. Therefore, it is more conducive to the p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06F40/211G06N3/04
CPCG06F40/30G06F40/211G06N3/044G06N3/045
Inventor 邱东王海霞
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products