Supercharge Your Innovation With Domain-Expert AI Agents!

End-to-end text event extraction method and system based on deep learning

A deep learning and event extraction technology, applied in computer parts, instruments, unstructured text data retrieval, etc., can solve problems such as wasted computing resources, event loss, and inability to achieve commercial use, and achieve improved event extraction performance. Effect

Pending Publication Date: 2022-05-10
CHENGDU FOURIER ELECTRONICS TECH +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Shun Zheng et al in the document Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction, and Hang Yang et al in the document Document-level Event Extraction via Parallel Prediction Networks.Proceedings of the 59 th Annual Meeting of the Association for Computational Linguistics and the 11 th In the International Joint Conference on Natural Language Processing, all attempts are made to extract chapter events. The events of the former are decoded by way of path extension, and event extraction cannot be performed in parallel. The latter uses a parallel multi-event decoder, which needs to pre-specify the events to be extracted. The number of events is likely to cause problems such as event loss and waste of computing resources. At the same time, the performance of the current chapter event extraction method is low, and the F1 value is lower than 80%, which cannot reach the level of commercial use.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • End-to-end text event extraction method and system based on deep learning
  • End-to-end text event extraction method and system based on deep learning
  • End-to-end text event extraction method and system based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] Such as Figure 1 ~ Figure 2 As shown, an end-to-end text event extraction method based on deep learning is provided, including:

[0060] S10, Candidate Entity Recognition: Each element of an event is composed of entities in the document, such as time, location, person name, action, etc. in the document. In this embodiment, the first Transformer model is used to encode each sentence of the document, and the candidate entities are identified within the scope of the sentence, and word embedding tensor representations of multiple candidate entities are obtained.

[0061] S20, document-level encoding: S21, using the Attention mechanism to combine multiple word embedding tensors of the same type into an embedding tensor of a candidate entity, and form a candidate entity, and there are multiple word embedding tensors in total and corresponding to multiple Candidate entities; S22, add event role information to each obtained candidate entity, and form multiple candidate entity...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An end-to-end chapter event extraction method and system based on deep learning, the method comprising: candidate entity identification, document level coding, candidate entity relationship coding and event decode.The end-to-end chapter event extraction technology proposed by the invention can effectively solve the problem of event element dispersion in a chapter document, and improve the extraction efficiency. Meanwhile, multi-event prediction can be flexibly performed in parallel, candidate word embedding is performed by adopting an attention mechanism, word embedding of [CLS] sentence header identification is used as sentence embedding, semantic representation of word embedding and sentence embedding is well stored, and the final event extraction performance is improved.

Description

technical field [0001] The invention relates to the field of natural language processing in computer data processing technology, in particular to an end-to-end text event extraction method and system based on deep learning. Background technique [0002] With the continuous development of modern information technology, various industries have accumulated a large amount of information data, and the world has entered the era of big data. Accurately obtaining useful information from massive data in real time is of great significance in reality. Event extraction aims to extract structured events from unstructured text. It is widely used in intelligence work in commercial and military fields. Event extraction technology makes it possible to automatically extract structured event information from massive data. Extracting the events and their elements mentioned in the plain text faces two specific challenges. One is that the elements of an event are scattered, and the elements that...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/211G06F40/279G06F16/35G06F16/33G06K9/62
CPCG06F40/211G06F40/279G06F16/3344G06F16/353G06F18/24323G06F18/214
Inventor 高小童吴施楷杜红林
Owner CHENGDU FOURIER ELECTRONICS TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More