A Causal Event Extraction Method Based on Self-Training and Noise Model

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A noise model and event extraction technology, applied in neural learning methods, biological neural network models, natural language data processing, etc., can solve problems such as limited effects, and achieve the effect of improving model performance

Active Publication Date: 2021-06-18

HARBIN INST OF TECH

View PDF7 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the present invention is to solve the problem that the existing deep learning model-based causal event extraction method relies on a large amount of labeled data, which leads to its limited effect in fields or scenes with insufficient labeled data, and proposes a method based on self-training and noise model The causal event extraction method of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment approach 1

[0024] Specific implementation mode 1: In this implementation mode, a causal event extraction method based on self-training and noise model, the specific process is as follows:

[0025] Step 1. Collect a small amount of labeled text in the target domain, or label a small amount of unlabeled text in the target domain, and label causal event pairs. When labeling, use the labeling method of the sequence labeling task to mark a word for each word in the text a label, indicating that the word belongs to a causal event, a consequential event, or other constituents;

[0026] Step 2. First use the existing word segmentation tool to segment the marked text in step 1. Use a neural network structure, such as a pre-trained language model based on the self-attention mechanism, to calculate a word in the marked text after word segmentation. vector representation;

[0027] Step 3. Use the conditional random field model to calculate the label sequence with the highest probability from the ve...

specific Embodiment approach 2

[0036] Specific embodiment 2: The difference between this embodiment and specific embodiment 1 is that the tagging method of the sequence tagging task in the step 1 is to use BIO or BIOES and other tagging specifications, such as "money / super issue / result / got / house price / The label "B-cause / I-cause / O / O / B-effect / I-effect / I-effect / I-effect" under the BIO label specification is "B-cause / I-cause / O / O / B-effect / I-effect / I-effect / I-effect", where B-cause means the reason I-cause means the middle of the cause, B-effect means the beginning of the effect, I-effect means the middle of the result, O means other text that does not belong to the cause or effect.

[0037] Other steps and parameters are the same as those in Embodiment 1.

specific Embodiment approach 3

[0038] Specific embodiment three: the difference between this embodiment and specific embodiment one or two is: in the step two, first use the existing word segmentation tool to carry out word segmentation for the marked text in step one, and use a neural network structure, such as based on The pre-trained language model of the self-attention mechanism calculates a vector representation for the words in the labeled text after word segmentation; the specific process is:

[0039] Find the word vector corresponding to each word in the labeled text after word segmentation from the pre-trained word vector matrix, and use the word vector corresponding to each word in the labeled text after word segmentation (the word vector corresponding to each word is A row in the pre-trained vector matrix) is input into a neural network to obtain a vector representation of the fusion context information of each word;

[0040] The neural network is a recurrent neural network, a long short-term mem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A causal event extraction method based on self-training and noise model, the invention relates to a causal event extraction method. The purpose of the present invention is to solve the problem that the existing deep learning model-based causal event extraction method has limited effect in fields or scenes with insufficient labeled data. The process is: 1. Collect the labeled target domain text; 2. Calculate a vector representation; 3. Calculate the label sequence with the highest probability; 4. Train the model in 3 and fine-tune the model in 2; 5. Get A large amount of self-labeled data; 6. Calculate a vector representation for each word, and calculate the probability of each possible label sequence generated by the word sequence; 7. Calculate the noise matrix of each word in the self-labeled text; 8. Get generated by the word sequence The probability of the self-labeled label sequence; 9. Use the labeled data of the first and the self-labeled data of the fifth to jointly train the overall model of the second, third, sixth, and seventh. The invention is used in the field of causal event extraction.

Description

technical field [0001] The invention relates to a method for extracting causal events based on self-training and noise models. Background technique [0002] In recent years, deep learning methods have achieved impressive results on various challenging natural language processing tasks, such as machine translation (Kyunghyun Cho, Bart Van Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language EM NLP ( Danqi Chen, Adam Fisch, Jason Weston, and Antoine Bordes. 2017. Reading wikipedia to answer open-domain questions. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pages 1870–1879.). The deep learning method uses the deep neural network to automatically learn the functional relationship between the input and output data. Compar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06F40/284G06N3/04G06N3/08

CPCG06F40/284G06N3/08G06N3/047G06N3/045

Inventor 丁效刘挺秦兵廖阔

Owner HARBIN INST OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A Causal Event Extraction Method Based on Self-Training and Noise Model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

specific Embodiment approach 1

specific Embodiment approach 2

specific Embodiment approach 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology