Remote supervision relation extraction method and device based on consistency text enhancement

A remote supervision relation extraction technology, applied in neural learning methods, unstructured text data retrieval, text database clustering/classification, etc.

Active Publication Date: 2021-09-14
WUHAN UNIV

AI Technical Summary

Problems solved by technology

[0008] 2) Many methods reduce the weight of noise samples in the training set or filter them out directly, so the useful information contained in these noise samples cannot be fully utilized;
[0009] 3) Perturbations added by methods such as adversarial generation can improve the model's robustness to perturbation, but they usually do not reflect realistic conditions, are unstable, and tend to pull model training off course.

Examples

Embodiment Construction

[0073] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0074] In a first aspect, an embodiment of the present invention provides a remote supervision relation extraction method based on consistency text enhancement.

[0075] In one embodiment, refer to Figure 1, which is a schematic flow chart of an embodiment of the remote supervision relation extraction method based on consistency text enhancement of the present invention. As shown in Figure 1, the method includes:

[0076] Step S10, obtain multiple sentence instances, align each sentence instance to the knowledge base based on the assumption of remote supervision, determine the relationship label corresponding to each sentence instance, and divide the sentence instances with the same entity pair and relationship la...
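The alignment and grouping in Step S10 can be illustrated with a minimal sketch. The knowledge base contents, the `align_and_bag` helper, and the `"NA"` label name are hypothetical and chosen only for illustration; the excerpt does not specify how the knowledge base is stored or queried.

```python
from collections import defaultdict

# Hypothetical toy knowledge base: (head entity, tail entity) -> relation label.
KB = {
    ("Wuhan University", "Wuhan"): "located_in",
}

def align_and_bag(instances, kb, na_label="NA"):
    """Label each (sentence, head, tail) instance via the KB, then group into bags."""
    bags = defaultdict(list)
    for sentence, head, tail in instances:
        # Remote-supervision assumption: a sentence mentioning an entity pair
        # is assumed to express the KB relation recorded for that pair.
        label = kb.get((head, tail), na_label)
        # Instances sharing the same entity pair and label form one bag.
        bags[(head, tail, label)].append(sentence)
    return dict(bags)

instances = [
    ("Wuhan University is in Wuhan.", "Wuhan University", "Wuhan"),
    ("Wuhan University sits near East Lake in Wuhan.", "Wuhan University", "Wuhan"),
    ("Alice visited Paris.", "Alice", "Paris"),
]
bags = align_and_bag(instances, KB)
```

Entity pairs that are absent from the knowledge base fall into a bag with the no-relation label, which is one simple way to obtain NA instances under these assumptions.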

Abstract

The invention provides a remote supervision relation extraction method and device based on consistency text enhancement. The method comprises: dividing a plurality of sentence instances according to entity pairs and relation labels to obtain a plurality of sentence bags; applying different text enhancement methods to each sentence instance in each sentence bag to obtain a strongly enhanced sample and a weakly enhanced sample for each sentence instance; determining the noise samples, and training the relation prediction model with the strongly and weakly enhanced samples of the no-relation (NA) sentence instances and of the noise samples to obtain a trained relation prediction model; and using the trained model to predict the relation label of a sentence bag to be predicted. Consistency text enhancement increases the scale of the data set and strengthens the model's generalization ability, and constraining the model with the NA category and the noise samples lets it learn more supervision information.
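The idea of pairing a weak and a strong enhancement of the same sentence and training for consistent predictions can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the abstract does not name the enhancement methods or the loss, so token dropping, adjacent-token swapping, and a KL-divergence consistency term are stand-ins, and all function names are hypothetical.

```python
import math
import random

def weak_augment(tokens, protected, rng, p_drop=0.1):
    """Light perturbation (assumed): drop a few tokens, never the entity mentions."""
    return [t for t in tokens if t in protected or rng.random() >= p_drop]

def strong_augment(tokens, protected, rng, p_drop=0.3):
    """Heavier perturbation (assumed): drop more tokens, then swap one adjacent pair."""
    out = weak_augment(tokens, protected, rng, p_drop)
    swappable = [i for i in range(len(out) - 1)
                 if out[i] not in protected and out[i + 1] not in protected]
    if swappable:
        i = rng.choice(swappable)
        out[i], out[i + 1] = out[i + 1], out[i]
    return out

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def consistency_loss(weak_logits, strong_logits):
    """KL(p_weak || p_strong): penalises disagreement between the two views."""
    p = softmax(weak_logits)
    q = softmax(strong_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

rng = random.Random(0)
tokens = "Wuhan_University is located in the city of Wuhan".split()
protected = {"Wuhan_University", "Wuhan"}
weak_view = weak_augment(tokens, protected, rng)
strong_view = strong_augment(tokens, protected, rng)
```

In this sketch the loss is zero when the model's predictions on the two views agree and grows as they diverge, so it can supply a training signal for NA instances and noise samples without relying on their possibly wrong labels.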

Description

Technical field

[0001] The invention relates to the field of natural language processing, in particular to a remote supervision relation extraction method and device based on consistency text enhancement.

Background technique

[0002] A great deal of valuable knowledge can be extracted from the massive amount of information on the Internet using information extraction techniques. As an important link in information extraction, Relation Extraction (RE) aims to extract the relationships between entities from text, and provides important support for other natural language applications such as knowledge graph construction, search engines, dialogue generation, question answering, and information retrieval.

[0003] The training of relation extraction models requires a large number of labeled samples to provide supervision information. However, the same relation type may have different text expressions, and at the same time, relations of differ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F40/216, G06N3/08, G06N5/02
CPC: G06F16/35, G06F40/216, G06N3/08, G06N5/02
Inventor: 彭敏, 罗娟, 胡刚, 廖庆文
Owner: WUHAN UNIV