Cascade linear entity relationship extraction method for social text

A technology of entity relationship and social text, applied in the field of cascading linear entity relationship extraction, can solve the problems of error propagation, ignore potential interactive information, etc., and achieve the effect of improving evaluation indicators

Pending Publication Date: 2022-04-05
NORTHEASTERN UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing entity relationship extraction methods mainly include the pipeline method and the joint extraction method. The traditional pipeline method divides the extraction into two independent entity recognition and relationship classification subtasks. This method ignores the potential interactive information between the two subtasks. Can suffer from error propagation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cascade linear entity relationship extraction method for social text
  • Cascade linear entity relationship extraction method for social text
  • Cascade linear entity relationship extraction method for social text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0041] Such as figure 1 As shown, the social text-oriented cascaded linear entity relationship extraction method in this embodiment is as follows.

[0042] The public entity relationship extraction datasets used in this embodiment are NYT and WebNLG datasets.

[0043] Step 1: Obtain public entity relationship extraction datasets with overlapping entity relationships to simulate social text with overlapping entity relationships, and preprocess the obtained public datasets;

[0044] The method for preprocessing the public dataset is as follows:

[0045]Step 1.1: Use the BertTokenizer in the pre-trained model BERT to segment each sentence in the data set, and convert the w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a social text-oriented cascade linear entity relationship extraction method, which adopts a cascade linear extraction mode, that is, the method firstly detects relationships contained in a given text, and then takes each relationship as additional knowledge to guide the subsequent extraction process of a head entity and a tail entity. According to the method, two decoders, namely a relation decoder and an entity decoder, are further designed, and the two decoders are jointly used for extracting the entity relation triad. By means of the method, the extraction accuracy of entity pairs (the head entity and the tail entity) and the performance of combined extraction can be improved, the overlapping problem can be naturally solved through a relation-first cascade extraction method, and then a more accurate premise can be provided for construction of the knowledge graph.

Description

technical field [0001] The invention relates to the technical field of natural language processing information extraction, in particular to a social text-oriented cascading linear entity relationship extraction method. Background technique [0002] Knowledge graph is a structured semantic knowledge base, which is intended to describe concepts, entities and their relationships in the objective world in a structured form, and express Internet information into a form that is easy for humans to recognize. The basic unit of a knowledge map is an entity-relationship triple, which is shaped like (head entity, relation, tail entity). Therefore, entity-relationship extraction is a key technology for building a knowledge map. [0003] Entity-relationship extraction is to extract entities and their semantic relations from unstructured text, that is, given a natural language text, its task is to extract entity-relationship triples in the text. The existing entity relationship extractio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F16/28G06N3/04G06N3/08
Inventor 马连博任慧敏王兴伟黄敏
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products