Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Fast Relation Extraction Method Based on Convolutional Neural Networks and Improved Cascade Labeling

A convolutional neural network and relation extraction technology, applied in the field of fast relation extraction based on convolutional neural networks and improved cascade annotation, can solve the problems of slow model training and prediction, achieve good generalization and stability, good Application prospects and promotion scope, the effect of alleviating relationship redundancy

Active Publication Date: 2022-07-26
SOUTHEAST UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, a text encoder based on a deep neural network composed of dilated convolutions, gating units, and residual connections encodes the initial text to obtain a text encoding representation with rich contextual semantics, thus solving the problem of using a pre-trained model in relation extraction. As an encoder, the efficiency bottleneck problem that leads to slow model training and prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Fast Relation Extraction Method Based on Convolutional Neural Networks and Improved Cascade Labeling
  • A Fast Relation Extraction Method Based on Convolutional Neural Networks and Improved Cascade Labeling
  • A Fast Relation Extraction Method Based on Convolutional Neural Networks and Improved Cascade Labeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0116] As a specific embodiment of the present invention, the present invention provides training and prediction of a fast relation extraction method based on convolutional neural network and improved cascade annotation as shown in the following table for the training configuration flow table.

[0117]

[0118] The fast relation extraction model is used to train and predict relation extraction tasks on real Chinese and English data sets, and all other embodiments use the same data as this embodiment. Among them, the Chinese data set uses the DuIE relation extraction training set released by Baidu, which contains 11958 training data, 1498 verification data and 1489 test data, and defines 48 different relationship types; the English data set uses the New York Times corpus and The NYT relation extraction dataset obtained by Freebase's remote supervision contains 70,339 training data and 4,006 test data, and defines 29 different relation types. Due to the good robustness and ge...

Embodiment 2

[0121] The fast relation extraction model based on convolutional neural network and improved cascade annotation has fast reasoning ability. First of all, the model has less parameters, which is about 1% of the parameters of other relation extraction models at present, so the model requires calculation The time and computing resources are greatly reduced; when extracting relational triples in real-world natural text, the model takes less than 5ms on average to process a natural text, and the speed is 7-15 times faster than other methods.

[0122] In practical applications, it is often required that the model can process many natural texts at a time to ensure the parallelism of model operations. In the case of limited computing resources, the traditional relation extraction method using pre-trained language model as an encoder can only process and extract less than 8 or even 4 natural texts at a time, while the fast relation extraction model is limited by the amount of parameters...

Embodiment 3

[0124] The fast relation extraction model based on convolutional neural network and improved cascade annotation is easy to train and takes less time to train. Compared with other relation extraction methods, it has shorter turnaround time for single parameter update and less total parameters Update turnaround times so less total training time is required. For example, when the amount of training data is about 70,000, the batch size is set to 32, and the model is trained for 60 epochs, and it only takes about 100 minutes in the end, which is 3 to 10 times faster than other methods.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A fast relation extraction method based on convolutional neural network and improved cascade annotation. First, a text encoder based on a deep neural network consisting of dilated convolutions, gating units, and residual connections encodes the initial text to obtain a text-encoded representation with rich contextual semantics. Then, according to the obtained text encoding, using the improved cascade annotation, the head entity tagger annotates the spans of all head entities and their corresponding entity types. Next, through the text encoding representation and the feature representation of the head entity, the tail entity tagger annotates all tail entities corresponding to each head entity. Finally, it is verified by a real-world relation extraction task. The invention has the advantages of fast training and prediction, and can meet the requirements of relation extraction scenarios oriented to massive texts.

Description

technical field [0001] The invention belongs to the field of artificial intelligence natural language processing, in particular to a fast relation extraction method based on a convolutional neural network and an improved cascade annotation. Background technique [0002] Natural language processing technology is one of the core technologies of cognitive intelligence. It is the knowledge base to support artificial intelligence application scenarios such as semantic search, intelligent question answering, and auxiliary big data analysis. The goal of relation extraction is to determine two entities in natural text. The relationship between. Recent research on relation extraction has made significant progress, however, in real application scenarios, relation extraction models need to maintain high demands on both speed and performance. For example, the investment decisions of investors in the financial sector rely on knowledge graphs consisting of relational triples extracted fr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/279G06F40/126G06N3/04G06N3/08
CPCG06F40/279G06F40/126G06N3/08G06N3/045
Inventor 汪鹏李国正
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products