Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Named entity identification method capable of combining attention mechanism and multi-target cooperative training

A named entity recognition and collaborative training technology, applied in the field of neural network named entity recognition, can solve problems such as flexible adjustment of character features and word feature weights for difficult training data, simple and rude splicing methods, and difficulty in learning the distribution of characters

Active Publication Date: 2018-10-09
SUN YAT SEN UNIV
View PDF8 Cites 76 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although these methods take into account the morphological characteristics of words at the character level, the splicing method is relatively simple and crude, and it is difficult to flexibly adjust the weight between character features and word features according to the training data; in addition, the specific training data distribution is not considered. The influence of weight, only through the backpropagation of the entity recognition task to adjust the weight of the character vector, it is difficult to learn the distribution law between characters

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity identification method capable of combining attention mechanism and multi-target cooperative training
  • Named entity identification method capable of combining attention mechanism and multi-target cooperative training
  • Named entity identification method capable of combining attention mechanism and multi-target cooperative training

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0104] Taking the CONLL2003 data as an example, apply the above method to the text for text naming recognition. The specific parameters and methods are as follows:

[0105] 1. Perform sentence and word segmentation operations on the training data, process the document as a collection of sentences, process each sentence as a collection of single words, and process each word as a collection of single characters. If the training data is Chinese, you need to use Natural language processing tools such as jieba word segmentation;

[0106] 2. Perform statistics on words and labels to obtain vocabulary list W and label table L, perform statistics on characters in the word list to obtain character list C; training data labels contain "PER (person's name)", "LOC (place name)", "ORG (organization)", "MISC (miscellaneous)" four categories, training documents a total of 14987 sentences, 4915 words (in The result after replacing low-frequency words);

[0107] 3. For the word x i A sin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a named entity identification method capable of combining an attention mechanism and multi-target cooperative training. The method comprises the following steps that: (1) carrying out a preprocessing operation on training data, and through character hierarchy mapping, obtaining the character vector representation of a sentence; (2) inputting the character vector representation obtained in (1) into a bidirectional LSTM (Long Short Term Memory) network, and obtaining the character vector representation of each word; (3) through word hierarchy mapping, obtaining the word vector representation of each sentence; (4) through the attention mechanism, splitting the word vector representation obtained in (3) with the character vector representation obtained in (1), and transmitting into the bidirectional LSTM network to obtain the semantic characteristic vector of the sentence; and (5) aiming at the semantic characteristic vector obtained in (4), carrying out entity annotation on each word by a conditional random field, and decoding to obtain an entity tag.

Description

technical field [0001] The present invention relates to the field of neural network named entity recognition methods, and more specifically, relates to a named entity recognition method combined with an attention mechanism and multi-task cooperative training. Background technique [0002] Named Entity Recognition (NER), as the basic work of some complex tasks in the field of natural language processing (NLP) (such as information extraction, question answering system, machine translation), is to find relevant entities from a piece of natural language text, and Mark its location and type. As a research hotspot in the field of NLP, named entity recognition is a challenging task. On the one hand, it is usually difficult for people to obtain a large amount of labeled data for model training; on the other hand, the characteristics of entities are often ever-changing. Therefore, to obtain a model with strong generalization ability, a large number of features are often required. p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/04G06N3/08
CPCG06N3/049G06N3/08G06F40/295
Inventor 卓汉逵付豪
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products