Neural network-based boundary combination named entity recognition method

A technology of named entity recognition and neural network, which is applied in the field of named entity recognition and boundary combination named entity recognition based on neural network, which can solve the problems of unfavorable feature weighting, dependence effect, inability to effectively identify internal nested entities, etc. Sparse problem, the effect of reducing the loss of semantic information

Active Publication Date: 2019-07-19
GUIZHOU UNIV
View PDF3 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Sequence models set tags through each character at the sentence level to obtain the most probable labeling path, but cannot effectively identify internal nested entities; grammatical analysis is identified by using a grammatical analysis tree, but often depends on the effect of grammatical analysis; based on embedding The nested model can better deal with the nesting problem of named entity recognition
However, these methods have four shortcomings: first, they are all in the sentence expansion task, and there is a problem of sparse features; second, in the sequence model, changing the annotations of internal (or external) entities will not be conducive to feature weighting; third, Treating different classes separately will not be able to effectively use tag information; finally, entity recognition is affected to a certain extent by cascading errors caused by word segmentation or grammatical parsing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network-based boundary combination named entity recognition method
  • Neural network-based boundary combination named entity recognition method
  • Neural network-based boundary combination named entity recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0025] Embodiment 1: as attached Figure 1~3 As shown, a neural network-based boundary combination named entity recognition method, the method includes the following steps: Step 1: extract entity boundary information based on the neural network model, and construct a boundary recognition model; Step 2: implement boundary combination strategies, and identify entities Combine the boundaries to obtain candidate entity sets; Step 3: Build a neural network classifier to screen candidate entity sets.

[0026] In step 1, this step is based on the classic BiLSTM-CRF model, combined with BERT pre-training technology, to establish a neural network model for entity boundary information recognition, see attached figure 2 Part (A) in the dotted box in the middle and lower part. The expected result of this step is to obtain accurate entity boundary classification results and perform local persistence, realizing the acquisition of multi-layer nested named entity boundary information.

[0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a neural network-based boundary combination named entity recognition method, which comprises the following steps of 1, extracting the entity boundary information based on a neural network model, and constructing a boundary recognition model; 2, implementing a boundary combination strategy, and combining the entity boundaries to obtain a candidate entity set; and step 3, constructing a neural network classifier, and screening the candidate entity set. According to the method disclosed by the present invention, by employing the boundary combination strategies and introducing the neural network techniques, the characteristic that the neural network automatically extracts the high-dimensional abstract features in a layered manner is fully exerted; by dividing the entityrecognition into three steps of boundary recognition, boundary combination and candidate entity recognition, the defects of a traditional sequence model are overcome, and the problem of feature sparseness generated by a traditional machine learning method is avoided to a certain extent, so that the performance of the nested named entity recognition is improved, and a very good effect is achieved.

Description

technical field [0001] The invention relates to a named entity recognition method, in particular to a neural network-based boundary combination named entity recognition method, belonging to the technical fields of natural language processing and machine learning. Background technique [0002] With the popularity of computers and the rapid development of the Internet, a large amount of information appears in front of people in the form of electronic documents. In order to cope with the severe challenges brought by the information explosion, there is an urgent need for professional automated tools to extract truly valuable information from massive amounts of data, and information extraction has emerged as the times require. Named entities refer to the proper nouns in the text that represent the names of people, places, and organizations. As an important semantic knowledge carrier in the text, named entity recognition plays an important role in information extraction. After it ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/04G06N3/08
CPCG06N3/08G06F40/295G06N3/044G06N3/045
Inventor 陈艳平武乐飞扈应秦永彬
Owner GUIZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products