Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Boundary Composition Named Entity Recognition Method Based on Neural Network

A technology of named entity recognition and neural network, which is applied in the field of neural network-based named entity recognition and named entity recognition, can solve the problems of dependence effect, unfavorable feature weighting, and feature sparsity, and achieve high performance and prevent feature sparsity. , the effect of reducing the loss of semantic information

Active Publication Date: 2022-03-22
GUIZHOU UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Sequence models set tags through each character at the sentence level to obtain the most probable labeling path, but cannot effectively identify internal nested entities; grammatical analysis is identified by using a grammatical analysis tree, but often depends on the effect of grammatical analysis; based on embedding The nested model can better deal with the nesting problem of named entity recognition
However, these methods have four shortcomings: first, they are all in the sentence expansion task, and there is a problem of sparse features; second, in the sequence model, changing the annotations of internal (or external) entities will not be conducive to feature weighting; third, Treating different classes separately will not be able to effectively use tag information; finally, entity recognition is affected to a certain extent by cascading errors caused by word segmentation or grammatical parsing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Boundary Composition Named Entity Recognition Method Based on Neural Network
  • A Boundary Composition Named Entity Recognition Method Based on Neural Network
  • A Boundary Composition Named Entity Recognition Method Based on Neural Network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0024] Embodiment 1: as attached Figure 1~3 Shown, a kind of boundary combination named entity recognition method based on neural network is characterized in that: described method comprises the following steps:

[0025] Step 1: Construct a double boundary recognition cascade model based on the neural network to obtain the start and end boundaries of the entity;

[0026] Step 2: implement boundary combination, combine entity boundaries, and obtain candidate entity sets through screening;

[0027] Step 3: Construct a multi-segment neural network classifier to screen candidate entity sets.

[0028] In the first step, on the basis of the BiLSTM-CRF model, combined with the BERT pre-training technology, a multi-step cascaded neural network model for entity boundary information identification is established, see the attached figure 2 In part (A), the expected result of this step is to obtain accurate entity boundary classification results and perform local persistence, realizin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a neural network-based boundary combination named entity recognition method, comprising the following steps: step 1: extracting entity boundary information based on a neural network model, and constructing a boundary recognition model; Combine to obtain candidate entity sets; Step 3: Build a neural network classifier to screen candidate entity sets. The method disclosed in the present invention adopts the boundary combination strategy, introduces neural network technology, fully utilizes the characteristics of neural network layered automatic extraction of high-dimensional abstract features, divides entity recognition into three steps of boundary recognition, boundary combination and candidate entity recognition, and makes up for It overcomes the shortcomings of the traditional sequence model, and to a certain extent, avoids the feature sparsity problem caused by the traditional machine learning method, thereby improving the performance of nested named entity recognition and achieving good results.

Description

technical field [0001] The invention relates to a named entity recognition method, in particular to a neural network-based boundary combination named entity recognition method, belonging to the technical fields of natural language processing and machine learning. Background technique [0002] With the popularity of computers and the rapid development of the Internet, a large amount of information appears in front of people in the form of electronic documents. In order to cope with the severe challenges brought by the information explosion, there is an urgent need for professional automated tools to extract truly valuable information from massive amounts of data, and information extraction has emerged as the times require. Named entities refer to the proper nouns in the text that represent the names of people, places, and organizations. As an important semantic knowledge carrier in the text, named entity recognition plays an important role in information extraction. After it ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/295G06N3/04G06N3/08
CPCG06N3/08G06F40/295G06N3/044G06N3/045
Inventor 陈艳平武乐飞扈应秦永彬
Owner GUIZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products