Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Fine-grained named entity recognition method based on pre-training coding feature hierarchical processing

A technology for named entity recognition and encoding features, applied in neural learning methods, instruments, biological neural network models, etc., can solve problems such as non-transferability, dependence, inability to represent words or word polysemy, etc.

Pending Publication Date: 2021-08-27
ARMY ENG UNIV OF PLA +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The named entity recognition method of traditional machine learning, which formalizes the named entity recognition task into a sequence labeling task, predicts the label of each word or word, and jointly predicts the entity boundary and entity type, which is heavily dependent on the dictionary library and cannot be recognized. Disadvantages such as entity nesting, inability to characterize words or polysemy of words, and lack of migration

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fine-grained named entity recognition method based on pre-training coding feature hierarchical processing
  • Fine-grained named entity recognition method based on pre-training coding feature hierarchical processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] In order to clearly explain the present invention and make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the drawings in the embodiments of the present invention, in order to Those skilled in the art can implement it by referring to the text of the description. The technology of the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0019] This embodiment of the present invention proposes a fine-grained named entity recognition method based on layered processing of pre-trained coding features. The structure diagram is as follows figure 2 As shown, the specific implementation steps are as follows:

[0020] A. Obtain the training corpus data set THUCNEWS of the Chinese language model, and perform preprocessing, and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a fine-grained named entity recognition method based on pre-training coding feature hierarchical processing, and relates to the technical field of natural language processing, and the method comprises the steps: obtaining a Chinese pre-training coding vector through employing a pre-training model ALBERT, then carrying out the hierarchical extraction of useful bidirectional context features in the pre-training features, and fusing the features. And then the objective function is accurately optimized by utilizing the advantages of optimal solution solving of a supervised learning technology and a conditional random field CRF, so that model parameters are automatically learned to improve the recognition effect of the model on a to-be-recognized data set. Experiments show that by adopting the method of the invention, the problem of fine-grained named entity recognition can be effectively solved, and a better recognition effect is achieved.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, specifically to the technical field of named entity recognition, and more specifically to a fine-grained named entity recognition method based on layered processing of pre-trained coding features. Background technique [0002] BERT is a large-scale pre-trained language model based on Transformers encoder. In recent years, BERT has demonstrated strong strength in many downstream tasks. Named Entity Recognition (NER) is a basic task in natural language processing with a wide range of applications. [0003] The named entity recognition method of traditional machine learning, which formalizes the named entity recognition task into a sequence labeling task, predicts the label of each word or word, and jointly predicts the entity boundary and entity type, which is heavily dependent on the dictionary library and cannot be recognized. There are shortcomings such as entity ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06N3/04G06N3/08
CPCG06F40/295G06N3/08G06N3/044G06N3/045
Inventor 郝文宁靳大尉陈刚郝建东郑志明邱望洁吴发国袁波
Owner ARMY ENG UNIV OF PLA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products