Named entity recognition method based on deep learning

A named entity recognition and deep learning technology, applied in character and pattern recognition, semantic analysis, instrumentation, etc., can solve problems such as inaccurate entity classification, difficulty in identifying nested entities, etc., and achieve the effect of improving the collapse problem and improving the accuracy rate

Pending Publication Date: 2021-12-31
XIAN UNIV OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a named entity recognition method based on deep learning, which solves the problem in the prior art that it is difficult to identify nested entities by using the entity relationship extraction method, resulting in inaccurate entity classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity recognition method based on deep learning
  • Named entity recognition method based on deep learning
  • Named entity recognition method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0050] refer to Figure 4 , the identification method of the present invention takes "the bank formulates business behavior norms for employees" as an example.

[0051] Step 1, build a dictionary of synonyms D, in which the synonym of the word "behavior" is "action";

[0052] Step 2, get entity class set E={"organization", "name", "address", "company", "government", "book", "game", "movie", "position", "scene"} ;

[0053] Step 3: Perform comparative learning pre-training on the BERT model that has completed domain pre-training, and obtain the pre-trained Pre_Train_BERT. The specific process of using positive samples and negative samples for training can refer to figure 2 and image 3 ;

[0054] Step 4, perform fine-tuning training on the named entity recognition task for the deep learning model used, and obtain the Fine_Tuning_BERT encoder and the trained softmax classifier;

[0055] Step 5, take the sentence "the bank formulates a code of business conduct for employees"...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a named entity recognition method based on deep learning. The named entity recognition method comprises the following steps: 1) constructing a one-to-one synonym dictionary; 2) selecting a data set, and defining an entity class set; 3) performing comparative learning pre-training on a BERT model; 4) performing fine tuning training on the named entity recognition task to obtain an encoder and a classifier; 5) collectively referring the rest sentences to be processed in the test set as sentences S1; 6) inputting the sentence S1 into the encoder to obtain a word embedding vector set and a sentence vector u; 7) selecting a text segment in the sentence S1 based on the span, and constructing a word embedding vector of the text segment; (8) replacing the text segments selected in the sentence S1 in the step (7) with a synonym dictionary to obtain a sentence S2; 9) processing the sentence S2 by using the encoder to obtain a sentence vector v; and 10) calculating span_em, and performing classification through the classifier to obtain an entity set C. The method is high in recognition accuracy.

Description

technical field [0001] The invention belongs to the technical field of computer natural language processing, and relates to a named entity recognition method based on deep learning. Background technique [0002] The purpose of named entity recognition is to identify entities with a certain meaning, category or part of speech in the text, such as names, countries, emotional words, subject proper nouns, phone numbers, etc. With the rise of big data technology, named entity recognition has been more and more widely used in knowledge graphs, data analysis, intelligent data processing and other fields. [0003] Early named entity recognition methods were generally based on rules. After the gradual progress of deep learning, many named entity recognition methods based on deep learning have emerged in recent years. However, the current named entity recognition method based on deep learning is difficult to recognize fuzzy entities such as nested entities, and the accuracy rate need...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F40/247G06F40/242G06F40/30G06F16/35G06K9/62
CPCG06F40/295G06F40/247G06F40/242G06F40/30G06F16/35G06F18/2415G06F18/214
Inventor 黑新宏李育璠朱磊王一川姬文江彭伟董林靖
Owner XIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products