Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese named entity recognition method suitable for multiple fields

A named entity recognition, multi-domain technology, applied in the field of Chinese named entity recognition, can solve the problems of expensive labeling, performance degradation, spending a lot of time retraining, etc., to improve the recognition effect, with generalization ability and robustness. Effect

Pending Publication Date: 2022-02-15
CHONGQING UNIV OF POSTS & TELECOMM
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current problems are: single-domain named entity recognition requires a large amount of labeled data, and most fields require professional labeling, which is expensive; when the domain transfer occurs in the training set and test set, the performance will drop significantly; in order to get usable The effect takes a lot of time to retrain the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese named entity recognition method suitable for multiple fields
  • Chinese named entity recognition method suitable for multiple fields
  • Chinese named entity recognition method suitable for multiple fields

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0047]A Chinese named entity recognition method applicable to multiple fields, the method includes acquiring entity data to be recognized; inputting the entity data to be recognized into a Chinese named entity recognition model, obtaining recognition results, and marking the recognition results.

[0048] The process of training the Chinese named entity recognition model includes:

[0049] S1: Obtain the original Chinese named entity dataset, and perform domain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of named entity recognition, and particularly relates to a Chinese named entity recognition method suitable for multiple fields. The method comprises the following steps: carrying out specific field classification on a Chinese named entity data set; adopting a sample learning method to sample the data after field classification to obtain a data set, and inputting the data set into a shared code presentation layer of the model; obtaining the probability distribution of the field to which the data belongs through a domain classifier, enabling the expert layer of each field to extract the unique characteristics of the field, enabling the public expert layer to synthesize the characteristics of experts of each field according to the probability distribution of the field to which the public expert layer belongs, and inputting the characteristics extracted by each expert layer into the corresponding CRF layer to obtain an entity recognition result. According to the method, the multi-task learning technology is applied to the field of Chinese named entity recognition, data of different domains are independently regarded as a training task, and a specific multi-expert model structure is designed to extract unique features and common features of the domains, so the different domains assist one another, and the recognition effect is improved.

Description

technical field [0001] The invention belongs to the fields of deep learning, transfer learning, natural language processing, and named entity recognition, and specifically relates to a Chinese named entity recognition method applicable to multiple fields. Background technique [0002] Named entity recognition technology is a key technology in the field of natural language processing and the basis of other natural language processing applications. It aims to extract entity fragments that people care about from text, such as person names, organization names, place names, etc. At present, Chinese named entity recognition for a single domain has achieved good performance. [0003] With the in-depth application of natural language processing technology and the development of various industries in society. There are also more and more types of texts, such as radio conversations, TV news, Internet blogs, etc. The named entities defined in different domains are also different. Ho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06K9/62G06N3/04G06N3/08
CPCG06F40/295G06N3/08G06N3/047G06N3/048G06F18/24
Inventor 王进林兴王猛旗何晓莲陈乔松杜雨露胡珂
Owner CHONGQING UNIV OF POSTS & TELECOMM