Named entity recognition model training method and device

A technology for named entity recognition and entity recognition, which is applied in the computer field and can solve the problems of inability to obtain training data, high cost, and large human and material resources.

Pending Publication Date: 2021-05-18
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF0 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] NER refers to the extraction of entities with specific meaning or strong referentiality from the input text. At present, the main methods used are entity recognition methods based on rules and dictionaries, machine learning methods and deep learning methods. In some specific fields ( Such as finance, medical care, military, government affairs, etc.), especially in some emerging fields, it is often impossible to obtain a large amount of labeled training data, so that the named entity recognition model cannot be directly trained. The way of labeling will cost a lot of manpower and material resources, and the cost is high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity recognition model training method and device
  • Named entity recognition model training method and device
  • Named entity recognition model training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0042] Terms used in one or more embodiments of the present application are for the purpose of describing specific embodiments only, and are not intended to limit the one or more embodiments of the present application. As used in one or more embodiments of this application and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a named entity recognition model training method and device. The named entity recognition model training method comprises the steps of obtaining labeled training data and unlabeled training data; training a target named entity recognition model according to the labeled training data; inputting the unlabeled training data into the target named entity recognition model to obtain at least one entity word output by the target named entity recognition model and a confidence score corresponding to each entity word; determining a target entity word according to the confidence score corresponding to each entity word, and labeling the unlabeled training data according to the target entity word to generate newly added labeled training data; training the target named entity recognition model contineusly according to the newly added labeled training data. According to the method, through a weak supervised learning mode, the number of the labeled training data is increased, model training overfitting is effectively prevented, and meanwhile the cost of manually labeling the labeled training data is reduced.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a named entity recognition model training method and device, a named entity recognition method and device, computing equipment, and a computer-readable storage medium. Background technique [0002] Named entity recognition (NER) is a basic task in natural language processing, which has a wide range of applications in many scenarios such as text information understanding, knowledge question answering, retrieval, and graph construction. [0003] NER refers to the extraction of entities with specific meaning or strong referentiality from the input text. At present, the main methods used are entity recognition methods based on rules and dictionaries, machine learning methods and deep learning methods. In some specific fields ( Such as finance, medical care, military, government affairs, etc.), especially in some emerging fields, it is often impossible to obtain a large am...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06N3/08
CPCG06N3/08G06F40/295
Inventor 弓源李长亮
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products