Entity recognition model generation method, entity recognition method, device and apparatus

An entity recognition and model generation technology, applied in the field of data processing, can solve the problems of inaccurate entity recognition results, poor performance of entity recognition models, and difficulty in obtaining training data in large quantities.

Active Publication Date: 2020-09-01
SHENYANG NEUSOFT XIKANG MEDICAL SYST
View PDF8 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in some fields, it is difficult to obtain a large amount of training data with labels. When the training data is small and cannot meet the needs of entity recognition model training in quantity, it will lead to poor performance of the trained entity recognition model. The entity recognition result obtained by recognizing the text is not accurate enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity recognition model generation method, entity recognition method, device and apparatus
  • Entity recognition model generation method, entity recognition method, device and apparatus
  • Entity recognition model generation method, entity recognition method, device and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0103] In order to make the above objects, features and advantages of the present application more clearly understood, the embodiments of the present application will be described in further detail below with reference to the accompanying drawings and specific implementation manners.

[0104] The inventor's research on the traditional entity recognition model found that the traditional entity recognition model is trained by the supervised model training method, and the entity recognition model is obtained by training with the training data with labels. Among them, the number of training data will affect the performance of the entity recognition model. However, in some fields, it is difficult to obtain a large number of training data with labels. When the number of training data is small, it will lead to the features learned by the entity recognition model. Insufficient, so that the recognition results of the entity recognition model are not accurate enough.

[0105] For exampl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention disclose an entity recognition model generation method, an entity recognition method, device and apparatus. The method comprises the steps of performing training by utilizing standard text data to obtain a mapping model of segmented words and word vectors; performing word segmentation on the first training text to obtain first segmented words included in the first training text; determining an approximate word of the first segmented word through the mapping model of the segmented word and the word vector, and generating an approximate sentence corresponding to thefirst training text through the approximate word of the first segmented word; obtaining feature representation of each character in the first training text and feature representation of each characterin an approximate sentence corresponding to the first training text; and training to generate an entity recognition model by utilizing the feature representation of each character in the first training text, the feature representation of each character in the approximate sentence corresponding to the first training text and the entity category label and the character sequence label of each character in the first training text.

Description

technical field [0001] The present application relates to the technical field of data processing, and in particular, to a method, device, and device for generating an entity recognition model, and a method, device, and device for entity recognition. Background technique [0002] In the process of building an entity recognition model using named entity recognition technology, a supervised model training method is usually used, and a corresponding entity recognition model is obtained by training a large amount of labeled training data. Among them, the amount of training data has an important impact on the performance of the trained entity recognition model. [0003] However, in some fields, it is difficult to obtain a large amount of training data with labels. When the training data is small and cannot meet the needs of entity recognition model training, the performance of the trained entity recognition model will be poor. The entity recognition result obtained by recognizing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/126G06F40/247G06F40/289G06F40/295G06N3/04G06N3/08
CPCG06F40/295G06F40/289G06F40/247G06F40/126G06N3/049G06N3/08G06N3/044
Inventor 杨贺羽李晓东付博
Owner SHENYANG NEUSOFT XIKANG MEDICAL SYST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products