Unlock instant, AI-driven research and patent intelligence for your innovation.

Named entity extraction method and device and medium

A named entity, named entity recognition technology, applied in special data processing applications, instruments, electrical digital data processing and other directions, can solve problems such as confusing subdivision types, and achieve the effect of low overall cost

Pending Publication Date: 2019-04-26
ULTRAPOWER SOFTWARE
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, it is easy to confuse the subdivision types of multiple similar named entities when using the recognition model trained by the above training samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity extraction method and device and medium
  • Named entity extraction method and device and medium
  • Named entity extraction method and device and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] Using the recognition model to extract subdivided named entities not only has poor accuracy, but also has the problems of high training cost and low reuse rate of the recognition model. Specifically, training the recognition model requires a large number of training samples, and training samples in the form of "text-organization name-subdivision type" need to manually label corpus according to different extraction tasks and text fields, and construct training samples, which leads to the high cost of model training. At the same time, since each recognition model is trained with targeted training samples according to different extraction tasks and text fields, such recognition models cannot be used for other extraction tasks, nor can they be used to deal with different fields. text, which leads to a low reuse rate of the recognition model.

[0043]To this end, the present application provides a new named entity extraction method, which combines named entity recognition m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a named entity extraction method and device and a computer readable storage medium. The method comprises the following steps: identifying a first named entityfrom a to-be-extracted text by utilizing a named entity identification model; Obtaining an extraction rule, the extraction rule comprising a positioning expression and an extraction expression, and the extraction rule corresponding to a preset subdivision type of the named entity; Determining an effective extraction area in the text by utilizing the positioning expression, wherein the effective extraction area comprises a first named entity; Extracting a second named entity from the effective extraction area; Wherein the second named entity is a character string matched with the extraction expression, and the subdivision type of the second named entity is a subdivision type corresponding to the extraction rule. By adopting the method in the technical scheme, the named entity of the subdivision type can be extracted from the text more accurately, and the named entity recognition model with higher universality can be adopted, so that the cost required for completing the extraction taskis reduced.

Description

technical field [0001] The invention relates to the fields of information extraction and text mining, in particular to a named entity extraction method. In addition, the invention also relates to a named entity extraction device and medium. Background technique [0002] Named entities generally refer to person names, organization names, place names, and all other entities identified by names. More broadly, named entities also include numbers, dates, and currencies. The types of named entities can be defined according to the problem. For example, in an existing definition, named entities can include three categories: entity class, time class and value class. Among them, the entity category includes names of people, places, and institutions; the time category includes dates, time, etc.; the value category includes currency, weights and measures, percentages, etc. Named entities occupy an important position in many application fields such as information extraction, question a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/295
Inventor 吴云鹤李德彦
Owner ULTRAPOWER SOFTWARE