Named entity recognition method and device, storage medium and terminal device

A named entity recognition and corpus technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of poor named entity recognition model recognition effect, consuming a lot of time and energy, and increasing training costs, etc. The effect of enhancing context understanding ability, reducing training cost, and improving recognition effect

Active Publication Date: 2019-09-27
GUANGZHOU DUOYI NETWORK TECH +2
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]The existing technology generally builds a named entity recognition model, and recognizes the named entities in the text according to the trained named entity recognition model, and then trains the named entity recognition model However, manual labeling takes a lot of time and energy, and the available labeled corpus is less, resulting in The training cost is increased, and the recognition effect of the named entity recognition model obtained by training with a small amount of labeled corpus is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity recognition method and device, storage medium and terminal device
  • Named entity recognition method and device, storage medium and terminal device
  • Named entity recognition method and device, storage medium and terminal device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0046] The embodiment of the present invention provides a named entity recognition method, see figure 1 As shown, it is a flow chart of a preferred embodiment of a named entity recognition method provided by the present invention, and the method includes steps S11 to S15:

[0047] Step S11, obtaining unlabeled corpus;

[0048] Step S12, training a preset language model according to the unmarked corpus;

[0049] Step S13, labeling the unlabeled corpus to obtain th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a named entity recognition method. The named entity recognition method comprises the steps of obtaining the unlabeled corpora; training a preset language model according to the unlabeled corpus; labeling the unlabeled corpora to obtain the labeled corpora; training a preset named entity recognition model according to the tagged corpus, wherein the named entity recognition model is constructed according to a trained language model; and identifying the named entity in the to-be-identified text according to the trained named entity identification model. Correspondingly, the invention further discloses a named entity recognition device, a computer readable storage medium and a terminal device. By adopting the technical scheme of the invention, the unlabeled corpus can be fully utilized to train the language model, and the context understanding capability of the language model is enhanced, so that the training cost is reduced, and the recognition effect is improved.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a named entity recognition method, device, computer-readable storage medium and terminal equipment. Background technique [0002] Natural Language Processing (NLP) is the field of interaction between computer and human language that computer science, artificial intelligence, and linguistics focus on. It is an important direction in the field of computer science and artificial intelligence. As a basic task in NLP, named entity recognition (Named Entity Recognition, NER) refers to the recognition of entities with specific categories from text, such as names, place names, organization names, proper nouns, etc. In artificial intelligence research, named entity recognition is a task that must be overcome, and the recognition effect of named entities has an important impact on a series of subsequent artificial intelligence technologies. [0003] The existin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/279G06F40/295
Inventor 徐波
Owner GUANGZHOU DUOYI NETWORK TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products