Named entity identification method and device, equipment and storage medium

A named entity recognition and entity recognition technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of difficult field migration, large data labeling workload, etc., to reduce the training threshold and improve field versatility , The effect of reducing the workload of data labeling

Pending Publication Date: 2019-12-10
ZTE CORP
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a named entity recognition method, device, device, and storage medium to solve the problems of heavy data labeling workload and difficult domain migration

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity identification method and device, equipment and storage medium
  • Named entity identification method and device, equipment and storage medium
  • Named entity identification method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described below are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0033] figure 1 is a flowchart of a named entity recognition method provided by an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0034] Step S101: Perform entity recognition on text data in the new domain to obtain seed entity words in the new domain.

[0035] The new field refers to a field in which entity words have not been mined or a field in which entity words are not sufficiently mined. In this new field of entity words to be mined, there is no or lack of labeled corpus.

[0036] As a first way to realize step S101, the new domain text data can be split into new domain single sentences, and then according t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a named entity identification method and device, equipment and a storage medium, and relates to the fields of natural language processing, semantic analysis and understanding,artificial intelligence and the like. The method comprises the steps of performing entity identification on new domain text data to obtain new domain seed entity words; labeling the new domain text data according to the new domain seed entity words to obtain labeled new domain text data; training a named entity recognition model by using the labeled new domain text data to obtain a named entity recognition model suitable for the new domain; and identifying entity words in other text data of the new domain by utilizing the named entity identification model suitable for the new domain. Accordingto the embodiment of the invention, the data annotation workload can be reduced, the model migration training threshold is reduced, and the field universality of the algorithm is improved.

Description

technical field [0001] The present invention relates to the fields of natural language processing, semantic analysis and understanding, artificial intelligence, etc., and particularly relates to a NER (Named Entities Recognition, named entity recognition) method, device, equipment and storage medium. Background technique [0002] Named entity recognition is a basic branch of NLP (Natural Language Processing) and one of the key technologies in information extraction, targeting proper nouns in the field. Common areas mainly include: names of people, places, organizations, etc. The specific field mainly refers to the proper nouns in the field, such as "credit card" and "debit card" in the banking field. [0003] Existing technologies can be divided into three categories, one is the method based on dictionaries and rules, which relies on the construction of dictionaries and rules, and has great limitations in dealing with new words and new fields; the other is the method based ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/295
Inventor 温海娇陈虹牛国扬董修岗
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products