Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Entity recognition method and device and computer equipment

A technology for entity recognition and to-be-recognized, which is applied in computing, special data processing applications, instruments, etc., and can solve problems such as unrecognizable, poor maintenance and transferability, and difficult to reuse

Active Publication Date: 2019-06-21
TENCENT TECH SHANGHAI
View PDF10 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this entity recognition method requires a large amount of high-quality labeled corpus for model training, and the feasibility is poor; it can only recognize some simple entities. For proper nouns and ambiguous nouns, the recognition accuracy is low, and the update frequency is high. entity that doesn't even recognize
[0004] In this regard, the technicians proposed a method of using rule templates to identify named entities in the corpus, that is, to match the pre-constructed templates with the target corpus, and identify the named entities in the target corpus. Although this method can accurately identify all complex However, in the case of complex product functional domains, it is necessary to construct a large number of templates, the workload is huge, and the post-maintenance and transferability are poor, and it is difficult to reuse them in different scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity recognition method and device and computer equipment
  • Entity recognition method and device and computer equipment
  • Entity recognition method and device and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Combined with the analysis of the background technology part, the existing technology uses such as BiLSTM-CRF (Bi-directional LongShort-TermMemory-ConditionalRandomFields, long-short-term memory-conditional random field) algorithm or other deep learning algorithms to realize the method of named entity recognition, which has universal It is easy to implement, and has good maintainability and transferability. It can extract complex grammatical structures and some simple entities in different human speaking habits. The method of using rule templates to realize named entity recognition is suitable for fields with complex entities but simple sentence patterns, such as stocks, sports, and media-related fields.

[0037] Through the above analysis, for fields with complex entities and complex sentence patterns, such as the music field of voice assistants, this application needs a solution that can recognize all complex entities and reduce the complexity of the "stack template" a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an entity recognition method and device and computer equipment. The method comprises the steps of performing recall processing on to-be-identified corpora; adopting a dictionarymatching module to recall possible candidate slot position combinations of the corpus to be identified; marking the to-be-identified corpus according to the to-be-identified corpus; acquiring a corresponding slot position labeling sequence: respectively obtaining a coding vector of a corpus to be identified and a coding vector of each slot marking sequence by utilizing a deep learning network, selecting the coding vector of the slot marking sequence most similar to the coding vector of the corpus to be identified, and taking the slot marking sequence corresponding to the coding vector as an optimal slot marking sequence, thereby obtaining a named entity of the corpus to be identified. Therefore, by utilizing the advantages of the rule template and the deep learning algorithm, the named entities of various types of corpora can be quickly, simply and accurately identified, the cold start problem is solved, the obtained identification result is generally a computer language, and the electronic equipment can directly respond to the identification result.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, in particular to an entity recognition method, device and computer equipment. Background technique [0002] In recent years, with the development of artificial intelligence, human-computer dialogue systems have been widely applied to application platforms in various fields. The man-machine dialogue system is a computer system that can communicate with people. After obtaining the questions raised by the users, it needs to identify the named entities in the questions, so as to give the answers or corresponding operations required by the users, which simplifies the human-computer dialogue system. The flow of interaction. [0003] At present, in the application of named entity recognition (Named Entity Recognition, NER), it is proposed to use deep learning to identify named entities in the corpus, that is, to label named entities as sequences, and use large-scale corpus to ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/22
Inventor 杨奇杨君吴丹
Owner TENCENT TECH SHANGHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products