An entity identification method and related equipment

An entity recognition and entity technology, applied in the field of information processing, can solve problems such as low accuracy, poor entity recognition performance, and inability to effectively capture entity dependencies, and achieve the effect of improving accuracy

Active Publication Date: 2019-06-18
TENCENT TECH (SHENZHEN) CO LTD +1
View PDF11 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, for entity extraction tasks, the mainstream extraction models are conditional random field models and neural network-conditional random field models. Such models cannot directly handle nested structures, and can only complete nested entity recognition by superimposing multiple models. However, the way of superimposing multiple models will not be able to effectively capture the dependencies between entities because each conditional random field model is independent of each other, resulting in poor performance of entity recognition and low accuracy of entity extraction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An entity identification method and related equipment
  • An entity identification method and related equipment
  • An entity identification method and related equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0034] See figure 2 , figure 2 It is a schematic structural diagram of an information extraction system provided by an embodiment of the present invention. The information extraction system includes information processing equipment, databases and other equipment. Among them, the information processing device can be a computer, a mobile phone, a server (such as a database server, a file server), etc., and a large amount of voice information and text...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses an entity identification method and related equipment, and the method comprises the steps: firstly, obtaining a plurality of annotation corpora, and enablingeach annotation corpus in the plurality of annotation corpora to carry annotation information; Establishing a hypergraph model according to a preset entity labeling rule; Determining a labeling path diagram corresponding to each labeling corpus according to the labeling information and an entity labeling rule, and establishing a to-be-trained model according to the hypergraph model and a preset neural network model; And finally, inputting the labeled path diagram into a to-be-trained model for training to obtain an entity recognition model, and recognizing at least one named entity in the input corpus according to the entity recognition model. By adopting the embodiment of the invention, the entity of the nested structure can be effectively identified, so that the accuracy of entity identification and entity extraction is improved.

Description

technical field [0001] The present invention relates to the technical field of information processing, in particular to an entity recognition method and related equipment. Background technique [0002] In the era of information explosion, how to quickly and effectively extract the required information from massive data has become a hot research topic, which has led to the research on natural language processing. For a long time, the task of entity extraction has been widely concerned in the field of natural language processing. It is the pre-step of many natural language processing tasks, so its performance also directly affects the performance of downstream natural language processing tasks, such as entity connection and entity relationship. Classification, knowledge graph reasoning, etc. Among them, entities are named entities, which refer to the names of people, institutions, places, and all other entities identified by names in natural language. More extensive entities ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06N3/08
Inventor 林浚玮邵轶男王巨宏陈伟
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products