Named entity identification method and device

A technology for named entity recognition and named entity, which is used in special data processing applications, instruments, electrical digital data processing, etc.

Active Publication Date: 2016-05-04
CHINA CONSTRUCTION BANK
View PDF5 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are many named entity recognition methods, such as the recognition method with the patent number 201310201310674046.7. The process is: recognize the special words in the text to be processed; Process the replacement of special words identified as model entities in the text, and then identify entities such as commodity entities, commodity classification entities, brand entities, commodity attribute name entities, and commodity attribute value entities on this basis. This recognition method is mainly for General texts, while the texts in social networks are mainly short texts. In social networks such as Weibo or QQ, most of the texts posted by users are short texts, and users in social networks will follow each other. However, the current named entity recognition method It is not based on this feature, so there is an urgent need for a named entity recognition method suitable for social networks such as Weibo or QQ

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity identification method and device
  • Named entity identification method and device
  • Named entity identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0090] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0091] see figure 1 , which shows a flow chart of a named entity recognition method provided by an embodiment of the present invention, which is used to identify the named entity of each word in each test document in a social network, and may specifically include the following steps:

[0092] 101: Based on the initially constructed first seq...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a named entity identification method and device. After first entity probability distribution of a train document and second entity probability distribution of a test document are obtained by utilizing an initially constructed first sequence annotation model, features, such as the first context similarity of the train document and the first object similarity of the train document and the second context similarity of the test document and the second object similarity of the test document, can be extracted from social network information; therefore, a second sequence annotation model is obtained by training the first context similarity of the train document and the first object similarity of the train document, such that the second sequence annotation model is more suitable for a social network; and in addition, the named entity identification result, which is obtained by performing sequence annotation of the test document based on the second sequence annotation model suitable for the social network, is more accurate.

Description

technical field [0001] The present invention belongs to the technical field of named entities, and more specifically, relates to a named entity recognition method and device. Background technique [0002] Named entities refer to entities with specific meanings, such as the name Li San, and named entity recognition is to identify entities with specific meanings in the text, mainly including personal names, place names, organization names, and proper nouns. These identified entities are used as follow-up The input of information extraction tasks, such as relationship extraction, event extraction, and fine-grained sentiment analysis, can be used as the input of information extraction tasks. Therefore, the quality of named entity recognition results directly affects the effect of subsequent information extraction tasks. [0003] At present, there are many named entity recognition methods, such as the recognition method with the patent number 201310201310674046.7. The process is:...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06Q50/00
CPCG06F16/367G06Q50/01
Inventor 张晨谢隆飞尹泓钦王全礼
Owner CHINA CONSTRUCTION BANK
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products