Named entity identification method and device

A named entity recognition, entity technology, applied in neural learning methods, special data processing applications, instruments, etc., can solve the problems of inaccurate recognition, single model features, long training time, etc., to achieve high accuracy, accurate recognition results, Consider the full effect of the information

Active Publication Date: 2015-09-09
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF5 Cites 66 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing methods have at least the following disadvantages: (1) they mainly rely on the word itself as the input feature, the model features are single, and the front and back dependencies between entity tags are not directly introduced, resulting in low recognition accuracy, especially in When identifying named entities in the e-commerce fi

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Named entity identification method and device
  • Named entity identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0018] The following describes exemplary embodiments of the present invention with reference to the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and should be regarded as merely exemplary. Therefore, those of ordinary skill in the art should realize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present invention. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted in the following description.

[0019] In order to enable those skilled in the art to better understand, a brief introduction to related terms is first given.

[0020] Word: The text of the word itself.

[0021] Word vector: the vectorized representation of words, each word is represented by a multi-dimensional vector.

[0022] Part of speech: the nature of the word. Words are usually divided into two c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a named entity identification method and a named entity identification device capable of accurately identifying a named entity, in particular to a named entity in the field of E-business. The method comprises: acquiring a vector library; carrying out word segmentation on a training corpus text string to obtain a plurality of sample words; inquiring the vector library of each sample word sequentially to obtain a first feature vector which comprises a word vector and a word class vector corresponding to the same word as well as an entity marking vector corresponding to the last word of the sample word; taking all the first feature vectors integrally as an input quantity, and training a named entity identification model of a neutral network; carrying out word segmentation on a to-be-predicted text string to obtain a plurality of to-be-tested words; inquiring the vector library of each sample word sequentially to obtain a second feature vector which comprises a word vector and a word class vector corresponding to the same word as well as an entity marking vector corresponding to the last word of the sample word; respectively inputting the second feature vectors corresponding to all the to-be-tested words into the model, and outputting entity identifiers of the to-be-tested words.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a named entity recognition method and device. Background technique [0002] With the rapid development of Internet technology, information services are becoming more and more popular. Among them, the identification of named entities is an important basic work in information service application fields such as information extraction, question answering system, syntax analysis, machine translation, and Internet-oriented metadata annotation. Named entities (referred to as entities) refer to names of people, institutions, places, and all other entities that are identified by names. A wider range of named entities also includes numbers, dates, currencies, addresses, etc. [0003] In the prior art, there has been a technology of using neural network technology to train named entity recognition. The existing methods have at least the following disadvantages:...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27G06N3/08
CPCG06F16/353G06F40/284G06F40/30G06N3/08
Inventor 姜文
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products