Named entity identification method and device

A named entity recognition technology, applied in neural learning methods, special data processing applications, instruments, etc., that addresses the problems of inaccurate recognition, limited model features, and long training time, achieving high accuracy and accurate recognition results while taking fuller account of the available information.

Active Publication Date: 2015-09-09
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
5 Cites, 66 Cited by

AI Technical Summary

Problems solved by technology

The existing methods have at least the following disadvantages: (1) They rely mainly on the word itself as the input feature, so the model's features are limited and the dependencies between the entity tags of adjacent words are not introduced directly. This results in low recognition accuracy, and recognition is often inaccurate for named entities in the e-commerce field in particular. (2) Because the initial values of the network are randomly generated, the final optimized parameters may not be good enough, and the long training time leads to low development efficiency. (3) The distribution of the training data is not sufficiently considered, so the model fits different entities unevenly.

Examples

Embodiment Construction

[0018] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0019] To help those skilled in the art better understand the invention, a brief introduction to related terms is given first.

[0020] Word: the text of the word itself.

[0021] Word vector: the vectorized representation of a word. Each word is represented by a multidimensional vector.

[0022] Part of speech: the nature of a word. Words are usually divided into two categories ...

Abstract

The invention provides a named entity recognition method and device capable of accurately recognizing named entities, in particular named entities in the e-commerce field. The method comprises: acquiring a vector library; performing word segmentation on a training corpus text string to obtain a plurality of sample words; querying the vector library for each sample word in turn to obtain a first feature vector, which comprises the word vector and part-of-speech vector of that sample word together with the entity tag vector of the word preceding it; taking all the first feature vectors together as input and training a neural network named entity recognition model; performing word segmentation on a text string to be predicted to obtain a plurality of words to be tested; querying the vector library for each word to be tested in turn to obtain a second feature vector, which comprises the word vector and part-of-speech vector of that word together with the entity tag vector of the word preceding it; and inputting the second feature vector of each word to be tested into the model, which outputs the entity tag of that word.
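The feature construction described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the vector library contents, the vector dimensions, the tag names (`BRAND`, `O`, `<start>`), and the helper names `build_feature`, `training_features`, and `predict_tags` are all hypothetical. It also assumes that during training the preceding word's tag comes from the annotated corpus, and that during prediction it comes from the model's own output for the preceding word; the abstract does not state this explicitly.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Vector = List[float]

# Hypothetical toy vector library; the source gives neither contents nor dimensions.
WORD_VECS: Dict[str, Vector] = {
    "apple": [0.9, 0.1],
    "phone": [0.2, 0.8],
    "<unk>": [0.0, 0.0],  # fallback for words missing from the library
}
POS_VECS: Dict[str, Vector] = {"NOUN": [1.0, 0.0], "VERB": [0.0, 1.0]}
TAG_VECS: Dict[str, Vector] = {
    "<start>": [0.0, 0.0],  # assumed placeholder for the first word, which has no predecessor
    "O": [1.0, 0.0],
    "BRAND": [0.0, 1.0],
}


def build_feature(word: str, pos: str, prev_tag: str) -> Vector:
    """Concatenate word vector, part-of-speech vector, and previous word's entity tag vector."""
    word_vec = WORD_VECS.get(word, WORD_VECS["<unk>"])
    return word_vec + POS_VECS[pos] + TAG_VECS[prev_tag]


def training_features(sample: Sequence[Tuple[str, str, str]]) -> List[Vector]:
    """Build first feature vectors from (word, pos, gold_tag) triples.

    Assumption: the annotated tag of word i-1 supplies the tag vector for word i.
    """
    features: List[Vector] = []
    prev_tag = "<start>"
    for word, pos, gold_tag in sample:
        features.append(build_feature(word, pos, prev_tag))
        prev_tag = gold_tag
    return features


def predict_tags(
    words: Sequence[Tuple[str, str]], model: Callable[[Vector], str]
) -> List[str]:
    """Tag words left to right, feeding each predicted tag into the next word's feature.

    Assumption: `model` is any callable mapping a feature vector to a tag name.
    """
    tags: List[str] = []
    prev_tag = "<start>"
    for word, pos in words:
        prev_tag = model(build_feature(word, pos, prev_tag))
        tags.append(prev_tag)
    return tags
```

Feeding the preceding word's tag into each feature is what lets the model see dependencies between adjacent entity tags, which the listed disadvantages say word-only inputs do not capture.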

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, in particular to a named entity recognition method and device.

Background

[0002] With the rapid development of Internet technology, information services are becoming more and more popular. Named entity recognition is an important foundation for information service applications such as information extraction, question answering systems, syntactic analysis, machine translation, and Internet-oriented metadata annotation. Named entities (referred to as entities) are names of people, institutions, places, and all other entities identified by names. In a broader sense, named entities also include numbers, dates, currencies, addresses, etc.

[0003] In the prior art, neural networks have already been used to train named entity recognition models. The existing methods have at least the following disadvantages:...

Application Information

IPC(8): G06F17/30, G06F17/27, G06N3/08
CPC: G06F16/353, G06F40/284, G06F40/30, G06N3/08
Inventor: 姜文
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD