Identification method and device for core product word in title

A recognition method and technology of product words, applied in the computer field, can solve problems such as inaccurate recognition, inaccurate recognition of core product words, indistinguishability, etc., and achieve the effects of improving accuracy, expanding recall, and improving ambiguity problems

Active Publication Date: 2017-05-10
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, there is a problem of inaccurate recognition in the method of using the vocabulary to analyze the core product words in the title. For example, the word "Xiaomi" has different meanings in different contexts: one is the brand word "Xiaomi mobile phone", and the other is Product word "millet porridge"
It is impossible to distinguish these two meanings simply by using a vocabulary, so the recognition of core product words is inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Identification method and device for core product word in title
  • Identification method and device for core product word in title
  • Identification method and device for core product word in title

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. The following description of at least one exemplary embodiment is merely illustrative in nature and in no way taken as limiting the invention, its application or uses. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0035] The identification device of the core product word in the title in the embodiment of the present invention can each be realized by various computing devices or computer systems, below in conjunction with figure 1 as well as figure 2 to describe.

[0036] figure 1 It is a structural di...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an identification method and device for a core product word in a title, and relates to the technical field of computers. When a word2vec model generates a word vector of a word, previous and next words adjacent to the word or words near the word in the title can be referenced, and therefore word vectors generated by the same word in different contexts are different; meanwhile, the n-gram characteristics of the product word contain the previous and next words adjacent to the product word or the words near the product word in the title, the context of the product word can be expressed, and therefore the n-gram characteristics are obtained by the same word in different contexts are different; accordingly, word vector expressions for the n-gram characteristics are different, results obtained when identification is conducted through a core product word identification model are different, the ambiguity problem of identification of the core product word in the title is solved, and the accuracy rate is increased.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and device for identifying core product words in a title. Background technique [0002] Sentence component analysis refers to the use of various methods to mark the basic components of sentences. Sentence component analysis is one of the basic problems in natural language processing and has a wide range of applications. E-commerce title component analysis is a branch of sentence component analysis, which is widely used in intent recognition, personalized ranking and other fields. But because the title is a pile of words (no subject, predicate verb, etc.), it is more complicated. [0003] The core product word in the title refers to the specific product involved in the title. For example, the core product word in the Korean version of casual pants men’s clothing is trousers. Identifying the core product word in the title is a major method for sentence composition anal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/211G06F40/284G06F18/22
Inventor 车天博高维国陈海勇
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products