Method and device of analyzing search keyword frequency

A technology of search keyword and analysis method, which is applied in the field of HLSA-based search keyword frequency analysis, can solve the problems of limited use occasions, fine keyword granularity, and error in similarity calculation results, so as to avoid misjudgment and improve calculation efficiency effect

Active Publication Date: 2017-09-26
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the Levenshtein distance model has the advantage of not needing to consider whether the keywords are linearly independent, its disadvantage is that if the order of keywords changes, the similarity calculation results will have a large error
However, the longest common subsequence

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device of analyzing search keyword frequency
  • Method and device of analyzing search keyword frequency
  • Method and device of analyzing search keyword frequency

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] On the basis of making full use of the user's search keyword database, the present invention adopts similar commodity similarity aggregation technology HLSA (Hamming Latent Semantic Analysis, Hamming code + latent semantic analysis) and KNN (K-Nearest Neighbor, K nearest neighbor) classification method, to The frequency of appearance of keywords in product search is analyzed.

[0039] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0040] In an exemplary embodiment of the present invention, an HLSA-based search keyword frequency analysis method is provided. like figure 1 As shown, the search keyword frequency analysis method based on HLSA in this embodiment includes:

[0041] Step A: Preprocessing, that is, extracting the search keyword records, performing word segme...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and device of analyzing search keyword frequency based on HLSA. In the method, keyword aggregation is conducted by introducing the LSA space model which contains a theme, the deficiency that the Euclidean distance model based on VSM vector does not take into account the semantic information of a word per se is overcome and the error caused by the order changes of keywords based on an edit distance model is remedied. Additionally the method further combines with Hamming keywords to make computations on the similarity of eigenvectors between the keywords, new HLSA algorithm is formed, the computation efficiency of similarity is increased; the K-nearest neighbor algorithm is utilized to classify and statistically measure the frequency of keywords, aggregation is conducted on keywords of different granularities, and misjudgments due to too small particle size by the longest common substring model are effectively avoided.

Description

technical field [0001] The invention relates to the technical field of electronic commerce, in particular to an HLSA-based search keyword frequency analysis method and device. Background technique [0002] The keywords entered by the user in the search bar of the e-commerce platform are important reference information to express their willingness to purchase a certain product. Aggregating, classifying, and counting the frequency of product search keywords within a predefined time period can effectively quantify the degree of user demand for a certain product, and then provide sales personnel with information on whether a certain product needs to be purchased, put on the shelf or enhanced Its promotion efforts provide a strong reference decision-making basis. [0003] The premise of counting the frequency of search keywords is to classify them, and the basis of classification theory almost always depends on the similarity model. At present, the methods for calculating the s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35G06F16/374
Inventor 兰华勇
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products