Recommendation algorithm combining Word2vec word vector and LSH (Local Sensitive Hash)

A locally sensitive hashing and recommendation algorithm technology, applied in computing, complex mathematical operations, digital data information retrieval, etc., can solve problems such as poor user experience, inaccurate similarity calculation, and inaccurate similarity

Inactive Publication Date: 2020-04-24
KUNMING UNIV OF SCI & TECH
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such as: 1. Timeliness issues. The recommendation system needs to store relevant information of users and items. With the development of technology, the demand for recommendation systems has become larger. The information is growing rapidly, the calculation speed is very slow, the time is prolonged, and the user experience effect is not good. it is good
2. The problem of data sparsity. Most collaborative filtering recommendation algorithms use ratings to predict after sim

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recommendation algorithm combining Word2vec word vector and LSH (Local Sensitive Hash)
  • Recommendation algorithm combining Word2vec word vector and LSH (Local Sensitive Hash)
  • Recommendation algorithm combining Word2vec word vector and LSH (Local Sensitive Hash)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] combine Figure 1-Figure 3 , the present invention will be further described.

[0021] The present invention adopts the Word2Vec word vector model and the matrix decomposition recommendation algorithm of LSH local sensitive hash based on cosine similarity, which can solve the sparsity and accuracy of the traditional collaborative filtering recommendation algorithm, as well as the untimely recommendation in a large amount of data, thus causing The problem of poor user experience.

[0022] The first step: file processing and project similarity calculation

[0023] This step uses the u.Item file. Each line of the file u.Item is a description of the attributes of a movie. Different description items are separated by a single vertical line: the first item is the index number, the second item is the movie name, the third item is the release date, and the fourth item It is empty, the 5th item is URL information, and the 6th to 24th items are movie types described by bitmaps...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a matrix decomposition recommendation algorithm combining a Word2Vec word vector model and LSH local sensitive hash based on cosine similarity. The method comprises the following steps: firstly, converting word similarity into vector similarity by utilizing a Word2Vec model, then performing LSH locality sensitive hash high-speed calculation based on cosine similarity to obtain a project similarity matrix, combining the obtained project similarity matrix with an original score, outputting a pre-score of an unscored project, and filling the pre-score into a training set;and finally, taking the training set as the input of an ALS matrix decomposition algorithm to obtain a recommendation result. Compared with a traditional collaborative filtering recommendation algorithm, the improved algorithm is lower in MAE value and better in performance, and meanwhile the problem that recommendation is not timely in a large amount of data can be effectively solved.

Description

technical field [0001] The present invention proposes a matrix decomposition recommendation algorithm combining Word2Vec word vector model and LSH local sensitive hash based on cosine similarity, mainly aimed at ALS. Although the matrix decomposition recommendation algorithm is better than the neighborhood-based collaborative filtering algorithm, there are still data Sparseness, accuracy, and timeliness of recommendations in a large amount of data lead to poor user experience. The method field belongs to the field of data mining based on cloud computing platform. Background technique [0002] Machine learning and data mining continue to progress in the repeated updates and improvements of the big data platform. Among them, the recommendation system is one of the representatives. The basic idea of ​​a recommendation engine is to speculate on user preferences and assist the whole process by exploring the associations between objects. As such, it is complementary to search e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/16G06F40/194G06F40/151G06F16/9536
CPCG06F16/9536G06F17/16
Inventor 吴晟舒珏淋
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products