A Clustering Method of User Preference and Distance Weighted Integrating Time Factor

A clustering method and distance weighting technology, applied in other database clustering/classification, data processing applications, special data processing applications, etc., can solve problems such as user cold start data sparsity, to solve data sparsity, solve interest Migrate, resolve cold start effects

Active Publication Date: 2021-08-31
TIANJIN UNIVERSITY OF TECHNOLOGY
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The purpose of the present invention is to solve the problem of user cold start and data sparsity in the original collaborative filtering recommendation algorithm, optimize and improve the existing algorithm, and design a user preference and distance weighted algorithm that integrates time factors clustering method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Clustering Method of User Preference and Distance Weighted Integrating Time Factor
  • A Clustering Method of User Preference and Distance Weighted Integrating Time Factor
  • A Clustering Method of User Preference and Distance Weighted Integrating Time Factor

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0107] We verify the correctness and effectiveness of this algorithm through experiments, and verify the performance of the algorithm by comparing it with related algorithms. This experiment selected the 100K MovieLens dataset, which was collected by the GroupLens research team at the University of Minnesota. The file u.data included 100,000 ratings and timestamps for 1,682 movies by 943 users. Each user has at least 20 ratings, and the range of ratings is an integer from 1 to 5. The larger the value, the more the user likes the movie. This application mainly uses the mean absolute error (MAE) and F-Measure to analyze the experimental results.

[0108] The mean absolute error (MAE) is used to evaluate the degree of deviation between the user's predicted score and the actual score for an item. The smaller the value of MAE, the smaller the deviation and the better the recommendation effect. The formula is as follows:

[0109]

[0110] in: and Respectively represent the u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A user preference and distance-weighted clustering method that integrates time factors, and alleviates the user cold start problem by introducing a user-user attribute matrix constructed from the basic objective characteristics of users, while the sparsity problem is mainly improved by introducing item features, Since the characteristics of the item can reflect the user's preference from the aspect of content, the dimension of the matrix can be reduced; the item feature is introduced into the user-item rating to obtain a small-dimensional user-item attribute total score matrix; use TF‑IDF The algorithm introduces item features when constructing the user-item attribute preference matrix, and at the same time considers the impact of user interest drifting over time on user preference; based on the above three matrices, the weighted Euclidean distance is obtained, and then the K-Means algorithm is used for clustering. This method takes movie recommendation as an example. Experimental results on the MovieLens dataset show that this method has better recommendation quality and performance than other related algorithms.

Description

technical field [0001] The invention relates to a personalized recommendation algorithm, and specifically provides a clustering method that integrates user preferences of time factors and distance weighting. Background technique [0002] In recent years, with the development of information technology and Web2.0, the information on the Internet has experienced an unprecedented surge, and problems have followed, mainly including the problem of information overload and the problem that users cannot accurately select relevant information. The recommendation system is One of the effective tools to overcome the problem of information overload. The core of the recommendation system is to design the recommendation algorithm. Therefore, various recommendation algorithms have been proposed in academia. Currently, the main recommendation algorithms used include content-based recommendation algorithms, combination recommendation algorithms, and collaborative filtering recommendation alg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/28G06F16/906G06F16/9535G06Q30/02
CPCG06Q30/0269
Inventor 李文杰薛花张德干
Owner TIANJIN UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products