Recommendation system noise filtering method based on information entropies and fuzzy C-means clustering

A recommendation system and mean clustering technology, applied in character and pattern recognition, special data processing applications, instruments, etc., can solve the problems of large information resource space, low degree of intelligence, deviation of target item recommendation scores, etc., and achieve enhanced service. The effect of quality, improved accuracy

Active Publication Date: 2018-01-26
南京理工大学紫金学院
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] (1) Search engines can help users obtain information resources at the keyword level, but this method is less intelligent and cannot describe users' information needs at the knowledge level, resulting in a very large space for searched information resources
Although the existing information filtering can handle dynamically changing user needs and allow users to modify and adjust the needs, the processing method is relatively simple and cannot identify natural noise data caused by user behavior diversity
[0011] (2) The scores of some natural noise data are very similar to the scores of normal users, so when calculating user similarity, it is easy to enter the user's neighbor circle, and the recommendation scor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recommendation system noise filtering method based on information entropies and fuzzy C-means clustering
  • Recommendation system noise filtering method based on information entropies and fuzzy C-means clustering
  • Recommendation system noise filtering method based on information entropies and fuzzy C-means clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0121] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific examples.

[0122] The summary of the recommendation system noise filtering method based on information entropy and fuzzy C-means clustering proposed by the present invention is as follows: figure 1 shown.

[0123] Concrete process of the present invention sees figure 2 , the main variables and parameters in the process are shown in Table 1.

[0124] Table 1

[0125]

[0126]

[0127] The first step is to collect and sort out the user history rating data of the target recommendation system. The implementation steps are as follows:

[0128] (1.a) Select the user rating dataset R for the target recommender system. This data set is obtained by scoring M items (commodities / services) by N users, specifically expressed as R=i , I j , V ij >, where U i is the i-th user (1≤i≤N), I j is the jth item (1≤j≤M), V ij is the user U i For item I ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a recommendation system noise filtering method based on information entropies and fuzzy C-means clustering. The method comprises steps that first, user historical scoring dataof a target recommendation system is collected and arranged; second, Monte Carlo stochastic simulation is utilized to construct sub data sets of the user scoring data, a recommendation algorithm is utilized to acquire information entropies and recommendation precision of different sub data sets; third, the information entropies are classified according to uncertainty levels, recommendation precision is classified according to recommendation precision levels, and an empirical model is constructed to determine the potential natural noise data proportion; fourth, fuzzy clustering analysis on allthe user scoring data sets is carried out, and noise data is identified and deleted; and fifth, a recommendation algorithm operates for all the scoring data sets, and a recommendation precision indexis utilized to evaluate recommendation quality. The method is advantaged in that quantization measurement of the user scoring information can be realized, and the proposed natural noise data filteringtechnology has certain universality and portability.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence and machine learning, and relates to a noise data elimination method from massive user data, in particular to a noise data filtering method using information entropy and fuzzy C-means clustering. Background technique [0002] As of 2017, the number of monthly active users on Sina Weibo has reached 297 million, Taobao users have exceeded 800 million, stores have exceeded 5 million, and products have exceeded 800 million. According to incomplete statistics, 98% of products have the opportunity to be accepted by users with different hobbies. Relevant technicians systematically counted the sales records of electronic platforms such as Google, Amazon, eBay, Netflix, etc., arranged all products in reverse order of sales volume to form a long tail shape, and found that the total sales of products with low overall sales far exceeded that of major popular products. Total sales. This "long...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q30/06G06K9/62G06F17/30
Inventor 朱俊韩立新
Owner 南京理工大学紫金学院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products