Data flow clustering method integrating cluster existence strength

A data flow clustering and clustering technology, applied in the web field, can solve problems such as not being able to fully utilize clusters, and achieve good application value, small amount of calculation, and accurate results

Active Publication Date: 2014-10-08
ZHEJIANG GONGSHANG UNIVERSITY
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention aims at the disadvantage that the prior art cannot give full play to the impact of cluster existence intensity on clustering, and provides a data flow clustering method integrated into cluster existence intensity, which can realize the problem of adjusting data flow clustering by applying cluster existence intensity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data flow clustering method integrating cluster existence strength
  • Data flow clustering method integrating cluster existence strength
  • Data flow clustering method integrating cluster existence strength

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0021] A data stream clustering method that incorporates cluster presence strengths, such as figure 1 , 2 shown, including the following specific steps:

[0022] Preprocessing step 100: process the user characteristic information of a specific user to form a user attribute database, the user characteristic information refers to information including user background information and user behavior information that can be collected and collected by a human-computer interaction interface or a human-computer interaction device. It can be converted into user attribute data that can be represented by a data string of a specific length and format. User background information includes user basic information, user login IP, login time and other information. User behavior information includes user preference data. Information such as access frequency, scope and time range of a specific website, the user attribute database is used to store the user attribute data;

[0023] User clusterin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of the web, and discloses a data flow clustering method integrating cluster existence strength. The method includes the following specific steps of conducting preprocessing, wherein information of a specific user is preprocessed and stored to a user attribute database; conducting user clustering, wherein skill-oriented clustering is conducted on user attributes; forming association rules, wherein association rules based on user attribute data are formed; conducting drift detection, wherein the association rules are detected in real time so that effectiveness of the association rules can be ensured. The data flow clustering method has the advantages that the influences of the cluster existence strength on clustering are fully utilized, and the uncertain data flow clustering method can integrate three factors, namely, the distance, the cluster existence probability and the cluster existence strength, indeed.

Description

technical field [0001] The invention relates to the field of web technologies, in particular to a data stream clustering method incorporating cluster existence strength. Background technique [0002] In an e-commerce recommendation system, the acquisition of user information usually comes from the registration information submitted by the user and the implicit information such as user search keywords, browsing time, purchase behavior, etc. However, there are often dilemmas between users and e-commerce websites: Users are unwilling to provide personal information to the system due to the protection of personal privacy information. The survey shows that 80% of users can provide information on gender, age, educational background, and region when filling out the questionnaire, but for income level, occupation, etc. More private information is not expected to be disclosed; at the same time, website operators are very eager to obtain more information about users, so as to better m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35G06Q30/0201
Inventor 琚春华鲍福光肖亮魏建良
Owner ZHEJIANG GONGSHANG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products