Data mining method, device and system

A data mining technology, applied in the field of big data, that addresses the problems of inaccurate clustering and of user groups that fail to reflect user behavior characteristics well.

Status: Inactive
Publication Date: 2017-05-10
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
0 Cites, 15 Cited by

AI Technical Summary

Problems solved by technology

[0003] However, the effect of dividing user groups based on user behavior characteristics in the clustering operation depends to a large extent on the quality of basic data, and the existing user group division based on cluste...

Embodiment Construction

[0030] The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

[0031] A flowchart of an embodiment of the data mining method of the present invention is shown in Figure 1.

[0032] In step 101, predetermined behavior data of the user is acquired. Each piece of predetermined behavior data includes the utility data of the predetermined behavior and the generation time of the predetermined behavior. The same user can have multiple pieces of predetermined behavior data, each with its own generation time and utility data. In one embodiment, predetermined behavior data of multiple users may be obtained.
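As an illustration of the data acquired in step 101, the following minimal sketch models one piece of predetermined behavior data and groups records by user. The class name `BehaviorRecord`, its field names, and the reading of utility data as an amount are assumptions made for this example; the source does not specify a data layout.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class BehaviorRecord:
    """One piece of predetermined behavior data (field names are assumed)."""
    user_id: str
    generated_at: datetime  # generation time of the behavior
    utility: float          # utility data, e.g. an amount (assumed meaning)


def group_by_user(records):
    """Collect each user's records, sorted by generation time."""
    grouped = defaultdict(list)
    for record in records:
        grouped[record.user_id].append(record)
    return {uid: sorted(recs, key=lambda r: r.generated_at)
            for uid, recs in grouped.items()}


# A user may own several records, as the paragraph above notes.
records = [
    BehaviorRecord("u1", datetime(2016, 5, 1), 120.0),
    BehaviorRecord("u2", datetime(2016, 5, 3), 35.5),
    BehaviorRecord("u1", datetime(2016, 3, 9), 60.0),
]
by_user = group_by_user(records)
```

Keeping the generation time and the utility value on the same record means the later steps can read both from a single per-user list.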

[0033] In step 102, the users are classified according to the generation time and the quantity of the predetermined behavior data of each user, and a set of target users is determined. In one embodiment, the classification can be performed according to the generation time of the prede...
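The classification in step 102 can be sketched as a filter over generation time and record count. The source text is truncated here, so the rule below (at least `min_count` behaviors within the last `window_days` days) and both threshold values are hypothetical, chosen only to show the two inputs working together.

```python
from datetime import datetime, timedelta


def select_target_users(times_by_user, now, window_days=90, min_count=2):
    """Return users with at least `min_count` behaviors in the last `window_days`.

    The rule and both thresholds are illustrative assumptions, not values
    taken from the source.
    """
    cutoff = now - timedelta(days=window_days)
    targets = set()
    for user_id, times in times_by_user.items():
        recent = [t for t in times if t >= cutoff]
        if len(recent) >= min_count:
            targets.add(user_id)
    return targets


times_by_user = {
    "u1": [datetime(2016, 4, 2), datetime(2016, 5, 20)],   # two recent behaviors
    "u2": [datetime(2015, 1, 7), datetime(2016, 5, 1)],    # only one is recent
    "u3": [datetime(2016, 5, 29)],                         # too few behaviors
}
targets = select_target_users(times_by_user, now=datetime(2016, 6, 1))
```

With these sample values only `u1` satisfies both conditions, so the target user set is `{"u1"}`.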

Abstract

The invention provides a data mining method, device and system, and relates to the field of big data. The method comprises the following steps: obtaining predetermined behavior data of users; classifying the users according to the generation time and the quantity of each user's predetermined behavior data to determine a target user set; generating a single-user feature vector for each user in the target user set from the predetermined behavior data; and grading the target user set with a clustering algorithm applied to the single-user feature vectors to determine graded user sets. Because the users are classified first and clustering is carried out within one category, appropriate target users can be selected for cluster analysis. On one hand, this makes the analysis better targeted and reduces the volume of data to be processed; on the other hand, it removes the interference of other types of user data with the clustering result, so that the user groups are divided more accurately.
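The last two steps of the abstract, building a single-user feature vector and grading users by clustering, can be sketched as follows. This is a minimal sketch under stated assumptions: the feature vector (behavior count, total utility), the use of plain k-means, and the rule that ranks clusters by centroid utility are all choices made for this example, since the abstract names neither the features nor the clustering algorithm.

```python
import math


def feature_vector(utilities):
    """Single-user feature vector: (behavior count, total utility). Assumed features."""
    return (float(len(utilities)), float(sum(utilities)))


def kmeans(points, k, iterations=20):
    """Plain k-means (Lloyd's algorithm); the first k points seed the centroids."""
    centroids = [points[i] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members)
                                     for dim in zip(*members))
    return labels, centroids


def grade_users(utilities_by_user, k=2):
    """Cluster users, then rank clusters by centroid total utility (grade 1 = lowest)."""
    users = sorted(utilities_by_user)
    points = [feature_vector(utilities_by_user[u]) for u in users]
    labels, centroids = kmeans(points, k)
    order = sorted(range(k), key=lambda c: centroids[c][1])
    grade_of_cluster = {c: rank + 1 for rank, c in enumerate(order)}
    return {u: grade_of_cluster[lab] for u, lab in zip(users, labels)}


grades = grade_users({
    "u1": [10.0, 12.0],
    "u2": [11.0, 9.0, 8.0],
    "u3": [400.0, 380.0, 420.0, 390.0],
    "u4": [500.0, 450.0, 470.0],
})
```

In this sample the two low-spending users receive grade 1 and the two high-spending users receive grade 2. The features are left unscaled for brevity, so total utility dominates the distance; a real pipeline would likely normalize each dimension first.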

Description

Technical field

[0001] The invention relates to the field of big data, in particular to a data mining method, device and system.

Background technique

[0002] In the field of big data applications, user groups can often be divided into several categories according to various behavioral characteristics of users, so as to provide precise and personalized services according to the characteristics of user groups. Clustering is a way to divide user groups. Clustering is the process of dividing data objects into classes, so that objects in the same class have a high degree of similarity, while objects in different classes are highly dissimilar. Dissimilarity is usually measured using distance. Cluster analysis has been widely used in various fields, such as market research, data analysis, pattern recognition and so on.

[0003] However, the effect of dividing user groups based on user behavior characteristics in the clustering operation depends to a large extent on the quality ...
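The statement in [0002] that dissimilarity is usually measured using distance can be illustrated with Euclidean distance. In the sketch below the two groups of points and the helper names are invented for the example; it only shows that, for a reasonable grouping, the average distance inside a class is smaller than the average distance between classes.

```python
import math
from itertools import combinations


def mean_pairwise_distance(points):
    """Average Euclidean distance over all pairs inside one class."""
    pairs = list(combinations(points, 2))
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


def mean_cross_distance(group_a, group_b):
    """Average Euclidean distance between members of two different classes."""
    total = sum(math.dist(a, b) for a in group_a for b in group_b)
    return total / (len(group_a) * len(group_b))


# Two hypothetical classes of two-dimensional feature vectors.
low = [(1.0, 2.0), (1.5, 1.8), (1.2, 2.2)]
high = [(8.0, 9.0), (8.5, 9.5), (7.8, 8.7)]

within = (mean_pairwise_distance(low) + mean_pairwise_distance(high)) / 2
between = mean_cross_distance(low, high)
```

Here `within` is far smaller than `between`, which matches the definition of a good clustering given in [0002].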

Application Information

IPC(8): G06F17/30
CPC: G06F16/2465, G06F2216/03
Inventor: 侯捷, 李爱华, 葛胜利
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD