Method, system and device for data mining on basis of cloud computing

Data mining and cloud computing technology, applied in computing, transmission systems, multi-program devices, etc. It addresses low efficiency and the inability to meet massive data processing needs, and improves data mining efficiency.

Inactive Publication Date: 2012-07-11
CHINA MOBILE COMM GRP CO LTD
3 Cites, 46 Cited by

AI Technical Summary

Problems solved by technology

[0006] In view of this, the embodiments of the present invention provide a data mining method, system and device based on cloud computing

Examples

Embodiment Construction

[0030] In order to improve the efficiency of data mining and broaden the range of applications of data mining methods, the embodiments of the present invention provide a data mining method, system and device based on cloud computing. Mining tasks are executed in parallel, which effectively improves the efficiency of data mining.

[0031] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0032] Figure 1 is a structural diagram of the data mining system based on cloud computing provided by an embodiment of the present invention. The system includes a Web server 11, a parallel data mining (Parallel Data Miner, PDM) server 12 and a cloud platform cluster control node 13, wherein:

[0033] The Web server 11 splits the initiated data mining task into multiple subtasks, sends each subtask to the corresponding interface of the PDM server according to the execution logic between the subtasks, and ret...
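As a rough illustration of the splitting and ordered dispatch described for the Web server, the following Python sketch models "execution logic between subtasks" as a dependency graph and sends subtasks in an order that respects it. Everything named here is an assumption made for illustration: the subtask kinds (`etl`, `mine`, `display`), the interface names in `PDM_INTERFACES`, and the `send` callback are hypothetical and are not taken from the patent.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical mapping from subtask kind to a PDM server interface name.
PDM_INTERFACES = {
    "etl": "pdm.etl",
    "mine": "pdm.algorithm",
    "display": "pdm.result",
}


def split_task(task_name):
    """Return {subtask: set of subtasks it depends on} for one mining task.

    The three-stage chain is an assumed example of execution logic.
    """
    return {
        (task_name, "etl"): set(),
        (task_name, "mine"): {(task_name, "etl")},
        (task_name, "display"): {(task_name, "mine")},
    }


def dispatch(subtasks, send):
    """Send each subtask to its interface in dependency order.

    `send(interface, task_name)` is a caller-supplied stand-in for the
    real transport. Returns the subtask kinds in the order they were sent.
    """
    order = list(TopologicalSorter(subtasks).static_order())
    for task_name, kind in order:
        send(PDM_INTERFACES[kind], task_name)
    return [kind for _, kind in order]
```

A caller would pass its own transport function as `send`; a topological sort is only one possible reading of "execution logic", chosen here because it guarantees that a subtask is never dispatched before the subtasks it depends on.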

Abstract

The invention discloses a method, system and device for data mining based on cloud computing, which address the low efficiency of the data mining process and its inability to meet massive data processing requirements. When the parallel data mining (PDM) server receives the subtasks into which the web server has split a user's data mining request, it determines the parallel job task corresponding to each subtask according to a saved parallel algorithm, sends the parallel job tasks to the cluster control node of a cloud platform, integrates the mining data fed back by the cluster control node, and provides the integrated result to the web server. Because the data mining process is delivered in web mode in the embodiments of the invention, the data mining method can be offered to multiple users simultaneously; and because mining runs as parallel job tasks, data mining efficiency is effectively improved.
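The PDM server flow in the abstract (turn a subtask into parallel job tasks, hand them to the cluster, integrate what comes back) can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the patented implementation: a local `ThreadPoolExecutor` stands in for the cloud platform cluster control node, item frequency counting stands in for a saved parallel algorithm, and all function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def make_parallel_jobs(records, n_parts):
    """Partition records into n_parts slices; each slice is one parallel job."""
    return [records[i::n_parts] for i in range(n_parts)]


def count_items(part):
    """Job body (assumed algorithm): item frequency counts for one partition."""
    counts = {}
    for item in part:
        counts[item] = counts.get(item, 0) + 1
    return counts


def integrate(partials):
    """Merge the per-partition counts into a single result."""
    merged = {}
    for partial in partials:
        for key, value in partial.items():
            merged[key] = merged.get(key, 0) + value
    return merged


def run_subtask(records, n_parts=3):
    """Run one subtask as n_parts parallel jobs and return the merged result."""
    jobs = make_parallel_jobs(records, n_parts)
    # The thread pool is a local stand-in for the cluster control node.
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        partials = list(pool.map(count_items, jobs))
    return integrate(partials)
```

For example, `run_subtask(["a", "b", "a", "c", "a", "b"])` counts three occurrences of `"a"`, two of `"b"` and one of `"c"`, regardless of how the records were partitioned.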

Description

Technical field

[0001] The present invention relates to the technical field of data mining, and in particular to a data mining method, system and device based on cloud computing.

Background

[0002] Data mining is the process of extracting hidden, previously unknown but potentially useful information from large amounts of incomplete, noisy, fuzzy, and random real-world application data. The data mining process usually includes four main steps: data preprocessing (ETL), data mining algorithm implementation, result display, and model solidification and online release.

[0003] Existing data mining processes are generally implemented on stand-alone nodes, which carry out the mining steps serially. When data mining is performed on a stand-alone node, data preprocessing, the data mining algorithm, result display, and model solidification are all implemented on that single node. Therefore, t...

Application Information

IPC(8): G06F17/30, G06F9/46, H04L29/08
Inventor: 邓超, 徐萌, 高丹, 江志雄, 罗治国, 孙少陵, 陶涛, 段云峰, 何鸿凌
Owner CHINA MOBILE COMM GRP CO LTD