Classification rule mining method under cloud computing environment

A cloud computing environment and rule technology, applied in computing, special data processing applications, instruments, etc., can solve problems such as overfitting, surge in search space and dimensions, and inefficient algorithms, so as to reduce computational complexity and solve The effect of poor parallelism and good parallelism

Inactive Publication Date: 2012-10-17
HEFEI UNIV OF TECH
View PDF5 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods still have bottleneck problems, including the need to scan and sort the data sets multiple times, resulting in inefficient algorithms; they are sensitive to noise and real data, and are prone to overfitting; the scalability of large training sets is not good. good wait
Especially in the cloud computing environment, the large-scale and dynamic nature of distributed massive data sets lead to a sharp increase in the search space and dimensions of the data classification process, which increases the computational complexity of classification and reduces the efficiency of traditional classification methods. The mining method of classification rules cannot be directly applied in the cloud computing environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification rule mining method under cloud computing environment
  • Classification rule mining method under cloud computing environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In the classification rule mining method under the cloud computing environment of this embodiment:

[0031] The cloud computing environment is composed of multiple distributed servers; when mining classification rules in the cloud computing environment, a master-slave organizational structure is adopted. The master-slave organizational structure is to set one server as the control center and other servers as slave servers; The control center arranges and deploys the execution of the entire mining task, schedules management and coordinates the operation of each slave server; each slave server is the specific execution unit of the task. Classification rule mining methods such as figure 1 As shown, proceed as follows:

[0032] 1. The control center divides the data set to be classified into training samples and test samples, and evenly divides the training samples to obtain data blocks of the same size, and assigns a slave server to perform classification mining tasks for...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a classification rule mining method under a cloud computing environment. The classification rule mining method is characterized in that a master-slave type organization structure consisting of a control center and a plurality of slave servers is adopted, and comprises the following steps of: dividing, by the control center, a data set to be classified into a training sample and a test sample, and distributing each of data blocks of the same size which are formed by uniformly dividing the training sample to one processing unit; training, by the processing units, the data blocks, by using a genetic algorithm to obtain an atomic rule for classification; and finally, reducing the atomic rule by a classifier, and selecting a reduction result which meets classification accuracy requirement as a final result of the classification rule mining. The classification rule mining method is suitable for data classification on distributed data storage under the cloud computing environment; distributed parallel processing of data classification tasks under the cloud computing environment can be performed; and the classification rule mining method has a positive effect on the classification processing problem of mass data under the cloud computing environment.

Description

technical field [0001] The invention belongs to the technical field of data analysis in a cloud computing environment, and in particular relates to a classification rule mining method in a cloud computing environment. Background technique [0002] Classification technology research is an important research field of data analysis and management in cloud computing environment. On the one hand, classification is an important task type of data mining. The data in the cloud computing environment has the characteristics of massiveness, distribution and dynamics. These characteristics bring challenges to data management in the cloud computing environment. The analysis of these data will help improve the efficiency of massive data analysis and management in the cloud computing environment. On the other hand, the cloud environment has ultra-large-scale storage and computing capabilities, dynamic scalability of resources and structures, and provides services on demand through virtual...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/12H04L29/08
Inventor 杨善林丁静罗贺丁帅徐达宇范雯娟
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products