Multi-objective parallel attribute reduction method based on Spark and ant colony optimization

An attribute reduction and multi-objective technology, applied in the field of big data cloud, can solve problems such as the inability to effectively deal with big data attribute reduction, achieve the effect of enriching methods and application scope, and eliminating redundancy

Active Publication Date: 2019-09-10
GUILIN UNIV OF ELECTRONIC TECH
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problem that traditional algorithms cannot effectively deal with big data attribute reduction and the combined optimization problem faced when solving attribute reduction, the present invention provides a multi-objective parallel attribute reduction method based on Spark and ant colony optimization for solving big data The attribute reduction problem can effectively obtain the minimum attribute reduction while processing large data, and at the same time reduce the time complexity of calculating the attribute importance from O(n 2 ) down to O(|n|)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-objective parallel attribute reduction method based on Spark and ant colony optimization
  • Multi-objective parallel attribute reduction method based on Spark and ant colony optimization
  • Multi-objective parallel attribute reduction method based on Spark and ant colony optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the objectives, technical solutions, and advantages of the present invention clearer, the present invention will be further described in detail below with reference to specific examples and drawings.

[0052] The invention is based on ant colony optimization algorithm and Spark parallel processing technology for solving the minimum attribute reduction under big data. Use the good global optimization ability of the ant colony optimization algorithm to solve the minimum attribute reduction set, and the parallelism of the "equivalence class" calculation in the rough set theory for parallel calculation under big data, and use the improved information gain rate as heuristic information In the process of calculating heuristic information, based on Spark distributed parallel computing, a new strategy for multi-objective parallel solving is proposed, which can simultaneously calculate the importance of multiple attributes relative to the current attributes, which g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-objective parallel attribute reduction method based on Spark and ant colony optimization. A thought of combining a cloud computing Spark parallel technology and an intelligent ant colony algorithm is introduced into rough set theoretical attribute reduction; on the basis, the information gain rate is used as heuristic information, and an innovative strategy of redundancy detection is carried out on the selected attribute and each generation of optimal solution, so that the algorithm can be quickly converged to the global optimal solution, the possibility that the redundancy attribute is added to a reduction set can be effectively avoided, and redundancy caused by random selection of the initial attribute is eliminated. Besides, a multi-target parallel solving strategy is adopted when heuristic information is calculated, the heuristic information of multiple attributes relative to the current attribute can be solved at the same time, and the time complexity is reduced from O (| n2 |) to O (| n |).

Description

Technical field [0001] The invention relates to the technical field of big data cloud, in particular to a multi-objective parallel attribute reduction method based on Spark and ant colony optimization. Background technique [0002] Attribute reduction is one of the important research contents of rough set theory and a key step in knowledge acquisition. The so-called attribute reduction refers to the deletion of unnecessary knowledge under the condition that the classification ability of the information system remains unchanged. By deleting redundant attribute information, the potential clarity of the information system can be improved, high-quality cleaning data can be obtained, and effective information with theoretical analysis and application value can be mined. [0003] The development of information technology and the continuous increase of data scale make traditional data mining methods, including attribute reduction algorithm, face the challenge of data scale and computing ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/182G06F16/174G06N3/00
CPCG06F16/182G06F16/174G06N3/006
Inventor 危前进魏继鹏
Owner GUILIN UNIV OF ELECTRONIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products