Feature selection method based on rough set and swarm intelligence
A feature selection method and rough set technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as high computational complexity, time-consuming, and inability to guarantee optimal feature subsets.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0084] Embodiment 1 applies the present invention to 12 discrete data sets (low dimension) to carry out the test of feature selection
[0085] Experimental environment and data:
[0086] Hardware environment: Intel Core i5 3470-3.20GHz, 8.0GB memory, 1TB hard disk.
[0087] Software environment: Matlab R2017a, 64-bit Windows 7 operating system.
[0088] Experimental data: Select 12 discrete datasets (see Table 1) in the machine learning UCI database (UCI Machine Learning Repository[DB / OL].http: / / archive.ics.uci.edu / ml / datasets.html) as a test data set.
[0089] Table 1 12 test data sets (low dimension)
[0090]
[0091] In table 1, the title of each data set, the number of instances and the number of conditional features it contains, the number of conditional features, the feature evaporation rate, the missing data ratio and the adoption of the Weka 3 tool ( Data MiningSoftware in Java[EB / OL].http: / / www.cs.waikato.ac.nz / ml / weka / index.html) to complete the specific metho...
Embodiment 2
[0146] Embodiment 2 applies the present invention to 8 discrete data sets (high dimension) to carry out the test of feature selection
[0147] Experimental environment and data:
[0148] Hardware environment: Intel Core i5 3470-3.20GHz, 8.0GB memory, 1TB hard disk.
[0149] Software environment: Matlab R2017a, 64-bit Windows 7 operating system.
[0150] Experimental data: 8 discrete data sets (see Table 6) in the machine learning UCI database are selected as test data sets.
[0151] Table 6 8 test data sets (high dimension)
[0152]
[0153]
[0154] In Table 6, there are 5 data sets (Arrhythmia, Hill, Musk1, Musk2, and Semeion) with less than 300 conditional features, and the remaining 3 data sets (Isolet, Micromass, and Secom) have more conditional features, all of which are within 500 Above, especially the dataset Micromass, the number of features reaches 1300. At the same time, there are 3 large-scale data sets, and the product of the number of conditional features...
Embodiment 3
[0156] Embodiment 3 Performance test of this method (RPA) on 12 low-dimensional data sets
[0157] In order to examine the performance of this method (RPA), 20 tests are carried out to each data set in embodiment 1, and four kinds of feature selection methods (heuristic method QUICKREDUCT (A.Chouchoulas, Q.Shen.Rough Set-AidedKeyword Reduction for Text Categorization[J].Applied Artificial Intelligence,2001,15(9):843-873) and MIBARK(Miao Duoqian, Hu Guirong. A heuristic algorithm for knowledge reduction[J]. Computer Research and Development, 1999,36(6):681-684), a hybrid method IDS based on swarm intelligence (C.S.Bae, W.C.Yeh, Y.Y.Chung, S.L.Liu. Feature Selection with Intelligent Dynamic Swarm and RoughSet[J]. Expert Systems with Applications, 2010 ,37(10):7026-7032) and NDABC(Y.R.Hu,L.X.Ding,D.T.Xie,S.W.Wang.A Novel Discrete Artificial Bee Colony Algorithm for Rough Set-Based Feature Selection[J].International Journal of Advancements in Computing Technology,2012 ,4(6):295-3...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


