Neighborhood rough set ensemble learning method based on attribute clustering

A technology of neighborhood rough set and integrated learning, which is applied in the direction of instruments, character and pattern recognition, computer components, etc., and can solve problems such as inability to deal with discrete data

Active Publication Date: 2016-07-13
CHONGQING UNIV OF POSTS & TELECOMM
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The neighborhood rough set model is an extended model of the classic rough set theory by Hu Qinghua et al. using

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neighborhood rough set ensemble learning method based on attribute clustering
  • Neighborhood rough set ensemble learning method based on attribute clustering
  • Neighborhood rough set ensemble learning method based on attribute clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Below in conjunction with accompanying drawing, the present invention will be further described:

[0040] A neighborhood rough set integrated learning method based on attribute clustering, including the following steps: First, in the data preprocessing stage, the data is normalized, and normalization refers to the linear transformation of the original data, so that the result value is mapped to [0-1]. Then, in the attribute clustering stage, by calculating the information gain of the attribute and selecting the attribute with a large information gain as the main attribute set, which is the center point of attribute clustering, and calculating the similarity between the remaining conditional attributes and the cluster center point, According to the similarity value, the attribute can be divided into several attribute clusters. Finally, in the base classifier ensemble training stage, first use the principle of neighborhood rough set to obtain the boundary domain of the t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention requests to protect a neighborhood rough set ensemble learning method based on attribute clustering, and relates to a data mining technology. First, the condition attributes of a decision system are divided into multiple clusters through attribute clustering, wherein the correlation between the attributes in the attribute clusters is large, and the correlation between different attribute clusters is small; second, different base classifiers are trained and integrated based on the difference between the clusters, guidance of a neighborhood rough set is added to the process of base classifier training and integrating, and the weights of the base classifiers are allocated according to the identification ability of the base classifiers to samples in the boundary region of the neighborhood rough set so as to get a final integrated classifier; and finally, test sets are classified by the obtained integrated classifier. According to the invention, a neighborhood rough set and the theory of ensemble learning are combined, the correlation and the difference between the condition attributes in a decision table are fully utilized, different base classifiers complement each other, and the knowledge in the decision system can be mined effectively from different angles.

Description

technical field [0001] The invention belongs to the field of data mining and pattern recognition, in particular to a neighborhood rough set integrated learning method after attribute clustering is carried out by using attribute correlation. Background technique [0002] The attributes of information systems in real life are not only diverse, but also often have certain correlations among attributes. If a single data mining algorithm is used directly for knowledge discovery, the effect is often not good. For this reason, methods such as attribute clustering and ensemble learning are used in data mining, which can effectively improve the effect of knowledge discovery. [0003] Attribute clustering: Clustering algorithm is an important method of unsupervised pattern recognition. It groups the sample space according to a certain similarity measure, so that the data in the group are similar to each other, and the similarity distance between the data in the group is large, so that...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/2321G06F18/285
Inventor 胡峰石瑾于洪张清华
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products