Unlock instant, AI-driven research and patent intelligence for your innovation.

Self-adaptive rapid K-means clustering method fusing feature learning

A technology of k-means clustering and feature fusion, which is applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems that the feature subspace is greatly different from the original feature space, and it is difficult to apply high-dimensional data processing. To achieve the best effect of clustering

Inactive Publication Date: 2019-07-05
XIAMEN UNIV OF TECH
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although such methods can effectively improve the clustering accuracy of K-means, they all need to use the eigendecomposition operation to solve the optimal feature subspace, and its computational complexity will increase with the quadratic level of the feature dimension of the data to be processed, and the obtained The feature subspace is quite different from the original feature space, and it is difficult to apply to high-dimensional data processing in real application scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-adaptive rapid K-means clustering method fusing feature learning
  • Self-adaptive rapid K-means clustering method fusing feature learning
  • Self-adaptive rapid K-means clustering method fusing feature learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The technical solutions and beneficial effects of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0042] Such as figure 1 As shown, the present invention provides an adaptive fast K-means clustering method for fusion feature learning. In this embodiment, unlabeled data is clustered by means of fusion feature selection and K-means clustering method, including the following steps:

[0043] Step 1. Preprocess the data to be processed, remove the missing attributes, data duplication and other problems in the data, and normalize each data attribute, and then obtain n groups of unlabeled data including D features X=[x 1 , x 2 ,...,x n ]∈R D ×n , where x i ∈R D×1 Indicates the i-th data sample, i=1, 2, ..., n;

[0044] Step 2, calculate the total scatter matrix of the data

[0045]Step 3, set the number of sub-features d, the number of categories c, parameters λ and σ, initialize the weight matrix Δ as the iden...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a self-adaptive rapid K-means clustering method fusing feature learning. The method comprises the steps: firstly, preprocessing data, eliminating the problems of attribute deficiency and data repetition, and conducting normalization processing on all data attributes; calculating a data total divergence matrix, and introducing sparse characteristics to construct a feature selection matrix; performing K-means clustering method on feature subspace, introducing an adaptive factor to dynamically adjust the weight of each data sample in the updating process of a clustering center; and updating the feature selection matrix according to distinguishable information between the clusters, and further screening out an optimal feature subset. The method makes the traditional K-means clustering method efficiently use distinguishable information between the clusters and in the clusters and correlation information between the features to improve the clustering accuracy. In addition, adaptive factors are fused in the clustering process, the clustering center is updated according to the distribution characteristics of different types of data, high practicability and expandability are achieved, and effective support can be provided for machine learning, computer vision and other related applications.

Description

technical field [0001] The invention belongs to the technical field of machine learning, in particular to an adaptive fast K-means clustering method for fusion feature learning. Background technique [0002] Clustering method is a technology widely used in the field of machine learning, among which the K-means method is the most widely used, and it has achieved good results in various fields such as data mining, medical treatment, and education. However, with the rapid development of multimedia technology and Internet technology, the explosive growth of high-dimensional data has brought great challenges to the traditional K-means method. Due to redundant features and noise features in high-dimensional data, directly applying K-means clustering to such data not only consumes a lot of computing resources, but also affects its clustering accuracy. The latest research shows that if the data features are dimensionally reduced in advance, the clustering efficiency of K-means will...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213G06F18/21345G06F18/10
Inventor 王晓栋严菲曾志强陈玉明洪朝群
Owner XIAMEN UNIV OF TECH