Unlock instant, AI-driven research and patent intelligence for your innovation.

A Clustering Method Based on Deep Neural Networks and Pairwise Constraints

A deep neural network and neural network technology, which is applied in the field of clustering based on pairwise constraints between data, can solve the problems of ignoring the pairwise constraints of the original data, and the clustering accuracy cannot be further improved, so as to achieve the effect of improving the accuracy.

Active Publication Date: 2020-07-28
SOUTH CHINA UNIV OF TECH
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current use of neural networks for dimensionality reduction operations is to directly learn the characteristics of the original data, while ignoring the pairwise constraints between the original data.
[0004] Since most of the existing high-dimensional clustering methods using neural networks have the disadvantage of directly learning the characteristics of the original data, even if a large amount of data is used to train the neural network, the accuracy of clustering cannot be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Clustering Method Based on Deep Neural Networks and Pairwise Constraints
  • A Clustering Method Based on Deep Neural Networks and Pairwise Constraints
  • A Clustering Method Based on Deep Neural Networks and Pairwise Constraints

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] like figure 1 , a clustering method based on deep neural networks and pairwise constraints, including the following steps:

[0034] Step S1, a given data set S;

[0035] In this step, since labeled data is often difficult to obtain, experts are generally required to label; compared with labeled data, pairwise constraints are easier to obtain. So given a dataset including pairwise constraints.

[0036] Step S2: Preprocessing S to obtain the difference vector DV;

[0037] The difference vector DV is obtained by making a difference between the samples in the two data sets, and the obtained DV is used to train the autoencoder network.

[0038] Step S3: constructing an autoencoder network and a deep neural network;

[0039] Build an autoencoder network and a deep neural network, where the input to the deep neural network is the output of the encoding network in the autoencoder network. The autoencoder network is used to reduce the dimension of the network, and the deep ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a clustering method based on a deep neural network and pairwise constraints. A data set containing pairwise constraints between data is given; a difference vector between samples of the data set is obtained; an autoencoder network and a deep neural network are constructed. Network; the data set sample is used as the input of the self-encoding network, the input data set sample is used as the output of the self-encoding network to train the network, the output of the bottleneck of the self-encoding network is used as the input of the deep neural network, and the pairwise constraints are used as the correct label training network ; Combine trained autoencoder network and deep neural network to clustering algorithm; use clustering algorithm for clustering tasks. The invention combines the pairwise constraints between the data in the original data set, performs dimensionality reduction operations on the input data and deep neural network learning features through the autoencoder network, and proposes the loss function of the network model and its optimization algorithm based on gradient descent. Effectively improve the clustering accuracy of the clustering algorithm.

Description

technical field [0001] The invention relates to the technical field of clustering methods based on deep neural networks and pair constraints and high-dimensional clustering, in particular to a method for clustering based on pair constraints between data. Background technique [0002] Data clustering, also known as unsupervised learning, is an efficient method for dividing a set of data objects into several clusters. But unsupervised learning cannot know what each cluster represents, because it clusters unlabeled data. With the continuous deepening of network informatization, the total amount of data on the entire Internet is constantly increasing. How to fully explore and utilize the useful information contained in the data has become a hot issue in the field of computer science in recent years. High-dimensional clustering is a common problem. The specific performance is that traditional clustering algorithms encounter difficulties in clustering high-dimensional data spaces...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/2321
Inventor 黄嘉桥王家兵
Owner SOUTH CHINA UNIV OF TECH