Image depth clustering method and system based on self-supervised contrast learning

A technology of image depth and clustering method, applied in the field of machine learning, can solve problems such as poor performance, wasting model performance, limiting model discrimination, etc., achieve strong discrimination, high cosine similarity, and prevent clustering degradation.

Active Publication Date: 2021-01-12
TSINGHUA UNIV
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the existing technology, the probability-based deep clustering method provides a systematic solution, which can prevent degenerate solutions concisely and effectively, but the existing methods limit the discriminability of the model
Methods based on deep generative models try to introduce a mixed Gaussian distribution prior to the latent representation of data to model multimodal data, but waste model performance in the process of learning data generation
However, discriminative clustering directly learns the mapping from input data to cluster labels, which is usually better than deep generation methods. Although the discriminative method has high computational efficiency, directly outputting low-dimensional cluster labels forces the model to discard the information of sample granularity, which limits Performance of feature learning and clustering
Existing probabilistic methods perform well on simple image datasets (such as MNIST), but perform poorly on more complex image datasets due to insufficient discriminative models

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image depth clustering method and system based on self-supervised contrast learning
  • Image depth clustering method and system based on self-supervised contrast learning
  • Image depth clustering method and system based on self-supervised contrast learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0060] see figure 1 , providing a method for deep clustering of images based on self-supervised contrastive learning, the image is divided into a sample set, a training set and a test set, the method for deep clustering of images based on self-supervised comparative learning includes the following steps:

[0061] S1: Assign an index value to each sample in the sample set, and use the index value as the self-supervised label of the corresponding sample; the training set includes samples, unknown cluster labels and the self-supervised label; through the training Set training versus hybrid expert systems;

[0062] S2: Initialize the cluster embedding of the gating function through the maximum Mahalanobis distance distribution, and randomly initialize the cluster embedding of the comparative mixed expert system;

[0063] S3: For each sample in the sample set, use the student network and the teacher network to extract sample embeddings; record the sample embedding extracted by the...

Embodiment 2

[0094] see figure 2 , providing an image depth clustering system based on self-supervised contrastive learning. The image is divided into a sample set, a training set and a test set. The image deep clustering system based on self-supervised contrastive learning includes:

[0095] Index value assignment module 1 is used to assign an index value to each sample in the sample set, and use the index value as the self-supervised label of the corresponding sample; the training set includes samples, unknown clustering labels and the self-supervised label; train the contrast hybrid expert system by said training set;

[0096] The cluster embedding initialization module 2 is used for initializing the cluster embedding of the gating function through the maximum Mahalanobis distance distribution, and randomly initializing the cluster embedding of the contrast mixed expert system;

[0097] The sample embedding extraction module 3 is used for each sample in the sample set, using the stude...

Embodiment 3

[0116] A computer-readable storage medium is provided, wherein the computer-readable storage medium stores program codes for image depth clustering based on self-supervised contrastive learning, and the program codes include implementation of Embodiment 1 or any possible Instructions for implementing a self-supervised contrastive learning-based deep clustering method for images.

[0117] The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server, a data center, etc. integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, DVD), or a semiconductor medium (for example, a solid state disk (SolidStateDisk, SSD)) and the like.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

According to the image deep clustering method and system based on self-supervised comparative learning, comparative learning is utilized to improve the discrimination of embedding, and under the condition of not giving human annotations, the comparative learning can learn the embedding with high cosine similarity and strong discrimination for semantically similar samples by discriminating the samples. On the basis, according to the technical scheme, the subtasks capable of simplifying the learning process are mined, and due to the fact that the intra-class difference of samples of the same class is smaller than that of samples of different classes, it is determined that the subtasks are the most natural division mode according to the classes of the samples. Therefore, compared with a mixedexpert system, highly professional experts are encouraged, each expert is good at processing samples of a specific category, and a good clustering result is naturally obtained. Meanwhile, compared with a hybrid expert system, a single objective function is optimized, clustering degradation can be prevented without processing such as pre-training or regular terms, and the method can be applied tounsupervised clustering tasks of more complex images.

Description

technical field [0001] The invention relates to the technical field of machine learning, in particular to an image depth clustering method and system based on self-supervised comparative learning. Background technique [0002] At present, unsupervised clustering is one of the challenging difficulties of deep neural networks. Since labeling a large amount of data requires high cost and intensive manpower, and the quality of labeling is difficult to guarantee, clustering is often required to divide the data into different Subsets can be used to mine the structural characteristics of the data set. [0003] At present, classical clustering methods such as k-means algorithm, spectral clustering and hierarchical clustering are not very effective when dealing with high-dimensional data. With the rapid development of deep learning, many studies have tried to combine classical clustering and deep learning methods, aiming to use deep neural networks to extract features, and then dire...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23G06F18/214
Inventor 朱军蔡淙崴李崇轩
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products