Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Convolutional neural network-based unsupervised multi-modal subspace clustering method

A convolutional neural network, multi-modal technology, applied in the field of unsupervised multi-modal subspace clustering, which can solve problems such as increasing data dimensions

Inactive Publication Date: 2018-10-09
SHENZHEN WEITESHI TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of the problem that the previous subspace clustering method relies on the spatial correspondence between modalities and increases the data dimension when outputting, the purpose of the present invention is to provide an unsupervised multimodal subspace based on convolutional neural network For the clustering method, for the input multimodal data, the encoder is used to realize the spatial fusion first, and then it is fused into the latent space representation through the space fusion network, and then the fusion result is input into the self-expression layer, and the self-expression is used in the latent space representation. The space encodes the joint representation, and finally, the joint representation generated by the output of the self-expression layer is input into the multimodal decoder, and the different modalities are reconstructed to obtain the final clustering result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network-based unsupervised multi-modal subspace clustering method
  • Convolutional neural network-based unsupervised multi-modal subspace clustering method
  • Convolutional neural network-based unsupervised multi-modal subspace clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described in detail below in conjunction with the drawings and specific embodiments.

[0038] figure 1 It is a system flowchart of an unsupervised multimodal subspace clustering method based on a convolutional neural network in the present invention. It mainly includes multimodal encoder, self-expression layer and multimodal decoder.

[0039] A multimodal encoder refers to taking multimodal data as input and fusing it into a latent spatial representation through a spatial fusion network.

[0040] Among them, the spatial fusion network uses three different fusion techniques, which can provide modal representations of different spatial positions, and learn a joint representation containing complementary information of different modalities, and each mode in the joint repre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a convolutional neural network-based unsupervised multi-modal subspace clustering method. The method mainly comprises a multi-modal encoder, a self-expression layer and a multi-modal decoder. The process of the method is as follows: firstly, for input multi-modal data, the spatial fusion is realized through an encoder and the input data are fused into a potential space expression through a space fusion network. Secondly, a fused result is input into the self-expression layer, and the combined expression is coded in the potential space through self-expression. Finally, the combined expression generated by the output of the self-expression layer is input into the multi-modal decoder, so that different modalities are reconstructed. In this way, a final clustering resultis obtained. According to the invention, the problem that the existing subspace clustering method depends on the spatial correspondence between modals and the data dimension can be increased is solved. The combined expression can be obtained by utilizing the self-expression of the modal, and the accuracy of subspace clustering is improved.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to an unsupervised multi-modal subspace clustering method based on a convolutional neural network. Background technique [0002] Any practical application in image processing, image recognition, and speech processing needs to deal with very high-dimensional data, however, these data are usually located in low-dimensional subspaces, and subspace clustering methods use different subspaces in a data set By finding clusters in the space, high-dimensional data can be processed. The subspace clustering method can be applied to image processing, which can effectively improve the efficiency of data processing; in terms of image recognition, the subspace clustering method can process different forms of images of the same object to improve recognition accuracy; also in speech processing , applying the subspace clustering method can more effectively deal with human sentences in various tones. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/04
CPCG06N3/045G06F18/231G06F18/251G06F18/214
Inventor 夏春秋
Owner SHENZHEN WEITESHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products