Semi-supervised clustering method and semi-supervised clustering system based on nonnegative matrix factorization

A technique of non-negative matrix decomposition and semi-supervised clustering, which is applied in the field of semi-supervised clustering methods and systems, and can solve problems such as difficulty in obtaining information, failure to improve clustering performance, and ineffective use of the internal structure of original data.

Active Publication Date: 2015-11-04
ZHANGJIAGANG INST OF IND TECH SOOCHOW UNIV +1
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method forces the representation of the projection space to have the same type of label as the data in the original space. The disadvantage of this method is that when there are few known labeled data, the method degenerates into NMF, which cannot effectively use the internal structure of the original data. Class performance is not improved
In addition, the CNMF method uses not a constraint pair, but a hard mark, and this information is generally difficult to obtain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised clustering method and semi-supervised clustering system based on nonnegative matrix factorization
  • Semi-supervised clustering method and semi-supervised clustering system based on nonnegative matrix factorization
  • Semi-supervised clustering method and semi-supervised clustering system based on nonnegative matrix factorization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0055] like figure 1 As shown, a semi-supervised clustering method based on non-negative matrix decomposition disclosed in the present invention includes:

[0056] S101, performing a non-negative matrix decomposition projection on the original data matrix to obtain a low-dimensional approximate matrix of the original data with both neighborhood preservation and similarity preservation;

[0057] First, perform non-negative matrix deco...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a semi-supervised clustering method based on nonnegative matrix factorization, which comprises the steps of carrying out nonnegative matrix factorization projection on an original data matrix, and acquiring a low-dimension approximate matrix, which has both neighborhood preserving and similarity preserving, of original data; carrying out clustering on the low-dimension approximate matrix of the original data by using an algorithm receiving parameter K to acquire a clustering result; and evaluating the clustering result by using two types of evaluation standards of precision and mutual information. The semi-supervised clustering method disclosed by the invention is based on nonnegative matrix factorization, not only considers neighborhood preserving of the original data, but also considers the consistency of similarity in an original space and a low-dimension manifold subspace, so that the clustering performance is enabled to be greatly improved when prior information is great in amount, and the clustering performance can still be well preserved when the prior information is little. The invention further discloses a semi-supervised clustering system based on nonnegative matrix factorization.

Description

technical field [0001] The invention relates to the technical field of cluster analysis, in particular to a semi-supervised clustering method and system based on non-negative matrix decomposition. Background technique [0002] In recent years, non-negative matrix factorization technology has played a very important role in pattern recognition and artificial intelligence. There has been research showing evidence based on partial representations of the human brain, both psychologically and physiologically. Non-negative matrix representations have inherent advantages in learning partial representations similar to faces, images, and documents. Also, in many problems such as information retrieval, computer vision, and pattern recognition, the data are characterized by high dimensionality, making learning directly from examples infeasible. Researchers expect to decompose high-dimensional data matrix to obtain low-dimensional representation after high-dimensional matrix decomposi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23
Inventor 路梅赵向军李凡长张莉
Owner ZHANGJIAGANG INST OF IND TECH SOOCHOW UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products