Clustering method for performing matrix decomposition by taking subset grouping as auxiliary information

A technology of auxiliary information and matrix decomposition, applied in the direction of instruments, character and pattern recognition, computer parts, etc., can solve the problems of low efficiency and large human labor

Inactive Publication Date: 2017-11-07
ZHEJIANG UNIV OF TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem solved by the present invention is that in the prior art, the semi-supervised clustering method has a total of n objects for a data set containing n objects 2 There are two possible pairwise relationships, so a large enough number of constraints is required to achieve satisfactory results. In practical applications, obtaining a large number of constraints requires a large amount of human labor and low efficiency, thus providing an optimized A Clustering Method Using Subset Grouping as Auxiliary Information for Matrix Decomposition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method for performing matrix decomposition by taking subset grouping as auxiliary information
  • Clustering method for performing matrix decomposition by taking subset grouping as auxiliary information
  • Clustering method for performing matrix decomposition by taking subset grouping as auxiliary information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The present invention will be described in further detail below in conjunction with the examples, but the protection scope of the present invention is not limited thereto.

[0039] The invention relates to a clustering method for performing matrix decomposition with subset grouping as auxiliary information, and the method includes the following steps.

[0040] Step 1.1: Collect the grouping results of several subsets of the data set to be grouped by the user, and thus obtain each object x in the data set to be grouped i neighbor set of and distant neighbor set

[0041] In the step 1.1, the neighbor set and distant neighbor set The division method includes the following steps.

[0042]Step 1.1.1: For any subset S of samples to be grouped, divide S into k groups C 1 ,C 2 ,...C k , the k groups satisfy C 1 ∪C 2 ...∪C k =S, and q≠p,

[0043] In the present invention, namely k groups C 1 ,C 2 ,...C k The two do not cross each other.

[0044] Step 1.1....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a clustering method for performing matrix decomposition by taking subset grouping as auxiliary information, which is characterized in that grouping information of a certain number of subsets is obtained through collecting grouping results of a user for different subset to act as guidance for clustering, a close neighbor set and a far neighbor set of objects in the subset are obtained based on the results, and the category of each object in the subset is enabled to be close to the category of objects in the close neighbor set but be different from the category of objects in the far neighbor set through a mode of adding regular items to an objective function for matrix decomposition so as to complete clustering. The clustering method not only considers the decomposition error, but also reduces the difference between grouping of objects in the subset and grouping of objects in the close neighbor set and increases the difference between grouping of objects in the subset and grouping of objects in the far neighbor set at the same time in matrix decomposition based on a grouping result of the subset in the previous step, thereby achieving satisfactory results without the need of excessive number of constraints, and is fast in clustering, high in efficiency and low in labor cost in practical application.

Description

technical field [0001] The invention belongs to the technical field of electrical digital data processing, and in particular relates to a clustering method based on machine learning and data mining and using subset grouping as auxiliary information for matrix decomposition. Background technique [0002] In data processing and analysis problems in many fields, it is necessary to use clustering algorithms to group samples in a data set, and then quickly browse, analyze and process the internal structure of the entire data set based on the grouping results. [0003] In order to improve the accuracy of traditional unsupervised clustering methods, semi-supervised clustering using a small amount of supervised information is proposed. Most of the existing semi-supervised clustering methods use the form of constraints between two objects to achieve supervision, so that two objects related to the "must link" constraint are classified into the same class, while the "cannot link" The ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 梅建萍
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products