Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Clustering method and device

A clustering method and clustering technology, applied in character and pattern recognition, instruments, computer components, etc., can solve problems such as low accuracy of clustering results, unstable clustering results, and poor clustering results, and achieve Improve the instability of clustering results, improve clustering efficiency, and solve the effect of poor clustering results

Active Publication Date: 2019-07-05
BEIJING UNIV OF POSTS & TELECOMM
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, according to different thresholds in the threshold list, the feature vectors are clustered multiple times, which can improve the instability of the clustering results caused by the random graph clustering algorithm, and is used to solve the poor clustering effect and clustering problems in the prior art. Technical issues with lower accuracy of class results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method and device
  • Clustering method and device
  • Clustering method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0025] At present, the following two methods are mainly used to realize the clustering of feature vectors:

[0026] The first way is to use partition-based clustering methods, such as the K-means algorithm and the improved K-means algorithm, to cluster the feature vectors, and divide the feature vectors into K (preset value) clusters. Specifically, set the number of clusters of the clustering target in advance, randomly select K feature vectors, and regard them as the center of the cluster; assign the remaining feature...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a clustering method and device, and the method comprises the steps: carrying out the feature extraction of a to-be-clustered object, and obtaining a feature vector; according tothe number of to-be-clustered objects, determining repeated clustering times RT and an iteration times list {Ri} corresponding to a random graph clustering algorithm, the iteration times list {Ri} being composed of iteration times of each time of clustering; a threshold list {Ti} is determined according to the feature vectors, and the threshold list {Ti} is composed of thresholds for constructinga weighted graph during each time of clustering; and clustering the feature vectors according to the repeated clustering times RT, the iteration times list {Ri} and the threshold list {Ti} corresponding to the random graph clustering algorithm. According to the method, the number of iterations of each time of clustering can be determined according to the scale of the to-be-clustered object and the number of repeated clustering times, and for later several times of clustering with a small scale of the to-be-clustered object and a relatively stable clustering result, the small number of iterations can be used, so that the clustering efficiency is improved.

Description

technical field [0001] The invention relates to the technical field of machine learning, in particular to a clustering method and device. Background technique [0002] With the rapid development of machine learning technology and Internet technology, there are more and more scenarios that need to cluster high-dimensional data, such as clustering faces in image data, and clustering text in the field of natural language processing. data clustering etc. In the face of massive data in the Internet, how to realize fast and effective clustering of large-scale high-dimensional data has great practical value. [0003] At present, for high-dimensional data, such as face images and text data clustering methods, feature extraction algorithms are mainly used to extract eigenvectors (or eigenvalues) that can effectively represent data from high-dimensional data, and then through K-means Algorithm, K-means improved algorithm, grid-based clustering algorithm, clustering feature vectors. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/232
Inventor 欧中洪陈忠杰宋美娜宋俊德
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products