Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for hierarchical clustering

A hierarchical clustering and clustering technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as excessive time and resource consumption, unfavorable data analysis, poor clustering results, etc., to reduce computing Amount, saving computing time and resources, reliable effect of clustering results

Active Publication Date: 2015-03-04
XIAOMI INC
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In related technologies, when implementing hierarchical clustering by calculating the distance between classes, the amount of calculation is too large. When there are many data objects contained in a class, it will consume too much time and resources. Moreover, because each class may contain Data objects that do not belong to this class, that is, noise. After using this data object to calculate the inter-class distance and form a new class, more noise may be introduced, resulting in poor clustering results, which is not conducive to subsequent data analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for hierarchical clustering
  • Method and device for hierarchical clustering
  • Method and device for hierarchical clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with aspects of the invention as recited in the appended claims.

[0053] Hierarchical clustering methods can be applied in various scenarios, such as clustering customer groups with different purchasing power in market analysis scenarios, and clustering organisms of different populations in biology, etc., especially Specifically, the embodiment of the present disclosure takes a scene of face recognition as an example to describe the hierarchical clustering met...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method and a device for hierarchical clustering, and belongs to the field of data mining. The method comprises the steps of obtaining a data object set to be clustered, wherein the data object set comprises a plurality of classes, and each class corresponds to at least one data object; clustering the data objects corresponding to the first class to obtain a clustering result, wherein the number of the data objects corresponding to the first class is greater than a first preset threshold value, the clustering result comprises a plurality of clusters, and each cluster comprises at least one data object; screening the data objects corresponding to the first class according to the clustering result to obtain representative data objects of the first class; calculating a between-class distance according to the representative data objects of the first class and data objects corresponding to the second class; performing hierarchical clustering on the data object set based on the between-class distance. According to the method and the device for hierarchical clustering, by clustering of the data objects corresponding to the first class, the calculation intensity is reduced, and the calculation time and resources are saved; furthermore, the clustering result is more reliable, and subsequent data analysis is facilitated.

Description

technical field [0001] The present disclosure relates to the field of data mining, in particular to a method and device for hierarchical clustering. Background technique [0002] In the field of data mining, it is usually necessary to analyze a large amount of data in order to obtain valuable analysis results. Clustering algorithm is an important algorithm for analyzing data in the field of data mining. This algorithm is used to classify a set of multiple data according to different categories of data. aggregated into one category to facilitate subsequent data analysis. Among them, hierarchical clustering is a commonly used clustering algorithm. [0003] When the related technology implements the method of hierarchical clustering, it calculates the distance between two classes, that is, the inter-class distance, so as to merge two classes whose inter-class distance is less than a certain value into a new class. Since each class may contain more than one data object, when ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F18/231
Inventor 陈志军代阳杨松
Owner XIAOMI INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products