Clustering method of incremental algorithm, electronic equipment and storage medium

A clustering method and incremental clustering technology, applied in the field of computer algorithms, can solve problems such as low efficiency, division, and update, and achieve the effect of reducing time-consuming and avoiding repeated clustering

Pending Publication Date: 2022-05-31
GUANGDONG OPPO MOBILE TELECOMM CORP LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The time complexity of partitioned clustering is low, but the number of clusters needs to be given in advance, and each time the number of clusters is changed, clustering needs to be re-clustered
Hierarchical clustering and density clustering have high time complexity and are not suitable for large data clustering
[0003] When using the classic K-means clustering algorithm to cluster data with a large amount of data and high dimensions, due to the generation of too many intermediate variables, such as the distance between data points, etc., it will cause memory overflow problems; And the completed clustering can only divide the newly added data, but cannot update the cluster; secondly, every time the number of clusters is changed, clustering needs to be re-clustered, which is inefficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method of incremental algorithm, electronic equipment and storage medium
  • Clustering method of incremental algorithm, electronic equipment and storage medium
  • Clustering method of incremental algorithm, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The embodiment of the present application provides an incremental algorithm clustering method, electronic equipment and storage medium, which are used for the combined clustering model of partitioned clustering and hierarchical clustering, and the model can quickly obtain clusters from partitioned clustering Based on the clustering results with a large number of clusters, the hierarchical clustering of the target number of clusters is carried out. This combined clustering model can avoid repeated clustering and reduce time-consuming when changing the number of clusters.

[0033] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be described below in conjunction with the drawings in the embodiment of the application. Obviously, the described embodiment is only a part of the application Examples, but not all examples. Based on the embodiments in this ap...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a clustering method of an incremental algorithm, electronic equipment and a storage medium, and is used for a combined clustering model of divided clustering and hierarchical clustering, the model can quickly obtain a clustering result with a large number of clustering numbers through divided clustering, and hierarchical clustering of a target clustering number is carried out on the basis, so that the clustering efficiency is improved. According to the combined clustering model, repeated clustering can be avoided when the clustering number is changed, and time consumption is reduced. The method provided by the embodiment of the invention comprises the following steps: during first clustering, performing division clustering on a first data set by using a MiniBatchK-means algorithm to obtain a clustering model; inputting the second data set into the clustering model, adjusting a clustering center point, carrying out clustering identification, and outputting a preliminary clustering result of a clustering number k = m; inputting the preliminary clustering result into a hierarchical clustering model, and outputting a clustering result nlt of a target clustering number k = n; m, n and m are positive integers.

Description

technical field [0001] The present application relates to the field of computer algorithms, in particular to an incremental algorithm clustering method, electronic equipment and storage media. Background technique [0002] Clustering is to divide data points into groups, and at the same time make the similarity between data points in the group as large as possible, and the similarity between data points in different groups as small as possible. Clustering algorithms include partitioning clustering, hierarchical clustering, density clustering, etc. The time complexity of partitioned clustering is low, but the number of clusters needs to be given in advance, and each time the number of clusters is changed, clustering needs to be re-clustered. Hierarchical clustering and density clustering have high time complexity and are not suitable for large data clustering. [0003] When using the classic K-means clustering algorithm to cluster data with a large amount of data and high d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213Y02D10/00
Inventor 潘启灏张鼎徐红艳李永超黄飞
Owner GUANGDONG OPPO MOBILE TELECOMM CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products