Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Clustering result evaluation method and device based on large-scale samples

A result evaluation and large-scale technology, applied in the field of data clustering, can solve the problems of high computational complexity and low efficiency of clustering result evaluation generation, and achieve the effects of reducing complexity, improving generation efficiency, and reducing data volume

Active Publication Date: 2020-10-02
PCI TECH GRP CO LTD +1
View PDF17 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when using the silhouette coefficient to evaluate the clustering results of large-scale samples, due to the relatively large number of samples to be evaluated, the computational complexity of the clustering result evaluation based on the silhouette coefficient is relatively high, and the generation efficiency of the clustering result evaluation is relatively low. lower

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering result evaluation method and device based on large-scale samples
  • Clustering result evaluation method and device based on large-scale samples
  • Clustering result evaluation method and device based on large-scale samples

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0062] figure 1 A flowchart of a large-scale sample-based clustering result evaluation method provided in Embodiment 1 of the present application is given. The large-scale sample-based clustering result evaluation method provided in this embodiment can be composed of large-scale sample-based clustering result evaluation method The clustering result evaluation device based on large-scale samples can be implemented by means of software and / or hardware, and the clustering result evaluation device based on large-scale samples can be composed of two or more physical entities, It can also be a physical entity. Generally speaking, the large-scale sample-based clustering result evaluation device can be computing devices such as computers and server hosts.

[0063] The following description will be made by taking the large-scale sample-based clustering result evaluation device as the main body performing the large-scale sample-based clustering result evaluation method as an example. ...

Embodiment 2

[0098] On the basis of the above examples, Figure 5 It is a schematic structural diagram of a large-scale sample-based clustering result evaluation device provided in Embodiment 2 of the present application. refer to Figure 5 , the apparatus for evaluating clustering results based on large-scale samples provided in this embodiment specifically includes: a first extraction module 21 , a second extraction module 22 and a calculation module 23 .

[0099] Wherein, the first extraction module 21 is used to obtain the clustering result, and randomly extracts a first set number of classes from all classes of the clustering result as the first sampling class;

[0100] The second extraction module 22 is used to extract a second set number of samples as sampling samples according to a set sampling rule for each class of the first sampling class, and form a second sampling class based on the sampling samples;

[0101] The calculation module 23 is used to calculate the silhouette coef...

Embodiment 3

[0120] Embodiment 3 of the present application provides an electronic device, referring to Figure 6 , the electronic device includes: a processor 31 , a memory 32 , a communication module 33 , an input device 34 and an output device 35 . The number of processors in the electronic device may be one or more, and the number of memories in the electronic device may be one or more. The processor, memory, communication module, input device and output device of the electronic device can be connected through a bus or in other ways.

[0121] The memory 32, as a computer-readable storage medium, can be used to store software programs, computer-executable programs and modules, such as program instructions / modules corresponding to the large-scale sample-based clustering result evaluation method described in any embodiment of the present application ( For example, the first extraction module, the second extraction module and the calculation module in the large-scale sample clustering res...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a clustering result evaluation method and device based on large-scale samples. The embodiment of the invention provides a technical scheme. The method comprises: obtaining clustering results, randomly extracting a first set number of classes from all the classes of the clustering result, taking the clustering result as a first sampling class, extracting asecond set number of samples from each class of the first sampling class according to a set sampling rule to serve as sampling samples, forming a second sampling class based on the sampling samples, finally calculating a contour coefficient according to the second sampling class, and obtaining corresponding clustering result evaluation. By adopting the technical means, the data volume of the clustering result can be reduced through reasonable sample sampling, the complexity of clustering result evaluation calculation is reduced on the premise of ensuring that the sampled sample has the representativeness of the clustering result, and the generation efficiency of clustering result evaluation is further improved.

Description

technical field [0001] The embodiments of the present application relate to the technical field of data clustering, in particular to a method and device for evaluating clustering results based on large-scale samples. Background technique [0002] Clustering is a method of data mining. In the field of image processing technology, image clustering is a process of dividing multiple images into multiple classes composed of similar images based on image features. Image clustering plays an important role in image segmentation, object tracking and other fields. In many clustering scenarios, errors may occur in the clustering process. If images that do not belong to the same class are classified into the same image cluster, the center of the image cluster will shift, making the subsequent clustering results more accurate. increasingly inaccurate. Therefore, it is necessary to evaluate the accuracy of the clustering results and correct the wrong image clustering results in time. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23G06F18/217
Inventor 李逸帆丁保剑秦伟郑丁科曾明杨东泉
Owner PCI TECH GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products