Data clustering method, apparatus and storage medium

A data clustering and clustering technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of low clustering efficiency and low accuracy

Inactive Publication Date: 2019-01-01
CHERY AUTOMOBILE CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention provides a data clustering method, device and storage medium, which are used to solve the problem of low clustering efficiency and low accuracy in related technologies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data clustering method, apparatus and storage medium
  • Data clustering method, apparatus and storage medium
  • Data clustering method, apparatus and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0111] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0112] Before explaining the embodiments of the present invention in detail, the application scenarios involved in the embodiments of the present invention are firstly explained.

[0113] In the big data environment, shape clustering algorithms need to be used in many application scenarios to solve problems. For example, in the field of geographic information processing, clustering algorithms are used to extract terrain information of mountains and rivers; in the field of image processing, people or objects in images are identified; in the field of medicine, protein structures are clustered to identify different types of protein and more. However, due to the bias towards the shape of the data set during clustering, the data with many ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data clustering method, a device and a storage medium, belonging to the technical field of data mining. The method comprises the following steps: the original sample data setis uniformly sampled to obtain a uniform sample data set; the positions of each sample in the uniform sample data set are updated to obtain a data set after updating the positions; data clustering isperformed on the updated dataset of the position by aggregation clustering technique. As that original sample data set is uniformly sample, the number of sample is reduced, As a result, the terminalrunning resources are reduced, the clustering speed is improved, the positions of each sample in the obtained uniform sample data set are updated subsequently, and the data clustering is carried out on the data set after the position updating through the aggregation clustering technology, so that the accuracy rate of clustering the samples is improved, and the accuracy rate of clustering the samples is improved.

Description

technical field [0001] The invention relates to the technical field of data mining, in particular to a data clustering method, device and storage medium. Background technique [0002] In the big data environment, shape clustering algorithms need to be used in many application scenarios to solve problems. For example, in the field of geographic information processing, clustering algorithms are used to extract terrain information of mountains and rivers; in the field of image processing, people or objects in images are identified; in the field of medicine, protein structures are clustered to identify different types of protein and more. Among them, a clustering algorithm refers to an algorithm that divides similar data samples into the same cluster through the similarity between each data sample in a data set, thereby realizing the algorithm of dividing the samples of the original data set into multiple clusters. [0003] At present, clustering algorithms usually require cer...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 赛影辉张国兴李中兵
Owner CHERY AUTOMOBILE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products