Method for similarity identification in cluster analysis

A similarity technique for cluster analysis, applied in the field of similarity identification, that addresses problems such as the incomplete expression of similarity.

Inactive Publication Date: 2017-10-13
GUANGDONG UNIV OF TECH
0 Cites, 5 Cited by
AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a method for similarity identification in cluster analysis that solves the prior-art problem of similarity being incompletely expressed in cluster analysis.

Embodiment Construction

[0028] To enable those skilled in the art to better understand the solution of the present invention, the invention is described in further detail below with reference to the accompanying drawings and specific embodiments. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0029] See Figure 1, a schematic flowchart of a specific implementation of the similarity identification method in cluster analysis provided by an embodiment of the present invention. The method includes the following steps:

[0030] Step 101: Obtain a first sequence and a second sequence.

[0031] It should be noted that the above-mentioned first sequence and second sequence may refer...

Abstract

The invention discloses a method for similarity identification in cluster analysis. The method includes: acquiring a first sequence and a second sequence; calculating the Euclidean distance between the elements of the first sequence and the elements of the second sequence, each element having been assigned a preset weight; calculating, from the gain of the i-th dimension element in the first sequence and the gain of the i-th dimension element in the second sequence, the correlation coefficient between those two elements; calculating, from the correlation coefficients, the grey relational degree between the first sequence and the second sequence; and calculating the similarity of the two sequences from the grey relational degree and the Euclidean distance using a preset weight coefficient. Through the weight coefficient, the method combines the Euclidean distance and the grey relational degree between sequences, so that the similarity reflects both the distance between the two sequences and the similarity of their form; that is, the calculated similarity represents the degree of 'form similarity' and the degree of 'value similarity' between the sequences at the same time.
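The steps in the abstract can be sketched in Python as follows. This is only an illustrative sketch, not the patent's formulas, which are not given in this excerpt. It assumes that the "gain" of an element is its increment over the previous element, that the correlation coefficient takes the standard grey relational form with a resolution coefficient `rho`, and that the Euclidean distance is mapped to a similarity as `1 / (1 + d)`. The function names and the parameters `alpha` and `rho` are hypothetical.

```python
import math


def weighted_euclidean(x, y, w):
    """Euclidean distance with a preset weight w[i] on each dimension."""
    return math.sqrt(sum(wi * (xi - yi) ** 2 for xi, yi, wi in zip(x, y, w)))


def gains(x):
    """ASSUMPTION: the 'gain' of element i is its increment over element i-1."""
    return [x[i] - x[i - 1] for i in range(1, len(x))]


def grey_relational_degree(x, y, rho=0.5):
    """Mean of per-dimension correlation coefficients computed on the gains.

    ASSUMPTION: standard grey relational coefficient,
    (d_min + rho * d_max) / (d_i + rho * d_max), with resolution coefficient rho.
    """
    deltas = [abs(a - b) for a, b in zip(gains(x), gains(y))]
    d_min, d_max = min(deltas), max(deltas)
    if d_max == 0:
        # Identical gain profiles: the two sequences have exactly the same form.
        return 1.0
    coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in deltas]
    return sum(coeffs) / len(coeffs)


def similarity(x, y, w, alpha=0.5, rho=0.5):
    """Combine form similarity and value similarity with weight coefficient alpha.

    ASSUMPTION: the distance d is turned into a value similarity as 1 / (1 + d).
    """
    value_sim = 1.0 / (1.0 + weighted_euclidean(x, y, w))
    form_sim = grey_relational_degree(x, y, rho)
    return alpha * form_sim + (1.0 - alpha) * value_sim


if __name__ == "__main__":
    first = [1.0, 2.0, 3.0, 4.0]
    second = [11.0, 12.0, 13.0, 14.0]  # same form, shifted values
    weights = [0.25, 0.25, 0.25, 0.25]
    print(similarity(first, second, weights))
```

Under these assumptions, two sequences with the same form but offset values score high on the grey relational term and low on the distance term, and `alpha` sets how much each term contributes to the final similarity.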

Description

Technical field

[0001] The invention relates to the field of data mining, and in particular to a method and device for similarity identification in cluster analysis.

Background technique

[0002] With the advent of the big data era, large amounts of complex data have accumulated in many fields, and how to mine the potential value of that data has become a research hotspot. Cluster analysis is widely used in many of these fields, such as weather forecasting, electric power, finance, and forestry.

[0003] Cluster analysis is a multivariate analysis method in mathematical statistics. It uses mathematical methods to quantitatively determine the relationships between samples so that they can be divided into types objectively. The things to be clustered are usually called samples, and a group of things to be clustered is called a sample set. A similarity function can be used as a tool to measure the similarity between sample data.

[0004] At present, th...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/285, G06F16/2465
Inventor: 王星华, 周亚武, 陈云龙, 许炫壕
Owner: GUANGDONG UNIV OF TECH