Method for clustering uncertain data

A clustering method and data-determination technology, applied in fields such as instruments, character and pattern recognition, and computer parts. It addresses the problems of increased algorithmic computational complexity, difficulty in controlling how data uncertainty is processed, and increased time overhead.

Status: Inactive. Publication Date: 2016-01-20
JILIN UNIV

AI Technical Summary

Problems solved by technology

[0005] In summary, most existing clustering methods for uncertain data are adaptations of traditional clustering algorithms designed for deterministic data, and they share the following problems: (1) Although the adapted algorithms improve clustering quality on uncertain data, they do not substantially reduce the uncertainty of the data itself, so in practical applications the clustering results can still be seriously affected by errors.
(2) When the improved clust



Examples


Embodiment Construction

[0040] The steps of the present invention are as follows:

[0041] ① Obtain the true covariance structure of the underlying data. The data set contains a number of uncertain records, each expressed by its mean together with a corresponding probability distribution function. Each record has one element per dimension, and each element has its own probability distribution. For a given record and dimension, the recorded mean is obtained by adding a noise term to the source value of that record in that dimension, so the noise term represents the noise generated when constructing the mean of that element's distribution. This gives:

[0042]

[0043] recorded mean = source value + noise;

[0044] A random variable is defined for each dimension of the database.

[0045] Likewise, one random variable corresponds to the true value of the source data in each dimension and another corresponds to the noise in that dimension, so that:

[0046] database variable = source variable + noise variable;

[0047] for the source data in one dimension and in another dimension, the ...
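The additive noise relation described in paragraphs [0041] to [0046] can be written out symbolically. The notation below is hypothetical and chosen only for illustration; it is not taken from the patent. The last two lines additionally assume that the noise is uncorrelated with the source values and uncorrelated across dimensions, which is one common way such a model is used to recover a source covariance and may differ from the patent's own derivation.

```latex
% Hypothetical notation (not the patent's own symbols):
%   x_i^j : recorded mean of record i in dimension j
%   y_i^j : source (true) value of record i in dimension j
%   e_i^j : noise introduced when constructing the mean
%   X_j, Y_j, E_j : the corresponding random variables for dimension j
\begin{align}
x_i^j &= y_i^j + e_i^j \\
X_j   &= Y_j + E_j \\
% Assumption: E_j is uncorrelated with every Y_k, and with E_k for k \neq j
\operatorname{Cov}(X_j, X_k)
      &= \operatorname{Cov}(Y_j, Y_k) + \operatorname{Cov}(Y_j, E_k)
       + \operatorname{Cov}(E_j, Y_k) + \operatorname{Cov}(E_j, E_k) \\
      &= \operatorname{Cov}(Y_j, Y_k) + \delta_{jk}\,\operatorname{Var}(E_j)
\end{align}
```

Under these assumptions the noise inflates only the diagonal of the observed covariance matrix, so the covariance between two different dimensions of the source data equals the covariance between the same two dimensions of the recorded means.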


Abstract

A method for clustering uncertain data, belonging to the field of data acquisition and processing technology. The invention aims to provide a method for clustering uncertain data that exploits the intrinsic latent associations in the data, finds the true covariance structure of the underlying data records beneath their surface uncertainty, extracts the main characteristics of the data, and performs noise-reduced clustering. The method comprises the following steps: (1) obtaining the true covariance structure of the underlying data; and (2) performing sharpening and noise-reduction processing according to that covariance structure. The method can greatly reduce the uncertainty of the data at its source, and the sharpened, noise-reduced data it produces can also be used in other tasks such as fusion and classification, so the method is highly extensible.
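The two steps named in the abstract, followed by clustering, can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the patent's procedure: the function names are hypothetical, the per-element noise variances are assumed to be known, the noise is assumed uncorrelated with the source values and across dimensions, "sharpening" is stood in for by a projection onto the leading eigenvectors of the corrected covariance, and a plain k-means loop is swapped in as the clustering step.

```python
import numpy as np


def estimate_source_covariance(means, noise_var):
    """Step 1 (sketch): covariance of the recorded means minus the noise share.

    means:     (n, d) array of recorded means, one row per uncertain record.
    noise_var: (n, d) array of noise variances (assumed known).
    """
    observed_cov = np.cov(means, rowvar=False, bias=True)
    # Assumption: noise is uncorrelated with the source and across dimensions,
    # so it only inflates the diagonal of the observed covariance.
    # Note: the result is an estimate and is not guaranteed to be
    # positive semi-definite for small or very noisy samples.
    return observed_cov - np.diag(noise_var.mean(axis=0))


def sharpen(means, source_cov, n_components):
    """Step 2 (stand-in): keep only the leading directions of source_cov."""
    eigvals, eigvecs = np.linalg.eigh(source_cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    top = eigvecs[:, order]
    center = means.mean(axis=0)
    # Project onto the retained directions, then map back to the original axes.
    return (means - center) @ top @ top.T + center


def kmeans(points, k, n_iter=50, seed=0):
    """Plain k-means, used here only as an illustrative clustering step."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            members = points[labels == c]
            if len(members) > 0:
                centers[c] = members.mean(axis=0)
    return labels
```

A typical call sequence would be `cov = estimate_source_covariance(means, noise_var)`, then `clean = sharpen(means, cov, n_components)`, then `labels = kmeans(clean, k)`. Because the sharpened data is returned in the original coordinates, it could equally be passed to a different clustering, fusion, or classification routine.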

Description

Technical field

[0001] The invention belongs to the technical field of data collection and processing.

Background art

[0002] In recent years, with advances in technology and a deepening understanding of data acquisition and processing, uncertain data has received wide attention. In many practical applications, in fields such as economics, the military, finance, and telecommunications, data uncertainty is pervasive and plays a key role. The emergence of uncertain data poses great challenges to traditional cluster analysis. Data uncertainty arises in many ways: errors in data collected by physical instruments; the influence of the surrounding environment on data in sensor network applications; interference from factors such as bandwidth, transmission delay, and energy; and deliberate perturbation for purposes such as privacy protection.

[0003] The manifestations of data unc...


Application Information

IPC(8): G06K9/62
CPC: G06F18/23213
Inventors: 李嘉菲, 孙小玉, 高滢
Owner: JILIN UNIV