Hybrid filling method for incomplete data

A hybrid filling technique for incomplete data, applied in the computer field, that can solve problems such as lack of generality

Active Publication Date: 2015-08-26
DALIAN UNIV OF TECH

AI Technical Summary

Problems solved by technology

However, no clustering algorithm can guarantee a 100% correct partition, so selecting candidate filling data within the obtained clusters becomes the key
In addition, most of the clustering algorithms used in existing data filling need to specify the number o

Example Embodiment

[0030] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0031] Figure 1 shows a schematic flow diagram of the hybrid filling method for incomplete data in an embodiment of the present invention, which includes the following steps:

[0032] (1) Normalization and special value filling preprocessing for incomplete data sets

[0033] Suppose that the entire data object set $D$ contains $n$ data objects, and each object has $m$ attributes, that is, $D=\{x_1, x_2, \ldots, x_n\}$, $A=\{a_1, a_2, \ldots, a_m\}$. ...
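
Step (1) can be illustrated with a minimal sketch. The excerpt does not state which normalization or which special value the patent uses, so the choices below are assumptions: min-max normalization computed from the observed values of each attribute, and the attribute mean (or a caller-supplied constant) as the special value. The function name `preprocess` and its parameters are hypothetical, and every attribute is assumed to have at least one observed value.

```python
import numpy as np

def preprocess(data, fill_value=None):
    """Normalize each attribute and fill missing entries (NaN).

    Assumptions (not taken from the patent text): min-max normalization
    over observed values; the special value is the attribute mean unless
    a constant fill_value is given.
    """
    x = np.asarray(data, dtype=float)      # shape (n objects, m attributes)
    missing = np.isnan(x)                  # mask of missing entries
    col_min = np.nanmin(x, axis=0)
    col_max = np.nanmax(x, axis=0)
    # Avoid division by zero for attributes with a single observed value.
    span = np.where(col_max > col_min, col_max - col_min, 1.0)
    normalized = (x - col_min) / span
    if fill_value is None:
        fill = np.nanmean(normalized, axis=0)
    else:
        fill = np.full(x.shape[1], float(fill_value))
    filled = np.where(missing, fill, normalized)
    return filled, missing

demo = [[1.0, 10.0], [3.0, np.nan], [np.nan, 30.0]]
demo_filled, demo_missing = preprocess(demo)
```

The returned mask records which entries were originally missing, so that later steps can overwrite only those entries.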

Abstract

The invention discloses a hybrid filling method for incomplete data. The method comprises the following steps: (1) performing special-value filling as preprocessing on the missing values in a data set; (2) extracting salient features of the data attributes with a stacked autoencoder; (3) performing incremental clustering on the filled data set based on the extracted features; (4) within each resulting cluster, filling each object that has missing data with a weighted combination of the attribute values of the top k% objects most similar to it; and then comparing all filling values from this pass with those from the previous pass, repeating steps (2) to (4) until the filling-value convergence condition is met. The embodiment takes into account the local similarity of data in the data set, clustering precision, in-cluster filling accuracy, and the need for the algorithm to be unsupervised and timely in practical use, and so builds an algorithm that first clusters the incomplete data and then fills it. Filling precision and speed are ensured by combining special-value filling, the stacked autoencoder, incremental clustering, and weighted filling from the top k% complete data objects within each cluster.
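
Step (4) and the convergence loop can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the abstract does not give the similarity measure or the weights, so Euclidean distance over the observed attributes and inverse-distance weights are assumed here. Steps (2) and (3) are represented only by a hypothetical `cluster_fn` callback that returns one cluster label per object; `weighted_fill`, `iterate_fill`, `tol`, and `max_iter` are likewise illustrative names.

```python
import numpy as np

def weighted_fill(filled, missing, labels, k_percent=50.0, eps=1e-9):
    """One pass of step (4): re-estimate each missing entry from the
    top k% most similar complete objects in the same cluster."""
    out = filled.copy()
    incomplete = missing.any(axis=1)
    for i in np.flatnonzero(incomplete):
        donors = np.flatnonzero(~incomplete & (labels == labels[i]))
        if donors.size == 0:
            continue  # no complete object in this cluster; keep current value
        obs = ~missing[i]  # compare on attributes observed for object i
        dist = np.linalg.norm(filled[donors][:, obs] - filled[i, obs], axis=1)
        k = max(1, int(np.ceil(donors.size * k_percent / 100.0)))
        nearest = np.argsort(dist)[:k]
        weights = 1.0 / (dist[nearest] + eps)  # assumed inverse-distance weights
        weights /= weights.sum()
        cols = np.flatnonzero(missing[i])
        out[i, cols] = weights @ filled[donors[nearest]][:, cols]
    return out

def iterate_fill(filled, missing, cluster_fn, k_percent=50.0,
                 tol=1e-4, max_iter=20):
    """Repeat clustering and filling until the fill values stop changing."""
    for _ in range(max_iter):
        labels = cluster_fn(filled)  # stands in for steps (2) and (3)
        new = weighted_fill(filled, missing, labels, k_percent)
        change = np.abs(new - filled)[missing].max() if missing.any() else 0.0
        filled = new
        if change < tol:
            break
    return filled
```

In this sketch the donors are restricted to complete objects, following the abstract's mention of in-class complete data objects, and only entries flagged in `missing` are ever overwritten.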

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a method for hybrid filling of incomplete data based on a stacked deep learning network and incremental clustering.

Background

[0002] With the development of the Internet of Things, social networks, and e-commerce, data has grown and accumulated at an unprecedented rate, and incomplete data has grown with it, seriously reducing data quality. In practical data analysis, efficiently filling missing data during preprocessing is another major problem facing both academia and industry.

[0003] One early method fills missing data with the average of the attribute's values in the data set; another directly deletes the records that contain missing values. Compared with directly deleting missing records, average filling produces more erroneous analysis results, but simple data deletion will serio...
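
The two earlier baselines mentioned in [0003] can be sketched in a few lines for comparison. The function names `mean_fill` and `drop_incomplete` are hypothetical, and missing values are assumed to be encoded as NaN in a numeric array.

```python
import numpy as np

def mean_fill(data):
    """Baseline: replace each missing entry with its attribute's mean."""
    x = np.asarray(data, dtype=float)
    return np.where(np.isnan(x), np.nanmean(x, axis=0), x)

def drop_incomplete(data):
    """Baseline: delete every record that contains a missing value."""
    x = np.asarray(data, dtype=float)
    return x[~np.isnan(x).any(axis=1)]

records = [[1.0, 2.0], [np.nan, 4.0], [3.0, np.nan]]
mean_filled = mean_fill(records)
kept = drop_incomplete(records)
```

On this small input, deletion keeps only one of the three records, while mean filling keeps all three but gives every missing entry the same value regardless of which object it belongs to.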

Application Information

IPC(8): G06F17/30
CPC: G06F16/355
Inventors: 陈志奎, 赵亮, 杨镇楠
Owner: DALIAN UNIV OF TECH