
A Data Restoration Method Based on Nearest Neighbor

A data restoration technology based on nearest neighbors, applied in the fields of electrical digital data processing, special data processing applications, and instruments. It addresses problems such as the inability of existing methods to accurately describe the characteristics of data sets, and it improves repair accuracy.

Active Publication Date: 2018-12-28
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

Most existing data repair methods rely on constraint rules that are either specified manually by domain experts or mined from part of the training data. In both cases, the rules cannot accurately describe the characteristics of the data set being repaired.



Examples


Detailed Description of the Embodiments

[0023] To make the objectives, technical solutions, and advantages of the present invention clearer, the technical solutions of the present invention are described below clearly and completely with reference to the accompanying drawings in the embodiments of the present invention. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments, without creative work, fall within the protection scope of the present invention.

[0024] As one embodiment of the present invention, this embodiment provides a neighbor-based data repair method. Referring to figure 1, which is a flowchart of the neighbor-based data repair method according to an embodiment of the present invention, the method includes:

[0025] S1, based on all attributes of the data point, by calculating the K-nearest neighbor distance of the...
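
Step S1 can be illustrated with a minimal sketch. The source text is truncated here and does not specify a distance metric or a decision rule, so the Euclidean distance, the fixed threshold, and the names `knn_distance` and `detect_outliers` are assumptions made for illustration only, not the patented procedure. The sketch flags a data point as abnormal when the distance to its K-th nearest neighbor in the full attribute space exceeds the threshold.

```python
import numpy as np

def knn_distance(points, k):
    """Distance from each row to its k-th nearest other row.

    Assumption: Euclidean distance over all attributes (columns).
    Builds the full pairwise matrix, so it only suits small data sets.
    """
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbor
    return np.sort(dists, axis=1)[:, k - 1]

def detect_outliers(points, k, threshold):
    """Boolean mask: True where the k-NN distance exceeds the threshold.

    Assumption: a fixed, user-chosen threshold stands in for the
    (unspecified) abnormality judgment.
    """
    return knn_distance(points, k) > threshold

# Hypothetical data: five points close together and one far away.
data = np.array([
    [1.0, 1.0],
    [1.1, 0.9],
    [0.9, 1.1],
    [1.0, 1.2],
    [1.2, 1.0],
    [1.0, 9.0],
])
flags = detect_outliers(data, k=2, threshold=1.0)
print(flags)
```

For larger data sets, a spatial index would normally replace the pairwise distance matrix; the dense version is used here only to keep the example short.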



Abstract

The invention provides a data repair method based on nearest neighbors. The method comprises the following steps. S1: on the basis of all attributes of a data point, calculate the K-nearest neighbor distance of the data point in the full attribute space to detect abnormal data points in that space. S2: on the basis of a given subset of the attributes, calculate the K-nearest neighbor distance of each abnormal data point in the corresponding attribute subspace and perform an abnormality judgment to determine the normal attributes of the abnormal data point. S3: on the basis of the normal attributes of the abnormal data point, use a given operation to calculate repair values for its abnormal attributes and repair the abnormal data point. The method can effectively improve both the accuracy and the efficiency of data repair.
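
Steps S2 and S3 can be sketched as follows, under stated assumptions. The abstract does not say how attribute subsets are chosen, what the abnormality judgment is, or what the "given operation" is. This sketch assumes an exhaustive search from the largest subsets down, a fixed distance threshold, and the mean of the K nearest neighbors as the repair operation. The function names are hypothetical.

```python
from itertools import combinations
import numpy as np

def knn_in_subspace(point, others, attrs, k):
    """Return (k-th nearest distance, indices of the k nearest rows),
    measuring Euclidean distance on the columns in attrs only."""
    cols = list(attrs)
    d = np.sqrt(((others[:, cols] - point[cols]) ** 2).sum(axis=1))
    order = np.argsort(d)[:k]
    return d[order[-1]], order

def find_normal_attributes(point, others, k, threshold):
    """S2 (assumed form): largest attribute subset in which the point
    is no longer abnormal, i.e. its k-NN distance is within threshold."""
    n_attrs = point.shape[0]
    for size in range(n_attrs - 1, 0, -1):
        for attrs in combinations(range(n_attrs), size):
            kdist, _ = knn_in_subspace(point, others, attrs, k)
            if kdist <= threshold:
                return attrs
    return ()  # no subspace found in which the point looks normal

def repair_point(point, others, normal_attrs, k):
    """S3 (assumed form): replace each abnormal attribute with the mean
    of that attribute over the k nearest neighbors in the normal subspace."""
    repaired = point.copy()
    if not normal_attrs:
        return repaired
    _, neighbors = knn_in_subspace(point, others, normal_attrs, k)
    bad = [a for a in range(point.shape[0]) if a not in normal_attrs]
    repaired[bad] = others[neighbors][:, bad].mean(axis=0)
    return repaired

# Hypothetical clean points and one abnormal point whose second attribute is off.
clean = np.array([
    [1.0, 1.0],
    [1.1, 0.9],
    [0.9, 1.1],
    [1.0, 1.2],
    [1.2, 1.0],
])
abnormal = np.array([1.0, 9.0])

normal = find_normal_attributes(abnormal, clean, k=2, threshold=0.5)
repaired = repair_point(abnormal, clean, normal, k=2)
print(normal, repaired)
```

The exhaustive subset search grows exponentially with the number of attributes, so it only illustrates the idea; the abstract's reference to "given" attribute subsets suggests the candidate subspaces are supplied rather than enumerated.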

Description

Technical field

[0001] The present invention relates to the technical field of computer data management, and more specifically to a neighbor-based data repair method.

Background art

[0002] In today's big data era, huge amounts of data are available for analysis and mining, making it more convenient for people to carry out a wide range of activities. As data is used more and more widely, the issue of data quality has gradually attracted attention. A data quality problem is a deviation that arises in data at some point in its life cycle (generation, storage, processing, and use), leaving the final data inconsistent, inaccurate, or incomplete.

[0003] Data quality problems have many causes, such as data source failure, human error, and damage to storage media. These many causes make data quality problems prevalent in production and everyday life. In practice, the losses caused by data quality problems cannot be underestimated. According to statistics, the economic loss ca...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 王建民, 宋韶旭, 王昳晗
Owner: TSINGHUA UNIV