Missing data repairing method and device in time-space sequence data

A technology for sequence data and missing data, applied in the field of data processing, can solve problems such as not taking good care of time-space correlation, interpolation accuracy needs to be improved, etc.

Inactive Publication Date: 2016-06-15
NEC CORP
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the above-mentioned missing value patching algorithms for time-space series data only consider the data of the...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Missing data repairing method and device in time-space sequence data
  • Missing data repairing method and device in time-space sequence data
  • Missing data repairing method and device in time-space sequence data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0092] The inventive idea of ​​the present invention is mainly: in the process of interpolating and repairing the time-space sequence data, not only the influence of the space dimension on the missing data is considered, but also the influence of the time dimension on the missing data is considered, and the time-space correlation and heterogeneity are taken into account at the same time. Qualitative, thereby improving the interpolation accuracy. According to the above inventive concept, figure 1 A schematic flowchart showing a method for repairing missing data in spatio-temporal sequence data according to an embodiment of the present invention. Such as figure 1 As shown, the method can mainly include:

[0093] Step 101 , respectively determine the contribution weights of spatial peripheral points and temporal peripheral points to the point to be found with missing data.

[0094] Step 102 : Calculate spatial dimension estimation data of the point to be sought according to ...

Embodiment 2

[0101] figure 2 A schematic flowchart showing a method for repairing missing data in spatio-temporal sequence data according to another embodiment of the present invention. Such as figure 2 As shown, the main difference from the foregoing embodiment is that, before step 101, the method may further include:

[0102] Step 201. Arrange the time-space sequence data into a two-dimensional array to detect missing points where data is missing.

[0103]For example, the time-space sequence data can be arranged in the format of a two-dimensional array P(Y,T) to facilitate calculation. Wherein, the data in row u and column v in the two-dimensional array can represent the data of the vth spatial point at the uth time point.

[0104] Step 202 , perform missing data detection on the above-mentioned two-dimensional array, find missing points, and uniformly set the data of missing points to null, and also determine whether the number of missing points is single or multiple. In the case ...

Embodiment 3

[0264] image 3 A structural block diagram of an apparatus for repairing missing data in spatio-temporal sequence data according to an embodiment of the present invention is shown. Such as image 3 As shown, the device can mainly include:

[0265] The spatial dimension estimation module 11 is used to determine the contribution weight of the spatial peripheral points to the data-missing points to be sought, and calculate the Spatial dimension estimation data of the point to be sought;

[0266] The time dimension estimation module 13 is used to determine the contribution weight of the surrounding points in time to the to-be-required point with missing data, and calculate the Estimated data of the time dimension of the point to be requested;

[0267] The data fusion module 15 is configured to calculate the data of the point to be sought according to the estimated data of the space dimension and the estimated data of the time dimension.

[0268] The device for repairing missi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a method and device for repairing missing data in time-space sequence data, wherein the method includes: separately determining the contribution weights of space peripheral points and time peripheral points to the missing data points; The contribution weights of the points are sorted from large to small, and the spatial dimension estimation data of the points to be obtained are calculated; according to the contribution weights of the points to be obtained, the first multiple points are sorted from large to small Calculate the estimated data of the time dimension of the point to be sought for the surrounding points in time; calculate the data of the point to be sought based on the estimated data of the spatial dimension and the estimated data of the time dimension. The invention makes full use of the time-space correlation and heterogeneity of the time-space sequence data, and the obtained data of the point to be sought has high precision.

Description

technical field [0001] The invention relates to the field of data processing, in particular to a method and device for repairing missing data in time-space sequence data. Background technique [0002] The evolution of nature, human production activities, and social and economic development are all spatiotemporal processes, which can be represented by spatiotemporal series data. However, spatio-temporal series data may have local data missing due to various reasons. It is obviously wasteful and unreasonable to discard all data due to the lack of partial data. Therefore, it is necessary to repair the missing spatio-temporal sequence data to better mine the spatio-temporal association rules of things based on big data. [0003] At present, interpolation methods are commonly used to estimate missing values ​​of time-space series data. For example, common methods for estimating missing data in climate datasets are regression-based methods, Kriging and its variants, InverseDist...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/00
Inventor 刘博胡卫松刘晓炜樊子德邓敏
Owner NEC CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products