System and method for identifying associations and evolution patterns among data elements
A data and pattern technology, applied in the field of data analysis, can solve problems such as inability to obtain time series correlations, manual division, etc., and achieve the effects of optimizing time length and time lag constraints, improving accuracy, and identifying accurately
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0034] The preprocessing unit 110 preprocesses the collected data to obtain a processed data sequence. In this embodiment, the collected data may include, for example, meteorological data and air quality data, and preferably also includes at least one item of traffic data, population density data, and pollution source data.
[0035] In this embodiment, the preprocessing performed by the preprocessing unit 110 may include the following aspects:
[0036] (1) Normalize the collected data into data sequences with the same time scale.
[0037] (2) Stabilize the normalized data sequence. For example, if the time series is non-stationary, it can be differenced to make it stationary.
[0038] (3) Based on the duration and time lag constraints, the smoothed data sequence is converted into a sample sequence. For example, the transformation parameters of each data element may be duration and time lag constraints. Transformation parameters may be set based on experience or prior knowl...
Embodiment 2
[0062] The difference between Embodiment 2 and Embodiment 1 is that: the duration and time lag constraints are optimized to improve the accuracy of causality discovery. This operation can be performed by figure 1 The preprocessing unit 110 shown in is implemented. Below, combine figure 1 The specific details of Embodiment 2 are described in detail with FIGS. 8-9.
[0063] First, set L>T according to empirical knowledge, that is, select an optimal time segment [t1, t2] in the range [1, L]. For example, L=10 may be set.
[0064] Then, use the following formula to get β:
[0065]
[0066] The formula includes two function penalty items, the former is a sparse penalty item, and the latter is a penalty item to ensure continuity. The meanings of the parameters in this formula are as follows:
[0067] -N is the number of samples, which can be regarded as the length of the Y sequence.
[0068] -L is a preset value, that is, in the [1, L] time range, select an optimal time se...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


