Causal relationship mining method based on deep learning

A causal relationship, deep learning technology, applied in neural learning methods, neural architecture, biological neural network models, etc., can solve the problem that nonlinear data cannot be predicted with high accuracy

Inactive Publication Date: 2019-07-09
BEIJING UNIV OF TECH
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, the existing multivariate time series causality analysis mainly focuses on the qualitative Granger causality test between variables, and some studies based on Granger causality only use linear regression to fit data and give results. This method cannot achieve high-accuracy predictions for complex nonlinear data in real life, and has great limitations.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Causal relationship mining method based on deep learning
  • Causal relationship mining method based on deep learning
  • Causal relationship mining method based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0036] The used hardware equipment of the present invention has 1 PC machine, 2 1080 graphics cards;

[0037] Such as image 3 As shown, the present invention provides a method for mining Granger causality based on deep learning, which specifically includes the following steps:

[0038] Step 1. Obtain relevant time series data and clean the data.

[0039] Step 2, preprocessing the data for data missing, not smooth, etc.

[0040] Step 2.1, for the missing values ​​in the data, use the mode or median to complete.

[0041] In step 2.2, One-hot encoding or LabelEncoder is required to standardize category features.

[0042] Step 2.3, for the non-stationary time series, transform it into a stationary series through the method of first-order difference processing.

[0043] Step 2.4, convert the time series problem into...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a causal relationship mining method based on deep learning. The method comprises the following steps: firstly, preprocessing data by using technologies such as missing data supplement, data normalization and independent hot coding; then, based on a Keras deep learning framework, carrying out univariate time sequence prediction on the target characteristics by using an LSTM;adjusting the model structure and a series of hyper-parameters to obtain an optimal model; recording the R2 score of the model on the test set; and then, predicting all the alternative features by using the model to obtain R2 scores of the alternative features on the test set, subtracting the two scores to obtain a Granger causal relationship score of the alternative features and the target feature, and obtaining a quantitative number for describing the Granger causal relationship between the alternative features and the target feature. The method is suitable for the problem of influence factor analysis of other time sequences. In conclusion, the Granger causal relationship mining method based on deep learning has the advantages of mining much data in various fields.

Description

technical field [0001] The invention belongs to the technical field of data mining, and in particular relates to mining Granger causality from multivariate time series data based on deep learning technology. Background technique [0002] Time series data is a set of data points observed at constant time intervals. It has two main characteristics. One is that it depends on time, and there is correlation between the observed values. Most time series also have a seasonal trend, that is, a specific change in a specific time window. As the cost of data storage has been greatly reduced, a large amount of time series data generated in real life has also been completely recorded. This kind of data is commonly found in fields such as finance, commodity prices, and transportation. For example, in the field of air quality, in recent years, atmospheric visibility has declined, air quality has deteriorated, and many cities are often accompanied by smog weather, which has a relatively la...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/049G06N3/08
Inventor 刘博贺玺
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products