Meta-learner-based training data generation method and heterogeneous response difference estimation method for causal effects

A technique for training data, learners, applied in the field of machine learning

Active Publication Date: 2022-05-17
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to aim at the causal inference problem existing in training data samples in traditional machine learning methods, to provide a general training data generation method, so that the generated training data can meet the learning requirements of the heterogeneous response difference estimation method, so that it can be used A Machine Learning Approach Directly Models the Relationship Between Heterogeneous Reaction Responses and User Characteristics

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Meta-learner-based training data generation method and heterogeneous response difference estimation method for causal effects
  • Meta-learner-based training data generation method and heterogeneous response difference estimation method for causal effects
  • Meta-learner-based training data generation method and heterogeneous response difference estimation method for causal effects

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0065] Based on the above principles and random field test data, the method for generating training data based on meta-learners provided in this embodiment includes the following steps:

[0066] A1 Obtain two sets of original training data through random field experiments, one set of data is the original training data of the treatment group, and the other set of data is the original training data of the control group; the original training data of the treatment group includes the user's characteristic information, group and user Response under the influence of a given behavior; the original training data of the control group includes user characteristic information, group and user's response under the influence of no given behavior.

[0067] In this step, 80% of the data in the processing group of 1,601,963 transactions and the control group of 603,189 transactions are used to generate training data. Here the final data set will be D={X i ,Y i (1), Y i (0),T i},X i Repres...

Embodiment 2

[0093] Based on the training data generated in embodiment 1, this embodiment further learns a final task learner. The input of the task learner is the user's feature information, and the output is the difference of the user's causal effect and heterogeneous response. Here, two task sub-learners are designed to constitute the final task learner, and the two task sub-learners learn the corresponding training data of the generated control group and treatment group respectively. The two task sub-learner models in this embodiment adopt the same model as the two basic learners in Embodiment 1, namely GRU.

[0094] This step specifically includes the following sub-steps:

[0095] B11 divides the generated training data into two groups: the user characteristic information from the control group and the corresponding causal effect heterogeneous response difference form the control group to generate training data, namely The user characteristic information from the treatment group an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for generating training data based on a meta-learner and a method for estimating the difference between causal effects and heterogeneous responses. Firstly, two basic learners are learned by using the original training data obtained from a random field test, and then the two The base learner conducts cross-test on the original training data to generate training data; then uses the generated training data to learn a task sub-learner for the treatment group and the control group respectively, and then two task sub-learners constitute the final task learner ; Through the final task learner can realize the estimation of user causal effect heterogeneous response difference. Based on the meta-learner, the present invention proposes a framework for estimating the heterogeneous response differences of causal effects, which can be used in conjunction with any basic learner model to estimate any type of causal effect response differences; for example, it can be used to guide pricing Discounting means, design of effective advertising strategies, and design of product dimensions and packaging solutions, etc.

Description

technical field [0001] The invention belongs to the combined application of the field of machine learning (Machine Learning) technology and the field of randomized field experiments (Randomized Field Experiments), and relates to the difference estimation of causal effect heterogeneous responses (Heterogeneous Response) based on meta-learners. [0002] The difference in heterogeneous responses refers to the difference in a specific response between a user who is affected by a certain behavior and who is not affected by a certain behavior. Sometimes the impact is defined as treatment (Treatment), and the unaffected is defined as control (Control), so the present invention is to estimate the difference of a certain behavior of a certain user between the treatment group (TreatmentGroup) and the control group (ControlGroup) , is said to be heterogeneous because a specific user can only be in one group, that is, either in the treatment group or in the control group, and it is imposs...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06N20/00
CPCG06N3/08G06N20/00G06N3/045G06N3/044G06F18/2411G06F18/214G06F18/24323
Inventor 周帆曹丞泰钟婷徐增
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products