Calculation method and device for contribution degree of training data set, equipment and storage medium

A technology for calculating the contribution of training data sets, applied in the field of financial technology data processing. It addresses the problem of how to calculate the contribution of each training data set, and helps improve the prediction accuracy of the trained model.

Pending Publication Date: 2020-06-23
WEBANK (CHINA)
0 Cites, 12 Cited by

AI Technical Summary

Problems solved by technology

[0005] The main purpose of the present invention is to provide a method, device, equipment and storage medium for calculating the contribution of training data sets, aiming to solve the existing technical problem of how to calculate the contribution of each training data set.

Examples

Detailed Description of the Embodiments

[0039] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0040] The present invention provides a method for calculating the contribution of a training data set. Refer to figure 1, which is a schematic flowchart of the first embodiment of the method for calculating the contribution of the training data set in the present invention.

[0041] The embodiment of the present invention provides an embodiment of the method for calculating the contribution of the training data set. It should be noted that although a logical order is shown in the flowchart, in some cases the steps shown or described can be executed in a different order.

[0042] The method for calculating the contribution of the training data set is applied to a server or a terminal. The terminal can include mobile terminals such as mobile phones, tablet computers, notebook computers, palm...

Abstract

The invention discloses a method, device, equipment and storage medium for calculating the contribution degree of a training data set, and relates to the field of financial technology. The method comprises the steps of: obtaining each training data set used to train a machine learning model; calculating, by the SHAP interpretation method, a SHAP target value for each feature in the training data set; and calculating the contribution degree of the training data set according to the SHAP target value of each feature in the training data set. According to the invention, the contribution degree of each training data set is obtained from the SHAP target values of the features in that training data set. The contribution degree is then used to evaluate how important each training data set is in the process of training the machine learning model, so that the training data sets used for training can be selected more accurately and the data prediction accuracy of the resulting machine learning model is improved.
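The three steps in the abstract can be illustrated with a small, self-contained sketch. It is not the patented procedure: the visible text does not say how the "SHAP target value" or the final contribution degree is defined, so the sketch assumes that the target value of a feature is its mean absolute Shapley value over all rows, and that a data set's contribution is the normalised sum of the target values of the features it supplies. The linear `model`, the `ownership` mapping, the sample `rows`, and all function names are hypothetical. Shapley values are computed exactly by enumerating feature orderings, with absent features replaced by the column mean; this is only practical for a handful of features.

```python
from itertools import permutations
from statistics import mean


def model(x):
    # Hypothetical stand-in for a trained model: a fixed linear scorer.
    weights = [2.0, -1.0, 0.5]
    return sum(w * v for w, v in zip(weights, x))


def shapley_values(predict, sample, background):
    """Exact Shapley values for one sample.

    Features not yet 'present' in an ordering take the background value.
    """
    n = len(sample)
    phi = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(background)
        prev = predict(current)
        for i in order:
            current[i] = sample[i]
            new = predict(current)
            phi[i] += new - prev  # marginal contribution of feature i
            prev = new
    return [p / len(orderings) for p in phi]


def feature_targets(predict, rows):
    # ASSUMPTION: target value = mean absolute Shapley value per feature.
    background = [mean(col) for col in zip(*rows)]
    per_row = [shapley_values(predict, r, background) for r in rows]
    return [mean(abs(v) for v in col) for col in zip(*per_row)]


def dataset_contributions(targets, ownership):
    # ASSUMPTION: contribution = sum of feature targets, normalised to 1.
    totals = {name: sum(targets[i] for i in idx)
              for name, idx in ownership.items()}
    grand = sum(totals.values())
    return {name: t / grand for name, t in totals.items()}


# Hypothetical data: three feature columns supplied by two data sets.
rows = [[1.0, 2.0, 0.0], [3.0, 0.0, 4.0], [2.0, 4.0, 2.0]]
ownership = {"set_a": [0], "set_b": [1, 2]}

targets = feature_targets(model, rows)
contributions = dataset_contributions(targets, ownership)
print(contributions)
```

With these toy inputs, `set_a` receives a share of about 0.4 and `set_b` about 0.6. In practice the per-feature values would more likely come from a SHAP implementation applied to the real model, and the aggregation rule would follow the full patent text rather than the assumption made here.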

Description

Technical field

[0001] The present invention relates to the technical field of financial technology (Fintech) data processing, and in particular to a method, device, equipment and storage medium for calculating the contribution of a training data set.

Background

[0002] With the development of computer technology, more and more technologies are applied in the financial field, and the traditional financial industry is gradually transforming into financial technology (Fintech). Data processing technology is no exception. However, the security and real-time requirements of the financial industry also place higher demands on the technology.

[0003] Data is very important for machine learning modeling. High-quality data can help improve the performance of machine learning models, thereby improving business benefits or reducing operating costs. In the process of large-scale practical deployment of machine learning, the value of data has become increasi...

Application Information

IPC(8): G06N20/00
CPC: G06N20/00
Inventor: 卓本刚, 黄启军, 唐兴兴, 林冰垠
Owner: WEBANK (CHINA)