Sample generation and survival evaluation method and device based on data genetic variation

A sample generation technology based on genetic variation of data, applied in the computer field. It addresses the problem that samples are lacking, or that only a small number are available, which makes it difficult to build models with machine learning, and it achieves highly consistent results.

Pending Publication Date: 2022-04-15
SICHUAN XW BANK CO LTD

AI Technical Summary

Problems solved by technology

[0008] The prior art suffers from a lack of samples, or from having only a small number of samples, which makes it difficult to build models with machine learning methods. To solve this problem, the present invention provides a method and device for sample generation and survival assessment based on genetic variation of data. Its purpose is to obtain enough samples for modeling in the early stage of a business, so that a better model can be built, which helps reduce business risk and improve the profitability of financial institutions.



Examples


Embodiment 1

[0057] As shown in Figure 1, in one aspect, the present invention provides a method for sample generation and survival assessment based on genetic variation of data, which may specifically include the following steps:

[0058] S101. Parental sample preparation: the initial parental samples are obtained according to the business scenario. The time window of the initial parental samples is T1, and the feature space is Xi (i is the dimension of the feature vector, usually a high-dimensional vector). The labels of the initial parental samples fall into three classes: positive, negative, and uncertain. The total number of initial parental samples is D1, of which Dp are positive, Dn are negative, and Du are uncertain, so Dp+Dn+Du=D1.
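The bookkeeping in S101 can be illustrated with a minimal Python sketch. The `ParentSample` class, the label strings, and the `count_parent_samples` helper are hypothetical names introduced only for this illustration; the patent does not specify a data layout. The sketch only tallies the three label classes and checks the identity Dp+Dn+Du=D1.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical label encoding; the source only names the three classes.
POSITIVE, NEGATIVE, UNCERTAIN = "positive", "negative", "uncertain"


@dataclass
class ParentSample:
    features: List[float]  # one point in the feature space X
    label: str             # POSITIVE, NEGATIVE, or UNCERTAIN


def count_parent_samples(samples: List[ParentSample]) -> Dict[str, int]:
    """Tally the initial parental samples by label and check Dp + Dn + Du = D1."""
    counts = Counter(s.label for s in samples)
    d_p, d_n, d_u = counts[POSITIVE], counts[NEGATIVE], counts[UNCERTAIN]
    d_1 = len(samples)
    if d_p + d_n + d_u != d_1:
        # Some sample carries a label outside the three classes.
        raise ValueError("every sample must be positive, negative, or uncertain")
    return {"D1": d_1, "Dp": d_p, "Dn": d_n, "Du": d_u}
```

The identity check doubles as input validation: it fails only when a sample carries a label outside the three classes described in S101.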

[0059] S102. Parental crossover: the initial parental samples of the various classes are combined and crossed to obtain the labels of the offspring samples. Many specific crossing schemes are possible, such as: positi...

Embodiment 2

[0071] Another aspect of the present invention provides a device for implementing the above sample generation and survival assessment method based on genetic variation of data. Referring to Figure 2, the device includes:

[0072] Business data storage module: used to accept and store business data, and provide initial parent samples;

[0073] Massive sample generation module: used to receive the initial parental samples and, together with the parameters and rules provided by the rule configuration module, generate a massive number of offspring samples;

[0074] Rule configuration module: used to visually configure genetic coefficients, coefficients of variation, and static survival rules;

[0075] Survival model operation module: used to train and deploy multiple survival model classifiers, to perform static and dynamic survival evaluation of the generated offspring samples, and to output the sample survival results and the corresponding survival weights;

[0076] Sample re...

Embodiment 3

[0080] Another aspect of the present invention provides a computer device. As shown in Figure 3, the computer device includes a memory, a processor, a communication interface, and a communication bus. A computer program that can run on the processor is stored in the memory, and when the processor executes the computer program, the steps of the methods in the above embodiments are implemented.

[0081] The processor may be a central processing unit (Central Processing Unit, CPU).

[0082] As a non-transitory computer-readable storage medium, the memory can be used to store non-transitory software programs, non-transitory computer-executable programs and units, such as the program units corresponding to the above method embodiments of the present invention. The processor runs the non-transitory software programs, instructions and modules stored in the memory to execute the various functional applications and data processing of the processor, that is, to reali...


Abstract

The invention discloses a sample generation and survival evaluation method and device based on data genetic variation, and belongs to the technical field of computers. The technical scheme comprises the steps of parental sample preparation, parental sample crossing, feature inheritance, feature variation, first-stage static survival evaluation, second-stage static survival evaluation, dynamic survival evaluation, and business modeling. These steps are repeated to generate an offspring sample set S2, and static and dynamic survival evaluation is performed on the offspring samples in S2 to determine whether each is eliminated or retained. The objective of the invention is to obtain enough samples for modeling in the early stage of a business, so that a better model can be built, which helps reduce business risk and improve the profitability of a financial institution.
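The feature inheritance, feature variation, and static survival steps named in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented procedure: the available text does not define how the genetic coefficient, the coefficient of variation, or the static survival rules are applied. Here the genetic coefficient is assumed to be the probability of taking a feature from the first parent, the coefficient of variation is assumed to be the probability of adding Gaussian noise to a feature, and a static survival rule is assumed to be a plain predicate on the feature vector. All function names and the `scale` parameter are hypothetical. Offspring labels and the dynamic, classifier-based survival evaluation are left out.

```python
import random
from typing import Callable, List, Sequence

Vector = List[float]
Rule = Callable[[Sequence[float]], bool]


def inherit(parent_a: Sequence[float], parent_b: Sequence[float],
            genetic_coef: float, rng: random.Random) -> Vector:
    """Assumed inheritance: take each feature from parent_a with
    probability genetic_coef, otherwise from parent_b."""
    if len(parent_a) != len(parent_b):
        raise ValueError("parents must share one feature space")
    return [a if rng.random() < genetic_coef else b
            for a, b in zip(parent_a, parent_b)]


def mutate(child: Sequence[float], variation_coef: float,
           scale: float, rng: random.Random) -> Vector:
    """Assumed variation: with probability variation_coef, add
    zero-mean Gaussian noise of standard deviation scale to a feature."""
    return [x + rng.gauss(0.0, scale) if rng.random() < variation_coef else x
            for x in child]


def survives(child: Sequence[float], rules: Sequence[Rule]) -> bool:
    """Assumed static survival check: every rule must accept the child."""
    return all(rule(child) for rule in rules)


def generate_offspring(parents: Sequence[Vector], n_children: int,
                       genetic_coef: float, variation_coef: float,
                       scale: float, rules: Sequence[Rule],
                       seed: int = 0) -> List[Vector]:
    """Cross random parent pairs and keep the children that pass the rules."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    kept: List[Vector] = []
    for _ in range(n_children):
        parent_a, parent_b = rng.sample(list(parents), 2)
        child = mutate(inherit(parent_a, parent_b, genetic_coef, rng),
                       variation_coef, scale, rng)
        if survives(child, rules):
            kept.append(child)
    return kept
```

A call such as `generate_offspring(parents, 1000, 0.5, 0.1, 0.05, [lambda v: v[0] >= 0.0])` would return at most 1000 surviving offspring vectors. Keeping the rules as predicates keeps generation separate from the survival criteria, which matches the idea of rules being configured apart from sample generation.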

Description

Technical field

[0001] The invention belongs to the field of computer technology, and in particular relates to a sample generation and survival evaluation method and device based on data genetic variation.

Background

[0002] Machine learning currently has a very wide range of application scenarios. In finance, communications, medical care, transportation, e-commerce, and other fields, many new businesses go through a cold-start period at the start-up stage. Because samples are lacking or only a small number are available, it is very difficult to apply machine learning methods to build models.

[0003] The most typical example is how a financial institution builds a scoring model to distinguish good customers from bad ones based on a small amount of customer information at the beginning of the target scenario business. For this situation, the general methods currently used are:

[0004] 1. Expand the sample of similar businesses. For example, the goal is to establish a s...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16B40/20, G16B20/20, G06K9/62
Inventor: 郑乐
Owner: SICHUAN XW BANK CO LTD