Unlock instant, AI-driven research and patent intelligence for your innovation.

Low-cost and high-resolution data classification method based on constructed prediction model

A prediction model and data classification technology, applied in the fields of statistics, classification prediction, and machine learning algorithms, can solve the problems of high acquisition cost, long data acquisition time, and high resource consumption to unlock, achieve accurate classification, save query costs, and reduce total costs. time cost effect

Inactive Publication Date: 2019-04-26
SICHUAN XW BANK CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In view of the problems of the above research, the purpose of the present invention is to provide a low-cost, high-discrimination data classification method based on the constructed prediction model, to solve the classification method in the prior art to classify data, which needs to obtain more data To make multiple binary classifiers, resulting in long data acquisition time, high acquisition cost, high resource consumption and unlocking problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Low-cost and high-resolution data classification method based on constructed prediction model
  • Low-cost and high-resolution data classification method based on constructed prediction model
  • Low-cost and high-resolution data classification method based on constructed prediction model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0071] A binary classifier F for judging the quality of application information developed in this embodiment (i.e., the analysis of the quality of corresponding data), the application process is divided into the following four steps:

[0072] 1. Using any data source in user credit information or other data, this implementation develops a binary classifier with a value in [300, 900] points. The binary classifier uses the GBDT model and uses 139 variables;

[0073] 2. Analyze the performance of the binary classifier F in different ranges of scores within [300, 900], select an interval [450, 750], so that the performance of the binary classifier can achieve relatively high accuracy outside this interval ; This interval includes the sample score 600 when the independent variable is completely missing. On the left and right boundaries of the interval, the domain discrimination of the binary classifier exceeds the threshold of 0.6 (selected according to industry experience); the dom...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a low-cost and high-resolution data classification method based on a constructed prediction model, belongs to the technical field of classification prediction, and solves the problems of long data acquisition time, high acquisition cost and the like due to the fact that more data needs to be acquired to manufacture a plurality of binary classifiers in the prior art. The method comprises the steps that a binary classifier of a continuous prediction variable is constructed based on any existing data source, and the continuous prediction variable is predicted jointly through n independent variables without time information and m independent variables with time information; According to a prediction result of the binary classifier in a continuous prediction variable range, dividing the continuous prediction variable range into three intervals, namely a left interval, a middle interval and a right interval; Constructing an associated index based on the intermediate interval and the binary classifier; And predicting the data in the left interval and the right interval by using a binary classifier, and jointly predicting the data in the middle interval by using thebinary classifier and an associated index to obtain the classification of the final data. The method is used for classifying the data intervals by using the same data source.

Description

technical field [0001] A low-cost, high-discrimination data classification method based on a constructed prediction model is used for data interval classification using the same data source, and belongs to the technical fields of statistics, machine learning algorithms, and classification prediction. Background technique [0002] In machine learning and statistics, classification is the problem of identifying correspondences within a set of categories (subpopulations) to which new observations belong, based on a training dataset containing observations (or instances) whose category membership is known. For example, assigning a given email to the "spam" or "not spam" class, and assigning a diagnosis to a given patient based on observed patient characteristics (gender, blood pressure, presence or absence of certain symptoms, etc.) . In machine learning terminology, classification prediction is considered an instance of supervised learning, i.e. learning to obtain a training s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/243G06F18/214
Inventor 韩晗陈锐浩陈贻汕
Owner SICHUAN XW BANK CO LTD