Data mining method with model early-warning updating mechanism

A data mining and model technology, applied in data mining, electrical digital data processing, special data processing applications, etc., can solve the problems of low accuracy of repeated execution process, lack of actual data verification, small number of models, etc., to improve generalization performance , the effect of reducing the implementation cost and improving the accuracy of the model

Inactive Publication Date: 2018-09-07
成都优易数据有限公司
View PDF8 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0017] The purpose of the present invention is: the present invention provides a data mining method with a model early warning update mechanism, which solves the low accuracy and high cost of the existing data mining process due to the small number of models and the lack of actual data inspection resulting in repeated execution of the process The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data mining method with model early-warning updating mechanism
  • Data mining method with model early-warning updating mechanism
  • Data mining method with model early-warning updating mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0069] In the data mining project, the machine learning model is used to predict whether customers will churn in the future. The data feature field is the customer's behavior data for a period of time, and the label field is the customer's churn status and the value is: yes / no; after the model is trained, use The test data is tested to obtain the evaluation indicators of model performance, namely the correct classification rate, the accuracy rate and recall rate for specific values ​​​​of the label column, and the AUC value. The AUC value judges the generalization ability of the model, that is, it will The ability to classify samples of different categories; set model early warning rules based on these indicators, but after testing the new test numbers of the model, compare the new indicators with the prediction rules, and after the model early warning rules are met, the system will give the user a corresponding alarm remind.

[0070] The calculation process of the early warni...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data mining method with a model early-warning updating mechanism, and relates to the field of CRISP-DM (cross-industry standard process for data mining). The method includes:1, obtaining a preliminary scheme of a service objective by commercial understanding, and defining a model failure determination principle according to the service objective; 2, carrying out data understanding and data preparation in sequence on the basis of the preliminary scheme to obtain a data set suitable for modeling analysis; 3, training multiple models on the basis of the data set and thefailure determination principle to complete establishment and optimization of the models; 4, carrying out model evaluation and preliminary deployment on the multiple established models, then judgingwhether the same meet an early-warning rule and need to be updated, if the same meet the early-warning rule, recalculating the models to complete updating, and then jumping to a step 5, and if the same do not need to be updated, directly jumping to the step 5; and 5, carrying out final deployment of the models to complete data mining. According to the method, problems of low precision and high costs brought by repeated execution processes caused by existing data mining processes due to smaller model numbers are solved, and effects of improving model precision and reducing costs are achieved.

Description

technical field [0001] The invention relates to the field of cross-industry data mining standard procedures, in particular to a data mining method with a model early warning update mechanism. Background technique [0002] CRISP-DM (cross-industry standard process for data mining) "Cross-industry standard process for data mining" is a data mining methodology jointly developed by NCR, OHRA, SPSS, Daimler-Benz and other global companies. Compared with other existing data mining methodologies , the CRISP-DM methodology is more superior, so it is widely used. [0003] The CRISP-DM methodology defines data mining practice as six standard stages, which are business understanding, data understanding, data preparation, model building, model evaluation, and model deployment. The following are brief introductions: [0004] 1. Business understanding: [0005] Business understanding is to clarify the business goals to be achieved and transform them into data mining topics; understand t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F2216/03
Inventor 勇萌哲普雪飞
Owner 成都优易数据有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products