Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Enterprise default risk assessment method and equipment based on GBDT algorithm and logistic regression model, and medium

A logistic regression model and risk assessment technology, applied in the field of financial credit, can solve problems such as low efficiency of assessment, poor accuracy of results, and difficulty in identifying the probability of corporate default risk, so as to improve credibility and reference and reduce bad debt rate Effect

Pending Publication Date: 2022-05-20
天元大数据信用管理有限公司
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The embodiment of this specification provides an enterprise default risk assessment method, equipment and medium based on GBDT algorithm and logistic regression model, which are used to solve the following technical problems in the prior art: the existing data due diligence on enterprises based on offline 1. Experts evaluate the default risk of enterprises based on experience, which has the problems of imperfect evaluation index system, low evaluation efficiency, and poor accuracy of results. It is difficult to effectively identify the probability of corporate default risk and avoid loan risks.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enterprise default risk assessment method and equipment based on GBDT algorithm and logistic regression model, and medium
  • Enterprise default risk assessment method and equipment based on GBDT algorithm and logistic regression model, and medium
  • Enterprise default risk assessment method and equipment based on GBDT algorithm and logistic regression model, and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0178] A total of 27,196 sample enterprises were selected, and these enterprises were divided according to the predefined standard with default risk, and 21,083 normal enterprises and 6,113 enterprises with default risk were obtained.

[0179] Taking the business hours data of each sample enterprise as an example, first obtain the business hours data of the sample enterprises stored in the third-party subject, and then standardize, calculate and standardize the data to obtain the standardized data of business hours. Calculate the WOE value of the standardized data of business hours, and use the WOE value to divide the variable of business hours into intervals.

[0180] There are two requirements for binning variables using WOE values:

[0181](1) The number of groups should not be less than 5% of the total number of samples, and each group must have both good and bad samples.

[0182] (2) The WOE values ​​between the groups should be spread as far as possible and present a tr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and equipment for constructing an enterprise default risk assessment model based on a GBDT algorithm and a logistic regression model, and a medium. The method comprises the following steps: acquiring data of a plurality of sample enterprises, integrating and processing the acquired data to obtain standardized data, calculating a WOE value and an IV value of the standardized data, performing variable binning and screening through the WOE value and the IV value, using a GBDT algorithm as a preposition algorithm of a logistic regression model, constructing a new combination feature by the GBDT algorithm, and constructing the new combination feature according to the logistic regression model. And inputting the data to the logistic regression model so as to construct an enterprise default risk assessment model, and finally assessing a to-be-assessed enterprise by the enterprise default risk assessment model to obtain an enterprise default risk assessment result, and identifying an enterprise default risk probability. According to the method, the model entering variables of the logistic regression model are combined by adopting the GBDT algorithm, the variables which contribute more to the model are selected, and meanwhile, the importance of the variables is analyzed, so that the model evaluation result is more reasonable, and the accuracy of model evaluation is effectively improved.

Description

technical field [0001] This application relates to the field of financial credit technology, in particular to a method, equipment and medium for evaluating enterprise default risk based on GBDT algorithm and logistic regression model. Background technique [0002] Small and micro enterprises occupy an important position in the process of my country's economic and social development, and they play a very important role in promoting economic development, increasing fiscal revenue, and providing social employment. However, small and micro enterprises are facing many problems in the process of development, among which financing difficulties are one of the important problems that plague the development of small and micro enterprises, which have increasingly attracted great attention from the society. [0003] The financing difficulties of small and micro enterprises are not only related to the conditions of small and micro enterprises themselves, but also closely related to the l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/06G06Q40/02G06N20/00
CPCG06Q10/0635G06N20/00G06Q40/03
Inventor 刘先淇郭英楠崔乐乐
Owner 天元大数据信用管理有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products