A taxpayer credit evaluation method based on distributed automatic feature combination

A credit evaluation and feature combination technology, applied in data processing applications, finance, instruments, etc., to reduce tax risks, reduce artificial feature construction process, and improve computing speed

Active Publication Date: 2022-06-28
CHINA NAT SOFTWARE & SERVICE
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the scorecard model can only process processed features. To obtain a more accurate credit scoring effect, a large number of professionals are required to construct carefully calculated indicators.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A taxpayer credit evaluation method based on distributed automatic feature combination
  • A taxpayer credit evaluation method based on distributed automatic feature combination
  • A taxpayer credit evaluation method based on distributed automatic feature combination

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] This section provides detailed descriptions of specific embodiments of the invention.

[0043] The training process of the credit evaluation model of the distributed automatic combination feature can be mainly divided into five steps S1-S5.

[0044] In step S1, a training sample of the credit evaluation model needs to be constructed. The training sample selected here is based on the taxpayer, and includes the basic characteristics of the taxpayer in four main areas: basic information, declaration information, tax information, invoice information, and relationship network. , each of which includes rich fundamental features. In addition, the taxpayer's risk label is constructed according to the taxpayer's historical risk situation. Taxpayers with risky behaviors in the historical records are used as black samples, and taxpayers without risky behaviors are used as white samples for subsequent model training.

[0045] Step S2 uses a distributed random forest model to disco...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed automatic feature combination taxpayer credit evaluation method. The steps of the method include: 1) using the training samples to train the random forest model using the MapReduce distributed computing framework to obtain a distributed random forest model; 2) inputting the training samples into the distributed random forest model to generate each training input Multiple combined features of the sample; 3) Merge the generated combined features with the feature information of the corresponding taxpayer; 4) Use the combined features to train the scorecard model; 5) For a taxpayer to be evaluated for credit, use the distribution The random forest model generates the combined features of the taxpayer and merges them with the taxpayer's feature information, and then inputs the combined features of the taxpayer into the trained scorecard model to predict the taxpayer's credit score. The invention can carry out accurate credit evaluation of taxpayers.

Description

technical field [0001] The invention relates to a credit evaluation model and a taxpayer credit evaluation method, in particular to a credit evaluation model and a taxpayer credit evaluation method using distributed random forest for automatic feature combination, belonging to the field of computer big data processing. [0002] technical background [0003] Credit evaluation has been developed in the field of bank credit for decades. It is mainly used to evaluate the personal credit of people applying for loans, and to assist in the issuance of loans through credit evaluation to reduce the risk of bank capital gains and capital recovery. [0004] Taxpayer credit assessment in the field of taxation has only emerged in recent years, and it is mainly based on expert experience. Tax experts select indicators that can represent tax risks according to their professional experience, and assign different scores to different indicators. For each taxpayer, a lot of manual analysis and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06Q40/00
CPCG06Q40/10
Inventor 刘宗前武锦王彦李雪峰韩佶兴付婷婷郭乐乐
Owner CHINA NAT SOFTWARE & SERVICE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products