Unlock instant, AI-driven research and patent intelligence for your innovation.

Cross-regional enterprise tax evasion identification method based on PU learning

An identification method and tax evasion technology, applied in the field of tax audit, can solve the problems of tax evasion identification, low model feasibility, and limited tax audit.

Active Publication Date: 2020-03-06
XI AN JIAOTONG UNIV
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the field of taxation, the case selection method based on data mining still faces the following problems: due to the uneven economic development among provinces and cities, there are great differences in the scale and characteristics of taxation data; However, some areas (target domains) lack or even have no marked data; because tax audits are limited by limited manpower and material resources, even in developed areas, it is impossible to audit all enterprises. In practice, tax audits are often used The sampling inspection method makes the areas with sufficient labeled data (source domain) only have a small number of marked positive labels, so it is impossible to establish a unified tax evasion identification model
[0008] Document 2 proposes a corporate tax evasion identification method based on deep adversarial transfer learning. By using the tax data in the source domain, a tax evasion identification model suitable for the target domain is constructed to solve the problem that the target domain cannot be detected due to the lack of labeled data. Problems of identification and modeling of corporate tax evasion
[0009] The method described in the above literature mainly has the following problems: the identification model of literature 1 requires the characteristics of tax data to be independent and identically distributed and needs to rely on certain expert knowledge, which cannot solve the problem of different distribution of tax data characteristics in different regions and cannot effectively identify tax evasion across regions this question
The method in Document 2 requires that the source domain tax data have all labels, but in actual tax scenarios, the source domain tax data often only has positive samples and a large number of unlabeled samples. The premise of this method does not match the actual tax scenario. Labeling takes a lot of time and cost, making the model less viable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-regional enterprise tax evasion identification method based on PU learning
  • Cross-regional enterprise tax evasion identification method based on PU learning
  • Cross-regional enterprise tax evasion identification method based on PU learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0077] In order to illustrate the technical solution of the present invention more clearly, a method for identifying tax evasion by cross-regional enterprises based on PU learning of the present invention will be described in detail below in conjunction with the drawings and specific embodiments.

[0078] In this example, the source domain is Guangdong Province, and the target domain is Shaanxi Province. The tax evasion data of enterprises in Shaanxi Province is established by using the tax payment data of Guangdong Province composed of positive samples and unlabeled samples and the tax payment data of Shaanxi Province that are completely unlabeled. Such as figure 1 Shown, the present invention mainly comprises the following steps:

[0079] Step 1. Generate source domain tax data labels

[0080] As the tax data of Guangdong Province, the tax data marked as positive samples (tax evasion enterprises) only account for a small part of the samples in Guangdong Province, most of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-regional enterprise tax evasion identification method based on PU learning, and the method comprises the steps: firstly generating a label of a label-free sample in a source domain through a small number of positive samples with labels in the source domain based on PU learning; secondly, constructing a cross-regional enterprise tax evasion identification model through a domain adaptation method of maximizing classifier difference; and finally, performing tax evasion identification on the tax payment data of the target domain by using the trained tax evasion identification model. Under the condition that source domain tax payment data only has positive samples and a large number of unmarked samples, the purpose of establishing a tax evasion identification model for a target domain without a label of the tax payment data is achieved.

Description

technical field [0001] The invention belongs to the technical field of tax inspection, and in particular relates to a method for identifying tax evasion by cross-regional enterprises based on PU learning. Background technique [0002] In recent years, with the rapid development of computer science and Internet technology, the combination of domestic tax collection management and information technology has become a development trend in the era of big data. Among them, with the development of the "Golden Tax Phase III" and "Internet + Taxation", the domestic taxation department has accumulated a large amount of tax-related data. How to make full and effective use of these tax-related data to implement tax audit has become a new challenge. [0003] Due to the limitation of manpower and time, the tax inspection department cannot inspect every taxpayer. Generally, taxpayers who are suspected of tax evasion are sampled through audit selection, and the audit is carried out on thes...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06Q40/00
CPCG06N3/084G06Q40/123G06N3/045G06F18/2411G06F18/214
Inventor 郑庆华王伊杨董博高宇达张发阮建飞师斌陈妍
Owner XI AN JIAOTONG UNIV