Abnormity detection method for enterprise industry classification

An anomaly detection and industry-leading technology, applied in neural learning methods, biological neural network models, resources, etc., can solve problems that are difficult, unable to effectively extract detailed features of data, and difficult to select k values

Active Publication Date: 2019-04-19
XI AN JIAOTONG UNIV
View PDF11 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] Although the above traditional methods can solve their specific anomaly detection problems, it is difficult to directly extend to the anomaly detection problem of industry classification, because the anomaly detection of industry classification has the characteristics of multi-category and multi-level
First, enterprise industry classification is a multi-classification problem, and the variety of categories and the amount of data make the problem of anomaly detection complicated.
The self-encoder network structures in Documents 1 and 3 are too simple, with only one hidden layer, unable to effectively extract detailed features of the data, and severely lack generalization ability under large-scale data sets; Document 2 defines local outliers by using k-neighborhood dispersion coefficient, but in the case of many industry categories and a large amount of industry information data, the selection of k value becomes extremely difficult
Second, there is a hierarchical affiliation relationship between the industry category and the details of the enterprise. The two belong to different levels, and the industry details are the expansion and refinement of the industry category. Any enterprise corresponds to an industry category and a Industry details, each requires data with different information granularity (reflecting the level of detail of the information) for anomaly analysis. Documents 1-3 do not have a solution for the multi-level anomaly detection problem of industry classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Abnormity detection method for enterprise industry classification
  • Abnormity detection method for enterprise industry classification
  • Abnormity detection method for enterprise industry classification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0092] The present invention will be further described below with reference to the accompanying drawings and embodiments.

[0093] Select the taxpayer information registered in the national tax of a certain region from 2011 to 2017, including the sample data of 25 industry categories and 112 industry details, and each industry category has multiple different industry details. The present invention will be further described in detail below with reference to the accompanying drawings and in combination with experimental cases and specific embodiments. All technologies implemented based on the content of the present invention belong to the scope of the present invention.

[0094] like figure 1 As shown, in the specific implementation of this patent, the anomaly detection process for taxpayer industry 2-level classification includes the following steps:

[0095] Step 1. Text attribute processing

[0096] There is a lot of valuable information and knowledge in the taxpayer indus...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an abnormity detection method for enterprise industry classification, which comprises the following steps of: firstly, extracting to-be-mined text and non-text information in taxpayer industry information, and carrying out feature processing and coding processing; Secondly, constructing a deep network structure conforming to the industry classification abnormity detection problem, and determining the number of neurons of an input layer and an output layer of the network according to the characteristic dimension of the coded data; Thirdly, on the basis of the constructeddeep network structure, adopting different training strategies to train the industry large-class network and the industry detail network through cross validation; And finally, carrying out abnormitydetection on the industry large class by using dimension reduction characteristics of the industry large class network in combination with an SOS abnormity detection algorithm, and carrying out abnormity detection on industry details according to reconstruction characteristics of the industry detail network. According to the invention, the TADM model is utilized to carry out abnormal detection onthe original data, and macroscopic management work such as national statistics, tax collection and industrial and commercial management can be analyzed more reasonably and accurately.

Description

technical field [0001] The invention belongs to the field of data mining, in particular to a TADM (Two-level Anomaly Detection Model, 2-level anomaly detection model)-oriented anomaly detection method for industry classification of enterprises. Background technique [0002] After the reform and opening up, my country's national economy has developed rapidly, the market economy has continued to prosper, the country's economic structure has gradually improved, and the division of labor among enterprises has gradually become more refined. In the new era, the study of enterprise industry classification plays a fundamental role in promoting finance, taxation, and national standard management, and also provides a basis for further analysis of national economic industries and industrial development status, and grasping national economic development trends. The "National Economic Industry Classification" (GB / T 4754-2017) issued by the General Administration of Quality Supervision, I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/06G06Q50/26G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06Q10/0639G06Q50/26G06N3/045G06F18/2415
Inventor 郑庆华高宇达阮建飞赵珮瑶董博孙铭潞田雨润
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products