Method and equipment for determining abnormal reasons of multi-dimensional sample data

A technology of sample data and abnormal causes, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problems of unrecognized proportion, unable to evaluate correlation characteristics, and uninterpretable abnormal detection results. Convenience and quick investigation, ensuring consistent results

Pending Publication Date: 2021-04-30
SHENGDOUSHI SHANGHAI SCI & TECH DEV CO LTD
View PDF12 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, although the single-dimensional outlier detection algorithm has good interpretability, it can only deal with sample data with a good distribution shape and cannot include the joint distribution characteristics between each dimension into the evaluation range, so the correlation between each dimension cannot be evaluated. feature
Although the high-dimensional outlier detection algorithm represented by the isolated forest model algorithm can well detect the complex joint distribution of high-dimensional sample data, the current mainstream application scheme only provides anomaly scores representing the degree of comprehensive anomaly
The anomaly score cannot represent the abnormal causes of multi-dimensional outliers, and cannot identify the main dimensions that are abnormal at the outliers and their proportions to the abnormal causes, that is, the abnormal detection results are not interpretable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and equipment for determining abnormal reasons of multi-dimensional sample data
  • Method and equipment for determining abnormal reasons of multi-dimensional sample data
  • Method and equipment for determining abnormal reasons of multi-dimensional sample data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Exemplary embodiments of the present application will now be described more fully with reference to the accompanying drawings. Exemplary embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein; Fully conveyed to those skilled in the art. In the drawings, the size of some elements may be exaggerated or deformed for clarity. The same reference numerals in the drawings denote the same or similar structures, and thus their detailed descriptions will be omitted.

[0018] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of the embodiments of the application. However, those skilled in the art will appreciate that the technical solutions of the present application may be practiced without one or more of the spec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for determining an abnormality reason of multi-dimensional sample data, which comprises the steps of obtaining the multi-dimensional sample data, generating an isolated forest model, inputting the multi-dimensional sample data into the isolated forest model to detect abnormality, and recording a final dimension and a corresponding statistical frequency, and determining the final occurrence probability of the final dimension and evaluating the contribution level of the final dimension to the exception. An apparatus for determining causes of anomalies of multi-dimensional sample data and a computer-readable storage medium are also presented. According to the method and the equipment, automatic anomaly reason analysis after anomaly value detection of the multi-dimensional sample data in complex distribution can be completed.

Description

technical field [0001] The present application relates to risk control, in particular to a method and equipment for determining abnormal causes of multi-dimensional sample data applied in the catering industry. Background technique [0002] In an industry such as the restaurant industry, there is a need for risk control of the transactional data of the people involved. The premise of risk control is to accurately detect abnormal behaviors in the transaction data of restaurant personnel. [0003] The transaction data of restaurant personnel usually has the characteristics of high dimensionality, multi-peak and complex joint distribution. The abnormal sample data in the sample data set is different from most sample data, and it accounts for a small proportion of the overall data. Therefore, traditional classification methods based on supervised learning, such as classification algorithms such as SVM and logistic regression, which use a large number of positive sample data and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/24G06F18/214
Inventor 胡旻皓
Owner SHENGDOUSHI SHANGHAI SCI & TECH DEV CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products