Multi-dense block detection and extraction method based on Possion distribution

A technology of extraction method and measurement method, which is applied in the direction of digital data information retrieval, special data processing applications, instruments, etc., can solve the problems of low detection accuracy and recall rate, achieve the effect of ensuring independence and integrity, and improving efficiency

Active Publication Date: 2022-03-22
NANJING COLLEGE OF INFORMATION TECH
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problems of low detection accuracy and recall rate of existing dense block detection methods, the present invention proposes a multi-dense block detection and extraction method based on Possion distribution, which takes into account the countin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-dense block detection and extraction method based on Possion distribution
  • Multi-dense block detection and extraction method based on Possion distribution
  • Multi-dense block detection and extraction method based on Possion distribution

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0037] Below in conjunction with accompanying drawing, technical scheme of the present invention is further described:

[0038] The present invention proposes a multi-dense block detection and extraction method based on Possion distribution, such as figure 1 , 2 shown, including the following steps:

[0039] Step A: Obtain multi-dimensional tensor data, the number m of dense blocks to be extracted, and the size range of dense blocks.

[0040] The acquisition method of multi-dimensional tensor data is as follows: 1. Data integration, integrating the data to be detected into the designated data center through ETL technology; 2. Data desensitization, in the data to be detected, sensitive information in the production environment ( For example, ID card number) for desensitization; 3. Data cleaning, cleaning the desensitized data to ensure the accuracy and consistency of the data; 4. Data preprocessing, modeling the cleaned data, Convert the data to be detected into multidimensi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Possion distribution-based multi-dense block detection and extraction method, which comprises the following steps of: carrying out suspicious degree measurement on multi-dimensional tensor data by utilizing a dense block suspicious degree measurement method to obtain a snapshots list containing a plurality of suspicious data snapshots; according to the snapshots list, extracting a single dense block from the multi-dimensional tensor data; removing the extracted single dense block from the multi-dimensional tensor data to obtain updated multi-dimensional tensor data; generating a new snapshots list according to the updated multi-dimensional tensor data, and extracting new dense blocks until m dense blocks are extracted; wherein the dense block suspicious degree measurement method is obtained through Possion distribution derivation containing double factors of counting and density. According to the method, the accuracy and recall rate of the dense blocks can be effectively improved while the detection efficiency is ensured.

Description

technical field [0001] The invention relates to a multi-dense block detection and extraction method based on Possion distribution, which belongs to the technical field of abnormal data detection. Background technique [0002] With the advent of the big data era, data anomaly detection becomes more and more important. Abnormal data will not only bring data fraud, affect the normal data analysis work, but may also lead to the loss and leakage of normal data. Therefore, it is the research focus in the field of data detection to quickly and accurately detect and extract abnormal data from massive data. [0003] There is a type of dense abnormal data in abnormal data. Dense abnormal data often has "consistency" and presents a dense block structure in tensor data. For this type of abnormal data, many dense block detection and extraction have appeared on the market. method, but the existing detection methods also have some shortcomings. For example, the CrossSpot detection method ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/64G06F16/215
CPCG06F21/64G06F16/215
Inventor 王俊松边荟凇虞振峰陈诚
Owner NANJING COLLEGE OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products