Cross-feature federal abnormal data detection method based on isolated forest
An abnormal data detection, isolated technology, applied in the fields of instruments, character and pattern recognition, computer parts, etc., can solve problems such as abnormal data detection of difficult characteristic data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0143] Embodiment 1, model training process
[0144] Assume that institutions A, B, and C all have a common sample of 100,000, of which party A has the 2-dimensional features of the 100,000 samples, institution B has the other 5-dimensional features of the 100,000 samples, and institution C has the 100,000 samples. The other 10-dimensional features, the feature data distribution of the three institutions are shown in Table 1.
[0145] Table 1
[0146]
[0147] The current organization A hopes that the characteristic data of the federal agencies B and C will detect the abnormal data of the above 100,000 samples, so the agency A initiates a federal anomaly detection request to the agencies B and C, and the agencies B and C agree to participate; then the agency A As the initiator, institutions B and C are two of the participants, and the implementation steps of federated anomaly detection are as follows:
[0148] (1) Institution A, as the initiator, first sends the modeling ...
Embodiment 2
[0175] Embodiment 2, prediction embodiment
[0176] (1) Institutions A, B, and C participate at the same time, and the prediction samples are calculated on the leaf nodes divided on each isolated tree in the test isolated forest model, based on the number of training samples on the leaf nodes and the number of layers of the leaf nodes The path length of the sample falling on the leaf node;
[0177] (2), institutions A, B, and C calculate the path length of each prediction sample in all isolated trees according to formula (1), and then calculate their average path length in all isolated trees;
[0178] (3) Institutions A, B, and C calculate the abnormal score of each prediction sample according to the formula (b) based on the average path length of each prediction sample and the number of training samples of a single isolated tree. Finally, institutions A, B, and C Both get the abnormal score of this predicted sample;
[0179] Among them, the path length prediction process of...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com