Unbalanced data classification method based on active learning
A data classification and active learning technology, applied in the field of machine learning, can solve the problems of time cost and labor cost, and achieve the effect of reducing sample size, saving time and labor cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0013] The invention can be used in the fields of credit card fraud transaction detection, information security detection and the like.
[0014] A kind of unbalanced data classification method based on active learning of the present invention, comprises the following steps:
[0015] randomly sampling samples from the original unlabeled data for labeling as initial training data; the original unlabeled data includes credit card transaction data;
[0016] Use a general machine learning model to perform cost-sensitive learning training on the initial training data;
[0017] Use the trained binary supervised classification model to predict all unlabeled samples in the original training data samples, and select the most uncertain N samples according to the uncertainty; respectively calculate the center point of the N samples and the trained data set The sum of the Euclidean distances, select M samples from N samples according to the order of distance from large to small, where M i...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com