Oversampling method and device based on SMOTE algorithm and electronic equipment
An oversampling technique, applied in computing, computer components, and character and pattern recognition, that addresses problems such as reduced prediction accuracy, distorted analysis results, and blurred sample boundaries. Its stated effects are to resolve data imbalance, optimize the sampling method, and improve the distribution.
Examples
Embodiment 1
[0044] An embodiment of the oversampling method based on the SMOTE algorithm of the present invention is described below with reference to Figures 1 to 3.
[0045] Figure 1 is a flowchart of the oversampling method based on the SMOTE algorithm of the present invention. As shown in Figure 1, the oversampling method includes the following steps.
[0046] Step S101: acquire a historical sample data set, and determine the positive and negative samples and their respective quantities.
[0047] Step S102: determine the majority-class sample data and the minority-class sample data, and vectorize the data.
[0048] Step S103: use an outlier detection method to screen target sample data from the minority-class sample data set.
[0049] Step S104: based on the SMOTE algorithm, oversample the target sample data to generate a specific amount of new sample data.
[0050] Step S105: according to the generated new sample data ...
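Steps S101 to S104 can be illustrated with a minimal, standard-library-only sketch. Everything below is an assumption made for illustration rather than the patented procedure: the function names (`split_classes`, `screen_targets`, `smote`), the outlier rule (a z-score on the mean distance to the k nearest minority neighbours), and the parameters `k` and `z_max` are hypothetical. The interpolation in `smote` follows the commonly described SMOTE scheme of placing a new point on the segment between a sample and one of its nearest neighbours.

```python
import math
import random
from collections import Counter


def split_classes(X, y):
    """S101/S102 (sketch): count labels, then separate minority and majority vectors."""
    counts = Counter(y)
    minority_label = min(counts, key=counts.get)  # least frequent label
    X_min = [x for x, lab in zip(X, y) if lab == minority_label]
    X_maj = [x for x, lab in zip(X, y) if lab != minority_label]
    return X_min, X_maj, counts


def knn_indices(points, i, k):
    """Indices of the k points closest to points[i], excluding i itself."""
    others = (j for j in range(len(points)) if j != i)
    return sorted(others, key=lambda j: math.dist(points[i], points[j]))[:k]


def screen_targets(X_min, k=3, z_max=2.0):
    """S103 (assumed rule): drop minority points that sit unusually far from their neighbours."""
    scores = [
        sum(math.dist(X_min[i], X_min[j]) for j in knn_indices(X_min, i, k)) / k
        for i in range(len(X_min))
    ]
    mean = sum(scores) / len(scores)
    std = math.sqrt(sum((s - mean) ** 2 for s in scores) / len(scores)) or 1.0
    return [x for x, s in zip(X_min, scores) if (s - mean) / std <= z_max]


def smote(targets, n_new, k=3, seed=0):
    """S104 (sketch): interpolate between a target sample and one of its k nearest neighbours."""
    rng = random.Random(seed)
    new_samples = []
    for _ in range(n_new):
        i = rng.randrange(len(targets))
        j = rng.choice(knn_indices(targets, i, k))
        gap = rng.random()  # position along the segment, in [0, 1)
        new_samples.append([a + gap * (b - a) for a, b in zip(targets[i], targets[j])])
    return new_samples
```

In this sketch the "specific amount" of S104 is left to the caller. One plausible choice, again an assumption, is `n_new = len(X_maj) - len(X_min)`, which brings the two classes to equal size. Screening before interpolation means an isolated minority point is never used as an endpoint, so no synthetic sample is placed on a segment leading to it.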
Embodiment 2
[0092] A device embodiment of the present invention is described below; the device can be used to execute the method embodiment of the present invention. The details described in the device embodiment should be regarded as supplementing the method embodiment above, and details not disclosed in the device embodiment can be implemented by referring to the method embodiment.
[0093] Referring to Figures 4, 5 and 6, the present invention also provides a SMOTE-algorithm-based oversampling device 400 for financial risk assessment or prediction, including: a data acquisition module 401 for acquiring a historical sample data set and determining the positive and negative samples and their respective quantities; a determining module 402 for determining the majority-class sample data and the minority-class sample data and vectorizing the data; and a screening module 403 that is used to use t...
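The module arrangement of device 400 can be pictured as a small composition of callables, one per module. This is only a sketch under stated assumptions: the class name `OversamplingDevice`, the record shape (a feature dictionary plus a label), the zero-fill for missing features, and the pass-through default for the screening module are all hypothetical, and the modules after 403 are omitted because the source text is truncated at that point.

```python
from collections import Counter


def acquire(records):
    """Module 401 (sketch): take historical records and count each label."""
    counts = Counter(label for _, label in records)
    return records, counts


def determine(records, counts):
    """Module 402 (sketch): split majority/minority and vectorize the features."""
    minority_label = min(counts, key=counts.get)
    # Fixed feature order so every vector has the same layout.
    keys = sorted({k for feats, _ in records for k in feats})

    def vectorize(feats):
        # Assumption: a missing feature is filled with 0.0.
        return [float(feats.get(k, 0.0)) for k in keys]

    minority = [vectorize(f) for f, lab in records if lab == minority_label]
    majority = [vectorize(f) for f, lab in records if lab != minority_label]
    return majority, minority


class OversamplingDevice:
    """Hypothetical stand-in for device 400: each module is a pluggable callable."""

    def __init__(self, acquisition=acquire, determining=determine, screening=None):
        self.acquisition = acquisition   # module 401
        self.determining = determining   # module 402
        # Module 403 placeholder: passes the minority vectors through unchanged.
        self.screening = screening or (lambda minority: minority)

    def prepare(self, records):
        records, counts = self.acquisition(records)
        majority, minority = self.determining(records, counts)
        return majority, self.screening(minority)
```

Keeping each module as a separate callable mirrors the one-module-per-step layout of the description and lets a real outlier screen be supplied through the `screening` argument without touching the other modules.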