Spark big data platform-based neighborhood density imbalance data mixed sampling method
A big data platform, mixed sampling technology, applied in electrical digital data processing, digital data information retrieval, special data processing applications, etc. The interference of environmental factors and individual factors can improve the classification accuracy, improve the modeling efficiency, and improve the recognition rate.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.
[0031] The technical scheme that the present invention solves the problems of the technologies described above is:
[0032] A kind of unbalanced data mixed sampling method based on the neighborhood density of Spark big data platform, comprises the following steps:
[0033] Upload the local unbalanced data set to the big data platform, normalize the data through z-score, use HDFS distributed storage, combine the distributed computing framework Spark to read the data file in HDFS and save it as RDD, and then save it is a LabelPoint object. Specific steps include:
[0034] First create an RDD data set in a distributed manner through the textFile method of the Spark Context object (for parallel computing in...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com