Data dimension reduction method based on parallel principal component analysis (PCA) algorithm
A principal component analysis and data dimensionality reduction technology, applied in computing, computer parts, instruments, etc., can solve problems such as inability to load at one time, large data scale, etc., to reduce I/O operations and improve processing efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings.
[0031] like figure 1 , a data dimensionality reduction method based on parallel principal component analysis algorithm, including the following steps:
[0032] S1: Construct the sample data matrix D by constructing the data to be dimensionally reduced in such a way that each row represents one data and the number of rows represents the number of samples n×m , where the dimension of the sample data is m and the number of samples is n;
[0033] S2: According to the Hadoop cluster node environment used for data processing, the sample data matrix D n×m Carry out horizontal division and divide into N blocks (to ensure that each block can be loaded into memory), that is, D={D 1 ,D 2 ,...,D N}, distribute the data block to N machines for processing respectively, each machine calculates the square matrix and sum vector of the correspond...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com