Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

GPU cluster-based multidimensional big data factorization method

A GPU cluster and big data technology, applied in the field of signal analysis, can solve problems such as result differentiation, high computational complexity, and insufficient computational resources

Inactive Publication Date: 2016-01-20
WUHAN UNIV
View PDF1 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] PARAFAC can analyze data of any size and dimension, but its computational complexity is high, and the performance requirements of the computer are also high. Therefore, the sliding window is used to divide the entire data, and PARAFAC is used to analyze the data in a certain dimension one by one. This lays the foundation for the dynamic tensor analysis of big data. Under this theory, the direct fusion of data makes the correlation between the data partly lost, and the obtained causal joint factor is difficult to reflect the dynamic characteristics of the original data.
[0005] In order to solve the problem of large-scale data, the mathematical theory of PARAFAC has some innovations. It regards large-scale data as the grid of small data, that is, gridPARAFAC ([Document 4]), which converts the decomposition of tensors into independent tensors. The decomposition of the quantum set, the output of the fusion tensor subset results can get all the factors of the tensor, this method is effective, but it faces two major problems, insufficient computing resources and the result differentiation caused by the division of the tensor subset

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU cluster-based multidimensional big data factorization method
  • GPU cluster-based multidimensional big data factorization method
  • GPU cluster-based multidimensional big data factorization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0075] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit this invention.

[0076] please see figure 1 , the present invention aims at the problem that the traditional grid parallel factor analysis model (gridPARAFAC) cannot handle large-scale, high-dimensional multidimensional data analysis, and proposes an effective multi-mode decomposition method for multi-dimensional big data based on (graphics processing unit) GPU clusters , that is, the hierarchical parallel factor analysis (H-PARAFAC) framework, which is based on gridPARAFAC (grid parallel factor analysis), includes the process of integrating tensor subsets under a coarse-grai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a GPU cluster-based multidimensional big data factorization method, aims to solve the problem that a conventional grid parallel factor analysis model cannot process large-scale and high-dimension multidimensional data analysis, and provides an effective pattern processing unit-based multidimensional big data multi-mode decomposition method, namely a hierarchical parallel factor analysis framework. The framework is based on the conventional grid parallel factor analysis model, comprises a process of integrating tensor subsets under a coarse-grained model and a process of calculating all of the tensor subsets and fusing factor subsets under a fine-grained model, and is operated on a cluster formed by a plurality of nodes; each node comprises a plurality of pattern processing units. Tensor decomposition on pattern processing unit equipment can fully utilize a powerful parallel computing capability and a paralleling resource generated in tensor decomposition; experimental results show that through the adoption of the method, executive time for acquiring tensor factors can be greatly shortened, the large-scale data processing capability is improved, and the problem that the computing resource is insufficient is well solved.

Description

technical field [0001] The invention belongs to the technical field of signal analysis, and relates to a multidimensional big data analysis method, in particular to an efficient multidimensional big data factorization method based on a GPU cluster. Background technique [0002] In complex applications based on data analysis, it is necessary to reflect the dynamic characteristics of large-scale tensors during the decomposition process without causing large data deformation. Today, the data scale and data dimension are constantly increasing. Today, it is facing more and more challenges. Finding useful information of data from multidimensional data is becoming more and more important in today's science and engineering, such as feature extraction and dimensionality reduction. Two-dimensional data decomposition methods, such as singular value decomposition (SVD), principal component analysis (PCA), and independent component analysis (ICA), directly applying these methods to the d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/50
Inventor 陈丹胡阳阳蔡畅李小俚王力哲
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products