Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A GPU-based fast solution method for L1 minimization problem

A minimization and problem-solving technology, which is applied in the field of fast solving of GPU-based L1 minimization problems, can solve problems such as large overhead, CUBLAS does not support fusion, performance fluctuations, etc., achieve multi-level cache control optimization, and reduce global data access , the effect of high parallelism and adaptability

Active Publication Date: 2018-05-08
ZHEJIANG UNIV OF TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, through testing, it is found that the matrix-vector multiplication method in CUBLAS will produce performance fluctuations as the number of matrix rows or columns increases, and the maximum and minimum performance gaps are significant.
CUBLAS does not support Fusion. When facing multiple concurrent L1 minimization problems, it cannot make full use of the new features of the existing GPU, and cannot optimally configure the computing resources of the entire GPU, resulting in large additional overhead.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A GPU-based fast solution method for L1 minimization problem
  • A GPU-based fast solution method for L1 minimization problem
  • A GPU-based fast solution method for L1 minimization problem

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In the following description, combined with the attached Figure 1-7 The present invention is further explained in detail with specific implementation methods.

[0047] The fast iterative shrinkage threshold algorithm is an iterative shrinkage threshold algorithm, an accelerated version implemented by combining Nesterovs optimal gradient algorithm, with a non-asymptotic convergence rate O(k 2 ). The algorithm adds a new sequence {y k ,k=1,2,…}, the specific iteration steps are as follows:

[0048]

[0049] Among them, λ is a scalar weight, soft(u,a)=sign(u) max{|u|-a,0} is a soft threshold operator, y 1 =x 0 , t 1 = 1, L f is the Lipschitz constant associated with ▽f(·), which can be calculated by A T The characteristic spectrum of A is obtained (||A T A|| 2 ),▽f(y k )=A T (Ay k -b).

[0050] The invention relates to solving the L1 minimization problem, adopts a fast iterative shrinkage threshold algorithm, and the algorithm mainly involves vector operat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a GPU-based L1 minimization problem fast solving method. A CUDA parallel computation model is utilized on a NVIDIA Maxwell-architecture GPU device, and an L1 minimization problem fast solving method is provided by utilizing GPU new features and internal kernel merging and optimizing technologies. According to the GPU-based L1 minimization problem fast solving method, not only designs of self-adaptation optimal vector computation, non-transposed matrix vector multiplication and transposed matrix vector multiplication are included, but also parallel solving of single or parallel multiple L1 minimization problems can be implemented only by simple CUDA thread allocation configuration difference. Based on experimental results, the GPU-based L1 minimization problem fast solving method provided by the invention is efficient, and also has high parallelism and adaptability. Compared with the existing parallel solving method, the GPU-based L1 minimization problem fast solving method has largely improved performances.

Description

technical field [0001] The invention relates to the fields of signal processing and face recognition, and more specifically relates to a GPU-based fast solution method for L1 minimization problem. Background technique [0002] The L1 minimization problem is min||x|| 1 , satisfying the constraint Ax=b, where A∈R m×n (m<<n) is a dense matrix of full rank, b∈R m is a preset vector, x∈R n is an unknown solution. The solution of the L1 minimization problem, also known as sparse representation, has been widely used in many fields, such as signal processing, machine learning and statistical inference, etc. To solve the L1 minimization problem, researchers have designed many effective algorithms. For example, gradient projection method, truncated Newton interior point method, homotopy method, iterative shrinkage threshold method and augmented Lagrangian multiplier method, etc. In practice, b often contains noise, so a variant of this problem is called the unconstrained b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/38G06F9/50
CPCG06F9/3867G06F9/5044
Inventor 高家全李泽界王宇
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products