Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method for accelerating GPU directed at deep learning super-resolution technology

A technology of super-resolution and deep learning, applied in the field of image super-resolution, can solve problems such as huge computing overhead and cannot meet the needs of practical applications, and achieve the effect of accelerating and optimizing computing speed

Active Publication Date: 2016-08-17
SHANGHAI JIAO TONG UNIV
View PDF3 Cites 57 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method relies on a huge amount of computing overhead, and it takes 300 seconds to execute this method with a CPU for each frame (1920*1080 to 3840*2160, single channel, all tests below are based on this resolution), even if GEMM-based GPU convolution and acceleration methods also require close to 1 second for each frame, which cannot meet the needs of practical applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for accelerating GPU directed at deep learning super-resolution technology
  • Method for accelerating GPU directed at deep learning super-resolution technology
  • Method for accelerating GPU directed at deep learning super-resolution technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0024] Such as figure 1 Shown, the flowchart of SRCNN. As a preferred embodiment of the present invention, the super-resolution GPU acceleration technology of the present invention is aimed at SRCNN, and its flow chart is as follows figure 1 As shown, it contains bicubic preprocessing (not marked), three convolutional layers and two RELU layers (following the first convolution and the second convolution respectively). The sizes of the three convolutional layers are (according to output channel *...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for accelerating a GPU directed at a deep learning super-resolution technology. The method conducts concurrent processing on all the steps of a super-resolution technology which is based on deep learning and a convolutional neural network, and operates on a GPU. The concurrent processing refers to conducting concurrent task dividing on convolutions of the super-resolution technology which is based on deep learning and the convolutional neural network into millions of micro-tasks which are irrelevant to one another and can be concurrently executed in any order so as to fully exhibit the super-strong calculating capability of the GPU. Further, the method uses features of a GPU memory to cache convolution nuclear data and input image data to a shared memory and a register so as to substantially optimize calculating speeds of the convolutions. The method integrates the convolutions and non-linear layers. The method selects the optimal method for the sizes of different convolutions. According to the invention, the method accelerates the high quality super-resolution method to meet velocity requirements for processing videos, and does not cause any image quality loss.

Description

technical field [0001] The present invention relates to an image super-resolution field and a GPU acceleration method, in particular to a GPU acceleration method for deep learning super-resolution technology. Background technique [0002] Image super-resolution is to convert a low-resolution image into a high-resolution image, which has a wide range of applications in image post-processing and video nonlinear editing. Early super-resolution methods (such as bicubic) are often based on simple interpolation, which can work quickly and reliably, and are easy to integrate on chips; however, the high-resolution images obtained by these methods are of poor quality and produce significant artifacts, such as rings, Effects such as aliasing, blurring, etc. Super-resolution methods of such quality are difficult to meet the current high-quality video demands. The current super-resolution methods with advanced performance can generate high-quality images, but with huge computational o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T3/40G06T1/20
CPCG06T1/20G06T3/4053
Inventor 宋利赵章宗解蓉
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products