Video analysis and accelerating method based on thread level flow line

A video analysis and threading technology, applied in the field of video analysis, can solve problems such as waste of computing resources, inability to ensure CPU and GPU utilization, and inability to give full play to the computing capabilities of heterogeneous platforms, so as to improve processing speed, reduce communication overhead, and utilize rate-enhancing effect

Active Publication Date: 2017-01-25
HUAZHONG UNIV OF SCI & TECH
View PDF5 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the CPU and GPU are two processors with different architectures. Using a simple GPU optimization method, the CPU and GPU often wait for each other due to the different calculation time, resulting in waste of computing resources, and cannot ensure the optimal utilization of the CPU and GPU. Unable to give full play to the computing power of heterogeneous platforms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video analysis and accelerating method based on thread level flow line
  • Video analysis and accelerating method based on thread level flow line
  • Video analysis and accelerating method based on thread level flow line

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0020] figure 1 It is a process diagram of video analysis. The inventive method comprises the following steps:

[0021] Divide the video analysis task into multiple subtasks, with TC 1 TC 2 TC 3 TC 4 Indicates the CPU task number, TG 1 Indicates the GPU task label and divides the system into the following subtasks:

[0022] TC 1 : video decoding;

[0023] TC 2 : conversion of color spa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a video analysis and accelerating method. The method comprises the following steps: dividing a video frame processing task into four levels of subtasks according a sequential order, and allocating the subtasks to GPU and CPU to process; realizing each level of subtask through a thread, transmitting data to a thread for the next subtask after processing, and ensuring that all the threads perform concurrent execution; pausing and waiting when new tasks does not exist or a thread for the next level of subtask does not accomplish processing; adopting a first-in first-out (FIFO) buffer queue to transmit data between threads for two adjacent levels of subtasks; realizing asynchronous cooperation concurrency of CPU and GPU subtasks through CUDA function asynchronous invocation for two subtask not in a dependency relationship. According to the method, various computing resources in a heterogeneous system are effectively utilized, a reasonable task scheduling mechanism is established, and communication overhead between different processors is reduced, so that the computing power of each computing resource is given into full play, and the system efficiency is improved.

Description

technical field [0001] The present invention relates to the field of video analysis, and more specifically, to a method for accelerating video analysis based on a thread-level pipeline. Background technique [0002] Video analysis technology has been widely used in the security field. There is a large amount of image data processing in video analysis, which leads to low efficiency. With the popularity of high-definition cameras, it is necessary to optimize the performance of video analysis systems. In recent years, most optimization techniques use GPU to process complex calculations in video analysis, convert serial algorithms of complex calculation modules into parallel algorithms, and use GPUs for parallel computing. However, the CPU and GPU are two processors with different architectures. Using a simple GPU optimization method, the CPU and GPU often wait for each other due to the different calculation time, resulting in waste of computing resources, and cannot ensure the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04N7/01H04N7/015H04N7/18H04N5/76
CPCH04N5/76H04N7/0125H04N7/015H04N7/18
Inventor 凌贺飞李深唐堃
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products