Dynamic enabling and disabling of SIMD units in a graphics processor

A technique for dynamically enabling and disabling SIMD units in a graphics processor, applied in image data processing, power management for data processing, 3D image processing, and related fields. It addresses the low efficiency and increased power consumption of graphics processors, and achieves better utilization and power savings.

Active Publication Date: 2013-05-01
ADVANCED MICRO DEVICES INC
Cites: 4 | Cited by: 6

AI Technical Summary

Problems solved by technology

However, conventional approaches continuously and actively supply clock signals to these SIMDs.
This increases the power consumption of the graphics processor and is inefficient.


Examples

Embodiment Construction

[0030] As discussed above, embodiments of the present invention achieve power savings by dynamically activating and deactivating individual SIMDs in a shader complex.

[0031] Embodiments dynamically disable SIMDs when the graphics processing unit needs less performance or to lower its thermal design power (TDP). Embodiments can also re-enable disabled SIMDs for high-performance applications without flushing the graphics pipeline of the graphics processing unit. This is achieved by dynamically switching a number of SIMDs without flushing the SP. Dynamic control (or switching) is achieved in part by appropriately programming a set of registers.
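As a rough illustration of register-based control, the sketch below models a single enable register in which bit i marks SIMD i as active. The class name `SimdEnableRegister`, its methods, and the one-bit-per-SIMD layout are assumptions made for this example; the text does not describe the actual register format.

```python
class SimdEnableRegister:
    """Hypothetical enable register: bit i set means SIMD i is enabled."""

    def __init__(self, num_simds):
        self.num_simds = num_simds
        self.all_mask = (1 << num_simds) - 1
        self.value = self.all_mask  # assumption: every SIMD starts enabled

    def _check(self, simd):
        if not 0 <= simd < self.num_simds:
            raise IndexError(f"SIMD index {simd} out of range")

    def disable(self, simd):
        """Clear the bit for one SIMD; no pipeline flush is modeled."""
        self._check(simd)
        self.value &= ~(1 << simd) & self.all_mask

    def enable(self, simd):
        """Set the bit for one SIMD."""
        self._check(simd)
        self.value |= 1 << simd

    def is_enabled(self, simd):
        self._check(simd)
        return bool((self.value >> simd) & 1)

    def enabled_simds(self):
        return [i for i in range(self.num_simds) if self.is_enabled(i)]


reg = SimdEnableRegister(4)
reg.disable(2)              # lower performance need: switch one SIMD off
print(reg.enabled_simds())  # SIMDs still eligible for new work
reg.enable(2)               # high-performance phase: switch it back on
```

Switching a SIMD here is a single register write, which is the property that lets software change the active set without tearing down any other state.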

[0032] In an embodiment, a shader pipe interpolator (SPI) assigns new work (or threads) according to registers that are dynamically configured to indicate which SIMDs are activated. In one embodiment, such a dynamic configuration takes effect after the currently in-flight requests (or pending requests) have been serviced by the SIMD that is to be disabled. Once disabl...
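The allocation behavior described in [0032] can be sketched as a small simulation: new work goes only to SIMDs whose enable bit is set, while a SIMD that has just been disabled keeps servicing whatever was already queued on it. The names `Simd`, `Dispatcher`, `set_enable_mask`, and `is_quiesced`, as well as the round-robin policy, are assumptions for illustration rather than details taken from the patent.

```python
from collections import deque


class Simd:
    def __init__(self, index):
        self.index = index
        self.pending = deque()  # in-flight work items

    def step(self):
        """Retire one in-flight work item, or return None if idle."""
        return self.pending.popleft() if self.pending else None


class Dispatcher:
    """Hypothetical stand-in for the unit that allocates new work."""

    def __init__(self, num_simds):
        self.simds = [Simd(i) for i in range(num_simds)]
        self.enable_mask = (1 << num_simds) - 1
        self._next = 0  # round-robin cursor (assumed policy)

    def set_enable_mask(self, mask):
        """Reprogram which SIMDs may receive new work; nothing is flushed."""
        self.enable_mask = mask & ((1 << len(self.simds)) - 1)

    def dispatch(self, work):
        """Queue work on the next enabled SIMD and return its index."""
        n = len(self.simds)
        for offset in range(n):
            i = (self._next + offset) % n
            if (self.enable_mask >> i) & 1:
                self.simds[i].pending.append(work)
                self._next = (i + 1) % n
                return i
        raise RuntimeError("no SIMD is enabled")

    def is_quiesced(self, i):
        """True once SIMD i is disabled and has drained its pending work."""
        disabled = not (self.enable_mask >> i) & 1
        return disabled and not self.simds[i].pending


d = Dispatcher(2)
d.dispatch("t0")
d.dispatch("t1")            # lands on SIMD 1
d.set_enable_mask(0b01)     # disable SIMD 1 while "t1" is still pending
d.dispatch("t2")            # new work now goes only to SIMD 0
d.simds[1].step()           # SIMD 1 finishes its outstanding request
print(d.is_quiesced(1))
```

Separating "stops receiving new work" from "has finished old work" is what allows the change to take effect without a flush: the disabled SIMD simply drains.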

Abstract

Systems and methods to improve performance in a graphics processing unit are described herein. Embodiments achieve power saving in a graphics processing unit by dynamically activating / deactivating individual SIMDs in a shader complex that comprises multiple SIMD units. On-the-fly dynamic disabling and enabling of individual SIMDs provides flexibility in achieving a required performance and power level for a given processing application. In this way, embodiments achieve optimal usage of a graphics processing unit. Embodiments of the invention also achieve dynamic grain (e.g., medium grain) clock gating of SIMDs in a shader complex. Embodiments reduce switching power by shutting down clock trees to unused logic by providing a clock on demand mechanism. In this way, embodiments enhance clock gating to save more switching power for the duration of time when SIMDs are idle (or assigned no work).
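The clock-on-demand idea in the abstract can be illustrated with a toy cycle counter: a SIMD's clock runs only while it has work assigned and is gated otherwise. The class `ClockGatedSimd`, the one-item-per-cycle rate, and the absence of any wake-up latency are simplifying assumptions for this sketch, not properties stated in the source.

```python
class ClockGatedSimd:
    """Toy model: the clock is requested when work arrives, gated when idle."""

    def __init__(self):
        self.pending = 0
        self.clock_on = False
        self.active_cycles = 0  # cycles in which the clock tree toggled
        self.gated_cycles = 0   # cycles in which the clock was shut off

    def submit(self, items=1):
        """Assign work; arrival of work requests the clock on demand."""
        if items < 1:
            raise ValueError("items must be at least 1")
        self.pending += items
        self.clock_on = True

    def tick(self):
        """Advance one cycle, retiring one item if the clock is running."""
        if self.clock_on:
            self.active_cycles += 1
            self.pending -= 1          # assumption: one item per cycle
            if self.pending == 0:
                self.clock_on = False  # no work left: gate the clock
        else:
            self.gated_cycles += 1


simd = ClockGatedSimd()
simd.tick()
simd.tick()          # idle: both cycles are gated
simd.submit(3)
for _ in range(5):
    simd.tick()      # three active cycles, then gated again
print(simd.active_cycles, simd.gated_cycles)
```

In this model the gated cycles stand in for the switching power that would otherwise be spent clocking idle logic.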

Description

Technical field

[0001] Embodiments of the invention generally relate to the control of single instruction multiple data (SIMD) units in a graphics processing unit (GPU).

Background

[0002] The math unit of a graphics processor includes an arithmetic logic unit (ALU) or math unit configured to perform integer, logical, and single/double precision floating point operations.

[0003] Graphics processors may include arrays of such math units, known as shader cores. A shader core includes a shader pipeline (SP). For higher performance, multiple SPs can be configured to work together as stacked SPs. SPs can be organized as arrays of quad pipes (QPs) and SIMDs. All shader units of each SIMD can conditionally execute the same ALU instruction on different sets of primitive, vertex, or pixel values. In this way, the SIMDs provide arithmetic processing power to the graphics processor.

[0004] In conventional GPUs with stacked SPs, SIMD can be statically enabled or disable...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/30, G06F9/38, G06F1/32, G06T15/00
CPC: Y02B60/142, G06T13/20, G06F1/3287, G06T15/005, G06F9/5083, G06F9/30101, G06F1/3203, G06F9/3867, G06T2210/52, G06F9/5094, G06F9/3842, Y02B60/32, Y02B60/1282, G06F9/3897, G06F9/3887, Y02D10/00, Y02D30/50, G06F9/3888, G06F9/38, G06F9/30, G06F1/32, G06T15/00, G06F9/50
Inventors: Tushar K. Shah, Rashad Oreifej, Michael J. Mantor, Brian Emberling
Owner: ADVANCED MICRO DEVICES INC