Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

System and method of improving performance of general GPU (Graphics Processing Unit)

A graphics processor and multi-processor technology, applied in the direction of processor architecture/configuration, image memory management, etc., can solve problems such as aggravated cache group access imbalance, reduce cache jitter, improve on-chip storage utilization, and improve performance Effect

Active Publication Date: 2018-09-14
SHANGHAI JIAO TONG UNIV
View PDF12 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the on-chip storage based on the fusion register file and data cache has its own unique characteristics: the on-chip fusion storage capacity of the set associative structure is large enough, and it is managed according to the cache method, and the register data needs to be protected first, which increases the burden on the cache between groups. access imbalance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method of improving performance of general GPU (Graphics Processing Unit)
  • System and method of improving performance of general GPU (Graphics Processing Unit)
  • System and method of improving performance of general GPU (Graphics Processing Unit)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The implementation of the present invention is described below through specific examples and in conjunction with the accompanying drawings, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific examples, and various modifications and changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention.

[0028] figure 1 It is a system architecture diagram of a system for improving the performance of a general-purpose graphics processor in the present invention. like figure 1 Shown, a kind of system of the present invention improves the performance of general graphics processor, comprises:

[0029] The cache group pressure monitoring table 10 is set in the on-chip storage of mainstream mul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a system and a method of improving performance of a general GPU (Graphics Processing Unit). The system comprises a cache group pressure monitoring table, an access target blockarbitration unit and a remapping table, wherein the cache group pressure monitoring table is arranged in an on-chip memory of a mainstream multiprocessor for recording the reading and writing times of each cache group, and according to the recorded reading and writing times, the pressure state of each cache group is periodically updated; the access target block arbitration unit is used for arbitrating an access request according to the pressure state of the cache group and the remapping table, when a support group is searched for the request of a cache group with large pressure, the support group is searched, a searching result is obtained according to the cache group pressure monitoring table, and further, in the remapping table, the mapping relationship between the cache group with large pressure and the support group is built; and the remapping table is arranged in the on-chip memory of the mainstream multiprocessor for recording the mapping relationship between each high-frequencyvisited cache group and the support group, and the mapping is released through changing an effective status bit. Thus, the on-chip resource efficiency of the GPU can be improved.

Description

technical field [0001] The invention relates to the field of general graphics processor chip design, in particular to a system and method for improving the performance of a general graphics processor. Background technique [0002] Graphics processors have experienced tremendous growth in recent years as general-purpose and high-throughput devices. Relying on a large number of parallel threads and huge memory bandwidth, general-purpose graphics processors achieve superior performance. On-chip storage as a parallel thread and memory bridge is implemented using a very complex multi-level structure, including registers, shared memory, and data caches. Jing et al. According to the phenomenon of low utilization of internal registers and insufficient data caches in each stream multiprocessor in graphics processors, they proposed to fuse on-chip registers and caches, and integrate the two into one, so as to improve resource utilization. and to improve performance. [0003] Howeve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T1/60G06T1/20
CPCG06T1/20G06T1/60
Inventor 景乃锋王建飞王琴蒋剑飞贺光辉梁晓峣毛志刚
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products