Method and device for realizing data pre-fetching

A prefetch and prefetcher technology, applied in electrical digital data processing, instruments, sustainable buildings, etc., can solve the problem of low storage utilization efficiency, waste of processor access to on-chip cache bandwidth and power consumption, and inability to determine whether the prefetch address is For problems such as addresses that have been prefetched, it can improve storage utilization efficiency, optimize performance, and reduce bandwidth requirements and power consumption overhead.

Active Publication Date: 2012-06-27
BEIJING PKUNITY MICROSYST TECH
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] However, there is still a lot of room for optimization in the delta-associative prefetcher
On the one hand, the difference correlation prefetcher uses the global history buffer (GHB, Global History Buffer) of the circular queue structure to save all invalidation addresses that have recently occurred in cache failure in chronological order. For a stride memory access mode, although the previous one The invalidation address and stride value can be fully represented, but the delta-dependent prefetcher needs to use multiple global history buffer entries to store all invalidation addresses that have recently occurred cache invalidation in the stride access mode, so the storage utilization efficiency lower
On the other hand, the delta-dependent prefetcher will issue multiple prefetch requests according to the rules when it encounters a pattern match, and cannot judge whether the current prefetch address is a prefetched address, so a large number of redundant prefetch requests will be generated. fetches, wasting the bandwidth and power consumption of the processor accessing the on-chip cache

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for realizing data pre-fetching
  • Method and device for realizing data pre-fetching
  • Method and device for realizing data pre-fetching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The technical solutions of the present invention will be described in detail below in conjunction with the accompanying drawings and preferred embodiments. It should be understood that the embodiments described below are only used to illustrate and explain the present invention, and are not intended to limit the technical solution of the present invention.

[0069] like figure 1 As shown, the internal structure of the embodiment of the prefetcher device for realizing data prefetching of the present invention is represented, including the memory access pattern recognition unit, wherein:

[0070] The memory access pattern recognition unit is used to divide the global invalidation address stream into local invalidation address streams, record the address information of the two most recent cache failures in the local invalidation address stream as historical memory access information in chronological order, and according to the Records capture stride fetch patterns and gen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a device for realizing data pre-fetching. A pre-fetcher device comprises: a memory access mode recognition unit divides a global failure address flow into a local failure address flow, records information of two addresses with caching failure happening lately in the local failure address flow as history memory access information according to a time sequence, and captures a spanning memory access mode according to the record and generates a corresponding pre-fetching address. According to the invention, the storage utilization efficiency of a difference value related pre-fetcher is improved and the redundancy pre-fetching having no contribution to pre-fetcher performance improvement is reduced, so that the pre-fetcher can obtain a better performance optimization effect under the condition of same storage expense, and the cached bandwidth requirement and power consumption expense on a pre-fetcher access sheet are reduced effectively.

Description

technical field [0001] The invention relates to a method for optimizing the memory access performance of a modern microprocessor, in particular to a method and device for realizing data prefetching. Background technique [0002] With the progress of the integrated circuit manufacturing process, the performance of the processor and the memory is continuously improved. However, due to the difference in the production process and development route between the two, the performance improvement speed of the processor is significantly faster than that of the memory, so that the performance gap between the processor and the memory continues to increase. As a result, when the on-chip cache (On-Chip Cache) fails and the memory access instruction needs to access the main memory, the whole process often takes hundreds or even hundreds of clock cycles, forming a memory wall (Memory Wall) problem. [0003] Cache and memory hierarchy based on the principle of program access locality have ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F12/08G06F12/0862
CPCY02B60/1225G06F12/0862G06F2212/6026Y02D10/00
Inventor 程旭党向磊王箫音佟冬陆俊林易江芳王克义
Owner BEIJING PKUNITY MICROSYST TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products