Method and apparatus for precisely identifying effective addresses associated with hardware events

a technology of effective addresses and hardware events, applied in the field of computer systems, can solve the problems of reducing performance and difficulty in identifying instruction program statements or other information, and achieve the effect of reducing the difficulty of identifying instruction instruction information and reducing the difficulty of program identification

Active Publication Date: 2007-02-22
ORACLE INT CORP
View PDF14 Cites 161 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0010] A performance instrumentation mechanism is contemplated in which an indication that a particular event has occurred may be conveyed when in fact the event may not have occurred. In one embodiment, a counter is utilized to record the detection of events. The counter is initialized to a negative value and incremented in response to detecting an event which is being monitored. In response to detecting the counter has ...

Problems solved by technology

If the number of cache misses is suspiciously high, then that may point towards the cause of the reduced performance.
The further away the instruction which was executing when the trap occurred from the event-causing instruction, the more difficult it is to associate a program ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for precisely identifying effective addresses associated with hardware events
  • Method and apparatus for precisely identifying effective addresses associated with hardware events
  • Method and apparatus for precisely identifying effective addresses associated with hardware events

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

Overview of Multithreaded Processor Architecture

[0048] A block diagram illustrating one embodiment of a multithreaded processor 10 is shown in FIG. 1. In the illustrated embodiment, processor 10 includes a plurality of processor cores 100a-h, which are also designated “core 0” though “core 7”. Each of cores 100 is coupled to an L2 cache 170 via a crossbar 105. L2 cache 170 is coupled to one or more memory interface(s) 180, which are coupled in turn to one or more banks of system memory (not shown). Additionally, crossbar 105 couples cores 100 to input / output (I / O) interface 190, which is in turn coupled to a peripheral interface 195 and a network interface 185. As described in greater detail below, I / O interface 190, peripheral interface 195, and network interface 185 may respectively couple processor 10 to boot and / or service devices, peripheral devices, and a network.

[0049] Cores 100 may be configured to execute instructions and to process data according to a particular instruc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A system and method for precisely identifying an instruction causing a performance-related event is disclosed. The instruction may be detected while in a pipeline stage of a microprocessor preceding a writeback stage and the microprocessor's architectural state may not be updated until after information identifying the instruction is captured. The instruction may be flushed from the pipeline, along with other instructions from the same thread. A hardware trap may be taken when the instruction is detected and/or when an event counter overflows or is within a given range of overflowing. A software trap handler may capture and/or log information identifying the instruction, such as one or more extended address elements, before returning control and initiating a retry of the instruction. The captured and/or logged information may be stored in an event space database usable by a data space profiler to identify performance bottlenecks in the application containing the instruction.

Description

CONTINUING APPLICATION DATA [0001] This application is a Continuation-In-Part of U.S. patent application Ser. No. 10 / 881,032 titled “Performance Instrumentation in a Fine Grain Multithreaded Multicore Processor,” filed Jun. 30, 2004, whose inventors are Gregory F. Grohoski, Paul J. Jordan, and Yue Chang, and which is herein incorporated by reference in its entirety.BACKGROUND OF THE INVENTION [0002] 1. Field of the Invention [0003] The invention relates to computing systems and, more particularly, to performance monitoring and profiling of software applications. [0004] 2. Description of the Relevant Art [0005] Modern processors typically include performance monitoring logic (PML) to measure processor performance while running application code and to help identify performance bottlenecks. Two features commonly found in PML are (1) the ability to count certain processor events, such as cache misses, branch mispredictions, etc., and (2) the ability to cause a trap when a counter reache...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G21C17/00
CPCG06F11/3447G06F11/3466G06F11/3471G06F11/3476G06F2201/86G06F2201/865G06F2201/88
Inventor KOSCHE, NICOLAIGROHOSKI, GREGORY F.JORDAN, PAUL J.
Owner ORACLE INT CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products