Method, apparatus and computer program product for efficient per thread performance information

A technology for performance information and computer programs, applied in multi-programming arrangements, instruments, nuclear elements, etc. It addresses problems of previously known arrangements, such as inconsistent measurement overhead and the difficulty of monitoring the performance of short-duration events, and achieves less sample time overhead while avoiding overflow and counter interaction.

Inactive
Publication Date: 2005-08-02
IBM CORP
2 Cites, 160 Cited by

AI Technical Summary

Benefits of technology

[0008]The foregoing problem is addressed in the present invention. Since the 32-bit performance monitoring counters are hardware registers on the processor they are accessible in the “user” state, which involves less sample time overhead. However, according to the present convention, as described above, the 32-bit counters are constantly being reset in connection with thread switches to avoid overflow and counter interaction. The invention involves a recognition of the usefulness of reading the 32-bit counters directly despite the fact that their values are conventionally corrupted by resetting with each thread switch. The invention provides a way to use the accumulators and the 32-bit counters in a manner that permits the counters to be accessed more directly for performance measurement and that overcomes the complications of thread switching, counter resetting, overflow and interaction.
[0009]According to one form of the invention, a value in a performance monitoring counter register on a processor is incremented for occurrences of a monitored event, providing a measured value for the event. The value of the counter register for a first thread is saved responsive to a switch from the first thread to a second thread. The value is saved in a performance monitoring accumulator in system memory. Then, responsive to a switch back to the first thread, the value for the first thread is restored from the accumulator (instead of resetting the counter value). In this way, a performance monitoring counter register may be read, and its value, for the first thread, for example, provides a coherent meaning relative to a previous value for the same thread, despite any intervening thread switches. Since the counter register may be read directly, in the user state, this provides a faster and more consistent means for updating performance counts. Moreover, the value is saved in the accumulator in a manner consistent with the conventional accumulator format so that a larger accumulated value for the measured value can still be read using the conventional performance monitoring API.
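
The save-and-restore behavior described in paragraph [0009] can be illustrated with a small simulation. This is a minimal sketch, not the patented implementation: the names `SimulatedCounter`, `Scheduler`, and `switch_to` are hypothetical, the hardware register is modeled as a wrapping 32-bit integer, and the way the 32-bit value is folded into a 64-bit accumulator (adding the modulo-2**32 difference since the last restore) is an assumption, since the text here does not spell out the accumulator format. It also assumes fewer than 2**32 events occur between two switches of the same thread.

```python
MASK32 = 0xFFFFFFFF
MASK64 = 0xFFFFFFFFFFFFFFFF


class SimulatedCounter:
    """Stand-in for a 32-bit performance monitoring counter register."""

    def __init__(self):
        self.value = 0

    def count(self, events=1):
        # Wraps around like a 32-bit hardware register.
        self.value = (self.value + events) & MASK32


class Scheduler:
    """Hypothetical thread switcher that saves and restores the counter."""

    def __init__(self, counter):
        self.counter = counter
        self.accumulators = {}  # thread id -> 64-bit accumulated value
        self.current = None

    def _save(self, thread_id):
        acc = self.accumulators.get(thread_id, 0)
        # Events counted since the last restore, modulo 2**32 (assumed scheme).
        delta = (self.counter.value - acc) & MASK32
        self.accumulators[thread_id] = (acc + delta) & MASK64

    def switch_to(self, thread_id):
        if self.current is not None:
            self._save(self.current)  # save the outgoing thread's value
        self.current = thread_id
        # Restore the incoming thread's value instead of resetting to zero.
        self.counter.value = self.accumulators.get(thread_id, 0) & MASK32


counter = SimulatedCounter()
sched = Scheduler(counter)

sched.switch_to("A")
counter.count(100)
before = counter.value      # thread A reads the register directly

sched.switch_to("B")        # intervening thread switch
counter.count(7)

sched.switch_to("A")
counter.count(50)
after = counter.value       # second direct read by thread A

print(after - before)       # events seen by thread A between its two reads
```

Because the register is restored rather than reset, the difference between two direct reads by the same thread reflects only that thread's events, even though another thread ran in between.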

Problems solved by technology

Unfortunately, the overhead for invoking the system state involves perhaps thousands of instructions.
However, the above described arrangement does not provide consistent measurement overhead.
Thus, the previously known arrangement for measuring performance of short-duration events is problematic.



Examples


Embodiment Construction

[0016]The claims at the end of this application set out novel features which applicants believe are characteristic of the invention. The invention, a preferred mode of use, further objectives and advantages, will best be understood by reference to the following detailed description of an illustrative embodiment read in conjunction with the accompanying drawings.

[0017]Referring to FIG. 1, a block diagram illustrating a computer system 110 is shown, according to an embodiment of the present invention. The system 110 includes a processor 115, a volatile memory 127, e.g., RAM, a keyboard 133, a pointing device 130, e.g., a mouse, a non-volatile memory 129, e.g., ROM, hard disk, floppy disk, CD-ROM, and DVD, and a display device 137 having a display screen. Memory 127 and 129 are for storing program instructions, which are executable by processor 115, to implement various embodiments of a method in accordance with the present invention. Memory 127 or memory 129 are also referred to herein...


Abstract

A value in a counter on a processor is incremented for occurrences of a monitored event, providing a measured value for the event. The value of the counter register for a first thread is saved responsive to a switch from the first thread to a second thread. The value is saved in an accumulator in system memory. Then, responsive to a switch back to the first thread, the value for the first thread is restored from the accumulator. In this way, a counter may be read, and its value, for the first thread, for example, remains consistent despite any intervening thread switches. Since the counter register may be read directly, in the user state, this provides a faster and more consistent way to update performance counts.

Description

BACKGROUND

[0001]1. Field of the Invention

[0002]The present invention relates to performance monitoring of a computer system or of some aspect of a computer system, such as a processor or memory or software running on the system, and, more particularly, to managing counters for such performance monitoring.

[0003]2. Related Art

[0004]According to the IBM AIX operating system, a performance monitor function of the operating system (“OS”) services a performance monitoring API. This servicing includes accessing 64-bit performance monitoring accumulators. (The AIX operating system is a product of, and “AIX” is a trademark of, International Business Machines Corporation.) The accesses to the accumulators are by means of operations in the “system” state since the accumulators are conventionally located in system memory. The Power and PowerPC processor architectures provide a set of 32-bit performance monitor counters. These counters are registers on the Power and PowerPC processors. (Power an...


Application Information

Patent Type & Authority: Patents (United States)
IPC(8): G06F11/30, G06F15/00, G06F17/40, G06F7/00, G06F9/00, G06F9/30, G06F9/38, G06F9/46
CPC: G06F9/30101, G06F9/3851, G06F9/461, G06F11/3409, G06F11/3466, G06F2201/86, G06F2201/88, G06F2201/885
Inventor: JONES, SCOTT THOMAS; LEVINE, FRANK ELIOT; SMOLDERS, LUC RENE; URQUHART, ROBERT JOHN
Owner: IBM CORP