Conference paper

Next-generation performance counters: Towards monitoring over thousand concurrent events

View publication


We present a novel performance monitor architecture, implemented in the Blue Gene/P™ supercomputer. This performance monitor supports the tracking of a large number of concurrent events by using a hybrid counter architecture. The counters have their low order data implemented in registers which are concurrently updated, while the high order counter data is maintained in a dense SRAM array that is updated from the registers on a regular basis. The performance monitoring architecture includes support for perevent thresholding and fast event notification, using a twophase interrupt-arming and triggering protocol. A first implementation provides 256 concurrent 64b counters which offers an up to 64x increase in counter number compared to performance monitors typically found in microprocessors today, and thereby dramatically expands the capabilities of counter-based performance tuning. © 2008 IEEE.