I am looking for a Linux alternative to the Windows high-resolution performance counter API, and the following API functions in particular:
Thanks.
I am looking for a Linux alternative to the Windows high-resolution performance counter API, and the following API functions in particular:
Thanks.
See clock_gettime() with CLOCK_MONOTONIC_RAW flag, and clock_getres().
Here is also an example of how to use it:
stopwatch with C99 support you have to use gcc flag -std=gnu99 instead of -std=c99 for it to work. Just adding because I worked a hour on this :)The perf tool, which has been provided with the kernel for some time, now, probably answers your needs. It has a s*load of options, so study it carefully ;)
EDIT: forget it, I thought you were talking about CPU performance counters.
Linux perf_event_open system call
This system call exposes several performance counters in an arch agnostic manner.
man perf_event_open documents the available counters, and it includes all the most basic things you'd expect:
config = PERF_COUNT_HW_CPU_CYCLES)type = PERF_TYPE_HW_CACHE)config = PERF_COUNT_HW_BRANCH_MISSES)PERF_COUNT_SW_PAGE_FAULTS) and context switches (PERF_COUNT_SW_CONTEXT_SWITCHES)I have given an for the cycle counts at: How to get the CPU cycle count in x86_64 from C++?
perf_event_open.c
#include <asm/unistd.h> #include <linux/perf_event.h> #include <stdio.h> #include <stdlib.h> #include <string.h> #include <sys/ioctl.h> #include <unistd.h> #include <inttypes.h> static long perf_event_open(struct perf_event_attr *hw_event, pid_t pid, int cpu, int group_fd, unsigned long flags) { int ret; ret = syscall(__NR_perf_event_open, hw_event, pid, cpu, group_fd, flags); return ret; } int main(int argc, char **argv) { struct perf_event_attr pe; long long count; int fd; uint64_t n; if (argc > 1) { n = strtoll(argv[1], NULL, 0); } else { n = 10000; } memset(&pe, 0, sizeof(struct perf_event_attr)); pe.type = PERF_TYPE_HARDWARE; pe.size = sizeof(struct perf_event_attr); pe.config = PERF_COUNT_HW_CPU_CYCLES; pe.disabled = 1; pe.exclude_kernel = 1; // Don't count hypervisor events. pe.exclude_hv = 1; fd = perf_event_open(&pe, 0, -1, -1, 0); if (fd == -1) { fprintf(stderr, "Error opening leader %llx\n", pe.config); exit(EXIT_FAILURE); } ioctl(fd, PERF_EVENT_IOC_RESET, 0); ioctl(fd, PERF_EVENT_IOC_ENABLE, 0); /* Loop n times, should be good enough for -O0. */ __asm__ ( "1:;\n" "sub $1, %[n];\n" "jne 1b;\n" : [n] "+r" (n) : : ); ioctl(fd, PERF_EVENT_IOC_DISABLE, 0); read(fd, &count, sizeof(long long)); printf("%lld\n", count); close(fd); }