Linked Questions

0 votes
1 answer
120 views

Trying to optimize concurrent linked-list access, I tried to benchmark the average time that dereferencing takes on x86_64 (my specific processor is a Ryzen). While I knew that the nice old days of ...
George Kourtis's user avatar
0 votes
1 answer
56 views

When I test the cycle count of the module, the results of each test are quite different: 1781344 (first test), 1264558 (second test), 1388058 (third test). I use __rdtsc() to record cycles,...
yueluojieying's user avatar
3 votes
0 answers
95 views

I'm following this whitepaper by Intel to benchmark code execution. It uses cpuid to fence the reads of the timestamp registers, which seems to work alright. I'm more interested in the commands ...
IRatherStayPrivate's user avatar
1 vote
1 answer
97 views

I want to write a program that consumes 5 seconds of CPU time, where the time spent off the CPU due to IO, context switches, etc., is not counted towards this 5-second time quota. Currently, I'...
Frontier_Setter's user avatar
0 votes
0 answers
90 views

I have spent a lot of time trying to measure the exact clock cycles of given instructions in a portion of code written in C. However, I could never measure exactly how many cycles they take at runtime. I used PAPI, ...
Elephant88's user avatar
1 vote
0 answers
164 views

I'm interested in how Invariant TSC behaves on a multi-core CPU, on a classic PC with a single physical CPU. The only thing I could find is that its frequency is constant and the same for all CPU ...
lokains's user avatar
  • 137
1 vote
1 answer
61 views

I'm designing a benchmark for a critical system operation. Ideally the benchmark can be used to detect performance regressions. I'm debating between using the total time for a large workload passed ...
Nathan Doromal's user avatar
0 votes
0 answers
80 views

I need to add two memory fences to my code, in order to prevent it from being reordered by either the compiler or the CPU. Like this: rec.time_stamp0 = std::system_clock::now(); std::...
Leon's user avatar
  • 2,165
2 votes
0 answers
50 views

I am currently working on an assembly function that sets a buffer to zero. I am measuring the clock cycles it takes to execute the function. However, I have encountered an issue where the number of ...
Z123's user avatar
  • 41
0 votes
0 answers
57 views

I'm measuring the latency of a store instruction on an x86-64 processor and would like to understand the nuances of timing this instruction. Here’s my setup and the specific questions I have: Setup: I ...
0 votes
0 answers
33 views

Recently, I have been trying to run some performance analysis on my program. I want to measure the latency of some functions in CPU ticks and later convert the delta to nanoseconds. (I intentionally am ...
Hedgehog's user avatar
  • 115
