If you want to evaluate/tune performance of an object, you can measure the exact number of cpu cycles used by a single object by taking a difference of two "patch/cyclecounter" objects.
Keep the execution order rules in mind!
TIP: Benchmarking
johannes
#1
DrJustice
#2
Nice one!
I immediately adopted that. I also added capture of the maximum cycle count :