Perf is a very powerful tool, but it is not the best-documented one, largely because the underlying framework changes frequently.

This command records CPU level 1 data cache misses: `sudo perf record -e L1-dcache-load-misses -c 10000 -ag -- sleep 5`. Here `-a` samples all CPUs, `-g` records call graphs, `-c 10000` takes one sample per 10,000 events, and `sleep 5` limits the recording to five seconds.

When we then check the report with `sudo perf report -f`, we see which code is responsible for the misses.

![]()

A plus sign means that the entry is expandable, so you can see which code paths are responsible. I expanded one line in the second image, where we see more detail: one of Skype's functions is responsible for 0.83% of the cache misses in the L1 data cache.

So far we have recorded first and analyzed the record afterwards. But what if you want it in real time, to record and see the output immediately? Read on; the next section covers that.

For easier performance counting in real time, we will use a suite of scripts from GitHub.

![]()

This script gives us the latency of the disk as a histogram. I ran it for just a second; you can let it run longer. I didn't want to, because I have a 5400 rpm HDD and the results just can't be good, so why run it.

![]()

This script uses perf to get cache misses every second and prints each count on a line. Notice that there were no misses during the first 3 seconds, and they started later. That is because I started a rebuild of my project in Android Studio just at that time.

![]()

I can say that Intel made a pretty good branch predictor in Haswell.
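
The per-second cache-miss counting described above can also be approximated with plain `perf stat`, without any extra scripts. The snippet below is only a rough sketch, not the script used in this post: `format_counts` is a made-up helper name, and it assumes the CSV layout that `perf stat -I 1000 -x,` documents (timestamp, counter value, unit, event name, ...), which may differ between perf versions.

```shell
#!/bin/sh
# Sketch: condense `perf stat -I 1000 -x,` CSV output to one count per line.
# Assumed field order (with -I and -x,): timestamp,count,unit,event,...
# format_counts is a hypothetical helper, not part of perf itself.
format_counts() {
    awk -F, '
        # Skip comment lines and rows where the counter could not be read
        # (perf prints text such as "<not counted>" instead of a number).
        /^#/ || $2 !~ /^[0-9]+$/ { next }
        # Print: elapsed seconds, event name, number of events in the interval.
        { printf "%.0fs %s %s\n", $1, $4, $2 }
    '
}

# Intended use (perf stat writes its counts to stderr, hence 2>&1):
#   sudo perf stat -a -I 1000 -x, -e L1-dcache-load-misses sleep 5 2>&1 | format_counts
```

Keeping the formatting in a small function makes it easy to swap the event name for another counter, such as `branch-misses`, without touching the parsing.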