Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 423.37 | 63.18 | 0.00 | 15.35 | 0.00 | 0.00 | 1.51 | 0.04 | 0.00 | 9.28 | 0.72 | 9.93 |
▼Node skylake | 423.37 | 63.18 | 0.00 | 15.35 | 0.00 | 0.00 | 1.51 | 0.04 | 0.00 | 9.28 | 0.72 | 9.93 |
▼Process 671974 | 423.37 | 63.18 | 0.00 | 15.35 | 0.00 | 0.00 | 1.51 | 0.04 | 0.00 | 9.28 | 0.72 | 9.93 |
○Thread 671974 | 423.37 | 63.18 | 0.00 | 15.35 | 0.00 | 0.00 | 1.51 | 0.04 | 0.00 | 9.28 | 0.72 | 9.93 |
▼run_2_threads | 414.21 | 36.39 | 0.00 | 51.30 | 0.00 | 0.00 | 0.88 | 0.02 | 0.00 | 5.33 | 0.37 | 5.72 |
▼Node skylake | 414.21 | 36.39 | 0.00 | 51.30 | 0.00 | 0.00 | 0.88 | 0.02 | 0.00 | 5.33 | 0.37 | 5.72 |
▼Process 672071 | 414.21 | 36.39 | 0.00 | 51.30 | 0.00 | 0.00 | 0.88 | 0.02 | 0.00 | 5.33 | 0.37 | 5.72 |
○Thread 672071 | 414.21 | 32.38 | 0.00 | 45.71 | 0.00 | 0.00 | 1.56 | 0.03 | 0.00 | 9.48 | 0.66 | 10.18 |
○Thread 672134 | 323.24 | 41.52 | 0.00 | 58.48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 362.66 | 23.45 | 0.00 | 68.88 | 0.00 | 0.00 | 0.61 | 0.01 | 0.00 | 3.25 | 0.27 | 3.53 |
▼Node skylake | 362.66 | 23.45 | 0.00 | 68.88 | 0.00 | 0.00 | 0.61 | 0.01 | 0.00 | 3.25 | 0.27 | 3.53 |
▼Process 672167 | 362.66 | 23.45 | 0.00 | 68.88 | 0.00 | 0.00 | 0.61 | 0.01 | 0.00 | 3.25 | 0.27 | 3.53 |
○Thread 672167 | 362.66 | 19.06 | 0.00 | 56.03 | 0.00 | 0.00 | 1.98 | 0.04 | 0.00 | 10.57 | 0.86 | 11.46 |
○Thread 672231 | 271.71 | 25.17 | 0.00 | 74.83 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672232 | 271.79 | 25.41 | 0.00 | 74.59 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672233 | 271.78 | 25.62 | 0.00 | 74.38 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 350.24 | 13.84 | 0.00 | 81.98 | 0.00 | 0.00 | 0.27 | 0.01 | 0.00 | 1.87 | 0.16 | 1.88 |
▼Node skylake | 350.24 | 13.84 | 0.00 | 81.98 | 0.00 | 0.00 | 0.27 | 0.01 | 0.00 | 1.87 | 0.16 | 1.88 |
▼Process 672265 | 350.24 | 13.84 | 0.00 | 81.98 | 0.00 | 0.00 | 0.27 | 0.01 | 0.00 | 1.87 | 0.16 | 1.88 |
○Thread 672265 | 350.24 | 10.82 | 0.00 | 63.30 | 0.00 | 0.00 | 1.69 | 0.03 | 0.00 | 11.56 | 0.97 | 11.62 |
○Thread 672338 | 259.30 | 14.41 | 0.00 | 85.59 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672339 | 259.32 | 14.33 | 0.00 | 85.67 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672340 | 259.30 | 14.37 | 0.00 | 85.63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672341 | 259.34 | 14.57 | 0.00 | 85.43 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672342 | 259.33 | 14.37 | 0.00 | 85.63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672343 | 259.29 | 14.49 | 0.00 | 85.51 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672344 | 259.26 | 14.39 | 0.00 | 85.61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_16_threads | 347.14 | 8.29 | 0.00 | 89.55 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.94 | 0.09 | 0.98 |
▼Node skylake | 347.14 | 8.29 | 0.00 | 89.55 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.94 | 0.09 | 0.98 |
▼Process 672371 | 347.14 | 8.29 | 0.00 | 89.55 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.94 | 0.09 | 0.98 |
○Thread 672371 | 347.14 | 6.32 | 0.00 | 67.61 | 0.00 | 0.00 | 1.75 | 0.02 | 0.00 | 11.35 | 1.08 | 11.87 |
○Thread 672435 | 256.40 | 8.57 | 0.00 | 91.43 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672436 | 256.45 | 8.47 | 0.00 | 91.53 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672437 | 256.43 | 8.65 | 0.00 | 91.35 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672438 | 256.42 | 8.00 | 0.00 | 92.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672439 | 256.43 | 8.44 | 0.00 | 91.56 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672440 | 256.39 | 8.57 | 0.00 | 91.43 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672441 | 256.44 | 8.43 | 0.00 | 91.57 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672442 | 256.45 | 8.54 | 0.00 | 91.46 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672443 | 256.42 | 8.30 | 0.00 | 91.70 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672444 | 256.43 | 8.47 | 0.00 | 91.53 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672445 | 256.43 | 8.58 | 0.00 | 91.42 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672446 | 256.44 | 8.72 | 0.00 | 91.28 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672447 | 256.41 | 8.67 | 0.00 | 91.33 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672448 | 256.42 | 8.42 | 0.00 | 91.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672449 | 256.40 | 8.20 | 0.00 | 91.80 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_26_threads | 347.08 | 6.16 | 0.00 | 92.51 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.58 | 0.04 | 0.63 |
▼Node skylake | 347.08 | 6.16 | 0.00 | 92.51 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.58 | 0.04 | 0.63 |
▼Process 672480 | 347.08 | 6.16 | 0.00 | 92.51 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.58 | 0.04 | 0.63 |
○Thread 672480 | 347.08 | 4.91 | 0.00 | 69.03 | 0.00 | 0.00 | 1.75 | 0.03 | 0.00 | 11.23 | 0.87 | 12.18 |
○Thread 672544 | 256.45 | 6.27 | 0.00 | 93.73 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672545 | 256.52 | 6.28 | 0.00 | 93.72 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672546 | 256.54 | 6.29 | 0.00 | 93.71 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672547 | 256.48 | 6.01 | 0.00 | 93.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672548 | 256.52 | 6.44 | 0.00 | 93.56 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672549 | 256.01 | 6.27 | 0.00 | 93.73 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672550 | 256.47 | 6.24 | 0.00 | 93.76 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672551 | 256.50 | 6.27 | 0.00 | 93.73 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672552 | 256.48 | 6.21 | 0.00 | 93.79 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672553 | 256.46 | 6.02 | 0.00 | 93.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672554 | 256.50 | 6.14 | 0.00 | 93.86 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672555 | 256.53 | 6.36 | 0.00 | 93.64 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672556 | 256.52 | 6.15 | 0.00 | 93.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672557 | 256.51 | 6.26 | 0.00 | 93.74 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672558 | 256.50 | 6.31 | 0.00 | 93.69 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672559 | 256.38 | 6.32 | 0.00 | 93.68 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672560 | 256.47 | 6.30 | 0.00 | 93.70 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672561 | 256.53 | 6.38 | 0.00 | 93.62 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672562 | 256.52 | 6.04 | 0.00 | 93.96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672563 | 256.52 | 6.16 | 0.00 | 93.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672564 | 256.47 | 6.11 | 0.00 | 93.89 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672565 | 256.49 | 6.06 | 0.00 | 93.94 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672566 | 256.51 | 6.33 | 0.00 | 93.67 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672567 | 256.51 | 6.20 | 0.00 | 93.80 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○Thread 672568 | 256.50 | 6.15 | 0.00 | 93.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
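The Process, Node and run rows appear to be time-weighted averages of the per-thread rows beneath them, which is why the run-level Binary share sits between the master thread's value and the workers' values. The sketch below checks this reading against run_2_threads; the weighting rule is an inference from the numbers in the table, not something the report states, and `weighted_coverage` is a hypothetical helper name.

```python
# Per-thread rows for run_2_threads, copied from the table above:
# (thread id, time in seconds, Binary %, OMP %)
THREADS_RUN_2 = [
    (672071, 414.21, 32.38, 45.71),
    (672134, 323.24, 41.52, 58.48),
]


def weighted_coverage(rows, column):
    """Time-weighted mean of one percentage column across threads.

    Assumption: each thread's percentages are relative to that thread's
    own time, so threads are weighted by their time when aggregated.
    """
    total_time = sum(row[1] for row in rows)
    weighted = sum(row[1] * row[column] for row in rows)
    return weighted / total_time


binary_pct = weighted_coverage(THREADS_RUN_2, 2)
omp_pct = weighted_coverage(THREADS_RUN_2, 3)
print(f"Binary: {binary_pct:.2f}%  OMP: {omp_pct:.2f}%")
```

Both results land within rounding distance of the Process 672071 row (36.39 and 51.30).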
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) | Pthread (%) | String (%) | Memory (%) | Others (%) |
---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 63.18 | 15.35 | 1.51 | 0.04 | 9.28 | 0.72 | 9.93 |
run_2_threads | 2 | 36.39 | 51.3 | 0.88 | 0.02 | 5.33 | 0.37 | 5.72 |
run_4_threads | 4 | 23.45 | 68.88 | 0.61 | 0.01 | 3.25 | 0.27 | 3.53 |
run_8_threads | 8 | 13.84 | 81.98 | 0.27 | 0.01 | 1.87 | 0.16 | 1.88 |
run_16_threads | 16 | 8.29 | 89.55 | 0.14 | 0 | 0.94 | 0.09 | 0.98 |
run_26_threads | 26 | 6.16 | 92.51 | 0.09 | 0 | 0.58 | 0.04 | 0.63 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) | Pthread (s) | String (s) | Memory (s) | Others (s) |
---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 423.37 | 267.47 | 65 | 6.38 | 0.17 | 39.27 | 3.04 | 42.03 |
run_2_threads | 2 | 414.21 | 150.73 | 212.5 | 3.64 | 0.08 | 22.06 | 1.53 | 23.68 |
run_4_threads | 4 | 362.66 | 85.04 | 249.8 | 2.21 | 0.04 | 11.8 | 0.96 | 12.8 |
run_8_threads | 8 | 350.24 | 48.46 | 287.12 | 0.96 | 0.02 | 6.55 | 0.55 | 6.58 |
run_16_threads | 16 | 347.14 | 28.78 | 310.87 | 0.5 | 0.01 | 3.26 | 0.31 | 3.41 |
run_26_threads | 26 | 347.08 | 21.37 | 321.07 | 0.31 | 0.01 | 2 | 0.15 | 2.17 |
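The per-category seconds in this table appear to be the run's total time multiplied by the category's coverage percentage from the coverage table. A minimal sketch of that conversion follows, using the Binary and OMP columns only; the relationship is inferred from the figures, and `category_seconds` is a hypothetical helper name.

```python
# (run, total time in s, Binary %, OMP %) copied from the two tables above
RUNS = [
    ("run_1_thread", 423.37, 63.18, 15.35),
    ("run_2_threads", 414.21, 36.39, 51.30),
    ("run_4_threads", 362.66, 23.45, 68.88),
    ("run_8_threads", 350.24, 13.84, 81.98),
    ("run_16_threads", 347.14, 8.29, 89.55),
    ("run_26_threads", 347.08, 6.16, 92.51),
]


def category_seconds(total_s, coverage_pct):
    """Convert a coverage percentage into seconds of the run's total time."""
    return total_s * coverage_pct / 100.0


for name, total_s, binary_pct, omp_pct in RUNS:
    binary_s = category_seconds(total_s, binary_pct)
    omp_s = category_seconds(total_s, omp_pct)
    print(f"{name:15s} Binary {binary_s:7.2f} s   OMP {omp_s:7.2f} s")
```

Read this way, the time spent in the application binary drops from roughly 267 s to 21 s as threads are added, while the time attributed to the OpenMP runtime grows from 65 s to about 321 s, so the total barely improves.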
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.51 |
run_4_threads | 4 | 0.29 |
run_8_threads | 8 | 0.15 |
run_16_threads | 16 | 0.08 |
run_26_threads | 26 | 0.05 |
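The efficiency values are consistent with the usual definition, speedup over the single-thread run divided by the thread count: E(n) = T(1) / (n × T(n)). The sketch below recomputes them from the total times reported earlier; the formula is assumed from that match rather than taken from the tool's documentation.

```python
# Total time in seconds per thread count, from the time-per-category table
WALL_TIMES = {1: 423.37, 2: 414.21, 4: 362.66, 8: 350.24, 16: 347.14, 26: 347.08}


def speedup(threads):
    """Speedup relative to the single-thread run."""
    return WALL_TIMES[1] / WALL_TIMES[threads]


def efficiency(threads):
    """Parallel efficiency: speedup divided by thread count (ideal is 1)."""
    return speedup(threads) / threads


for n in sorted(WALL_TIMES):
    print(f"{n:2d} threads: speedup {speedup(n):.2f}, efficiency {efficiency(n):.2f}")
```

The speedup never exceeds about 1.22, even at 26 threads, which is why the efficiency falls almost in proportion to 1/n.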
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 50.88 | 0 | 0 | 0 | 0.34 | 0.45 | 0.57 | 42.84 | 4.92 | 0 |
run_4_threads | 4 | 68.66 | 0 | 0 | 0 | 0 | 0.06 | 2.85 | 1.45 | 26.73 | 0.23 | 0 |
run_8_threads | 8 | 81.86 | 0 | 0 | 0 | 1.64 | 0.04 | 0.22 | 15.63 | 0.48 | 0.12 | 0 |
run_16_threads | 16 | 89.49 | 0 | 1.38 | 0 | 0 | 0.08 | 7.02 | 1.88 | 0.11 | 0.04 | 0.01 |
run_26_threads | 26 | 92.46 | 1.47 | 0 | 0 | 0.01 | 4.66 | 0.13 | 1.22 | 0.02 | 0.02 | 0.01 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 50.88 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 49.12 | 0 |
run_4_threads | 4 | 0 | 0 | 68.66 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31.33 | 0 |
run_8_threads | 8 | 0 | 0 | 81.86 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18.14 | 0 |
run_16_threads | 16 | 0 | 0 | 89.49 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 10.51 | 0 |
run_26_threads | 26 | 0 | 92.46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 7.53 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_26_threads |
---|---|---|---|---|---|---|
/usr/lib/ld-linux-x86-64.so.2 | | | | | | |
/usr/lib/libc.so.6 | | | | | | |
/usr/lib/libgcc_s.so.1 | | | | | | |
/usr/lib/libgomp.so.1.0.0 | | | | | | |
/usr/lib/libm.so.6 | | | | | | |
/usr/lib/libstdc++.so.6.0.34 | | | | | | |