Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 270.46 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 270.46 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 195083 | 270.46 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 195083) | 270.46 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 139.78 | 97.74 | 0.00 | 2.25 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 139.78 | 97.74 | 0.00 | 2.25 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 195131 | 139.78 | 97.74 | 0.00 | 2.25 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 195131) | 139.41 | 95.47 | 0.00 | 4.50 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 195154) | 139.78 | 99.99 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 71.13 | 95.55 | 0.00 | 4.43 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 71.13 | 95.55 | 0.00 | 4.43 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 195180 | 71.13 | 95.55 | 0.00 | 4.43 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 195180) | 70.88 | 92.40 | 0.00 | 7.55 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 195203) | 70.93 | 94.30 | 0.00 | 5.70 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 195204) | 71.13 | 99.85 | 0.00 | 0.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 195205) | 71.00 | 95.62 | 0.00 | 4.34 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 36.94 | 92.56 | 0.00 | 7.40 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 36.94 | 92.56 | 0.00 | 7.40 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 195226 | 36.94 | 92.56 | 0.00 | 7.40 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 195226) | 36.78 | 90.68 | 0.00 | 9.19 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 195249) | 36.85 | 88.23 | 0.00 | 11.73 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 195250) | 36.77 | 86.51 | 0.00 | 13.43 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 195251) | 36.91 | 95.99 | 0.00 | 3.99 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 195252) | 36.94 | 99.34 | 0.00 | 0.64 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 195253) | 36.89 | 94.29 | 0.00 | 5.71 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 195254) | 36.93 | 96.23 | 0.00 | 3.74 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 195255) | 36.81 | 89.14 | 0.00 | 10.82 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 29.93 | 92.22 | 0.00 | 7.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 29.93 | 92.22 | 0.00 | 7.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 195277 | 29.93 | 92.22 | 0.00 | 7.72 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 195277) | 29.63 | 85.91 | 0.00 | 13.87 | 0.00 | 0.00 | 0.22 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 195300) | 29.83 | 93.11 | 0.00 | 6.89 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 195301) | 29.63 | 85.23 | 0.00 | 14.54 | 0.00 | 0.00 | 0.22 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 195302) | 29.77 | 91.98 | 0.00 | 8.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 195303) | 29.83 | 95.11 | 0.00 | 4.89 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 195304) | 29.93 | 97.67 | 0.00 | 2.33 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 195305) | 29.87 | 94.56 | 0.00 | 5.44 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 195306) | 29.87 | 95.34 | 0.00 | 4.66 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 195307) | 29.83 | 89.00 | 0.00 | 10.89 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 195308) | 29.80 | 94.22 | 0.00 | 5.78 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) |
---|---|---|---|---|
run_1_thread | 1 | 100 | 0 | 0 |
run_2_threads | 2 | 97.74 | 2.25 | 0.01 |
run_4_threads | 4 | 95.55 | 4.43 | 0.02 |
run_8_threads | 8 | 92.56 | 7.4 | 0.04 |
run_10_threads | 10 | 92.22 | 7.72 | 0.06 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 270.46 | 270.46 | 0 | 0 |
run_2_threads | 2 | 139.78 | 136.62 | 3.14 | 0.02 |
run_4_threads | 4 | 71.13 | 67.96 | 3.15 | 0.02 |
run_8_threads | 8 | 36.94 | 34.2 | 2.73 | 0.02 |
run_10_threads | 10 | 29.93 | 27.6 | 2.31 | 0.02 |
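The per-category times appear to be the coverage percentages applied to each run's total time. The sketch below reproduces that arithmetic from the values in the two tables above. The function name `category_seconds` and the `runs` dictionary are illustrative choices, not part of the report or of the profiling tool, and the assumption that the tool derives seconds this way is inferred from the numbers only.

```python
# Total time (s) and coverage (%) per run, copied from the tables above.
# Tuple layout (an illustrative choice): (total_s, binary_pct, omp_pct, system_pct)
runs = {
    "run_1_thread": (270.46, 100.00, 0.00, 0.00),
    "run_2_threads": (139.78, 97.74, 2.25, 0.01),
    "run_4_threads": (71.13, 95.55, 4.43, 0.02),
    "run_8_threads": (36.94, 92.56, 7.40, 0.04),
    "run_10_threads": (29.93, 92.22, 7.72, 0.06),
}


def category_seconds(total_s, coverage_pct):
    """Convert a coverage percentage into seconds of the run's total time."""
    return total_s * coverage_pct / 100.0


for name, (total, binary, omp, system) in runs.items():
    print(
        f"{name:15s} "
        f"binary={category_seconds(total, binary):7.2f}s "
        f"omp={category_seconds(total, omp):5.2f}s "
        f"system={category_seconds(total, system):5.2f}s"
    )
```

Comparing the printed values with the table is a quick consistency check: small differences in the last digit are expected because the percentages are themselves rounded.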
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.73 |
run_4_threads | 4 | 0.49 |
run_8_threads | 8 | 0.29 |
run_10_threads | 10 | 0.24 |
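For comparison, the textbook wall-clock speedup and parallel efficiency can be derived from the Total Time column of the time table. This is a minimal sketch assuming the classic definitions, speedup = T(1) / T(n) and efficiency = speedup / n. It is not necessarily the formula behind the Efficiency column above, whose values differ, so the two should not be read as the same metric. The names `speedup`, `efficiency`, and `total_time_s` are illustrative.

```python
# Total wall-clock time (s) per thread count, copied from the
# "Detailed Time per Category" table.
total_time_s = {1: 270.46, 2: 139.78, 4: 71.13, 8: 36.94, 10: 29.93}


def speedup(t_serial, t_parallel):
    """Classic speedup: serial time divided by parallel time."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, n_threads):
    """Classic parallel efficiency: speedup divided by the thread count."""
    return speedup(t_serial, t_parallel) / n_threads


t1 = total_time_s[1]
for n, tn in sorted(total_time_s.items()):
    print(
        f"{n:2d} threads: "
        f"speedup={speedup(t1, tn):5.2f} "
        f"efficiency={efficiency(t1, tn, n):4.2f}"
    )
```

Under these definitions a value close to 1 means the run time shrinks almost in proportion to the thread count.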
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 97.73 | 2.26 |
run_4_threads | 4 | 0 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.53 | 4.45 |
run_8_threads | 8 | 0.04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 92.52 | 7.44 |
run_10_threads | 10 | 11.46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 80.77 | 7.78 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2.27 | 97.73 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 0 | 4.45 | 95.53 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0.04 | 0 | 0 | 0 | 7.44 | 92.52 | 0 |
run_10_threads | 10 | 11.46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.78 | 80.77 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/librt.so.1 | |||||
/usr/lib/libstdc++.so.6.0.34 |