Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 297.88 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 297.88 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 194644 | 297.88 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 194644) | 297.88 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 152.79 | 98.77 | 0.00 | 1.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 152.79 | 98.77 | 0.00 | 1.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 194689 | 152.79 | 98.77 | 0.00 | 1.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 194689) | 152.55 | 97.54 | 0.00 | 2.46 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 194705) | 152.79 | 99.99 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 77.56 | 97.01 | 0.00 | 2.97 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 77.56 | 97.01 | 0.00 | 2.97 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 194731 | 77.56 | 97.01 | 0.00 | 2.97 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 194731) | 77.33 | 94.72 | 0.00 | 5.28 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 194747) | 77.44 | 96.90 | 0.00 | 3.06 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 194748) | 77.56 | 99.91 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 194749) | 77.41 | 96.52 | 0.00 | 3.46 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 39.84 | 95.41 | 0.00 | 4.57 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 39.84 | 95.41 | 0.00 | 4.57 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 194772 | 39.84 | 95.41 | 0.00 | 4.57 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 194772) | 39.72 | 92.60 | 0.00 | 7.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 194788) | 39.73 | 93.71 | 0.00 | 6.23 | 0.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 194789) | 39.73 | 92.36 | 0.00 | 7.62 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 194790) | 39.81 | 98.38 | 0.00 | 1.62 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 194791) | 39.84 | 99.50 | 0.00 | 0.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 194792) | 39.83 | 97.07 | 0.00 | 2.93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 194793) | 39.82 | 98.67 | 0.00 | 1.33 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 194794) | 39.79 | 90.98 | 0.00 | 8.99 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 32.35 | 94.95 | 0.00 | 5.02 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 32.35 | 94.95 | 0.00 | 5.02 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 194814 | 32.35 | 94.95 | 0.00 | 5.02 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 194814) | 32.18 | 90.12 | 0.00 | 9.88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 194830) | 32.33 | 95.11 | 0.00 | 4.84 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 194831) | 32.17 | 91.82 | 0.00 | 8.12 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 194832) | 32.25 | 94.56 | 0.00 | 5.33 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 194833) | 32.34 | 97.29 | 0.00 | 2.71 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 194834) | 32.28 | 98.08 | 0.00 | 1.92 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 194835) | 32.35 | 98.25 | 0.00 | 1.75 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 194836) | 32.26 | 97.34 | 0.00 | 2.66 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 194837) | 32.20 | 93.38 | 0.00 | 6.57 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 194838) | 32.26 | 93.55 | 0.00 | 6.45 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
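The run-level OMP (%) figures in the table above appear to sit close to the plain mean of the per-thread values. That is an observation from the numbers, not a documented definition. The sketch below recomputes the mean and the spread between the most and least affected thread. The values are copied from the table; `summarize` and `summary` are illustrative names, not part of the profiler.

```python
from statistics import mean

# Per-thread OMP (%) values copied from the table above.
omp_pct = {
    "run_2_threads": [2.46, 0.01],
    "run_4_threads": [5.28, 3.06, 0.09, 3.46],
    "run_8_threads": [7.40, 6.23, 7.62, 1.62, 0.50, 2.93, 1.33, 8.99],
    "run_10_threads": [9.88, 4.84, 8.12, 5.33, 2.71, 1.92, 1.75, 2.66, 6.57, 6.45],
}


def summarize(values):
    """Return (mean, spread) of per-thread percentages, spread = max - min."""
    return mean(values), max(values) - min(values)


summary = {run: summarize(values) for run, values in omp_pct.items()}

for run, (avg, spread) in summary.items():
    print(f"{run}: mean OMP = {avg:.2f}%, spread = {spread:.2f} points")
```

A large spread relative to the mean suggests that OpenMP runtime time is unevenly distributed across threads, which may be worth checking against the per-function view.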
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | Pthread (%) |
---|---|---|---|---|
run_1_thread | 1 | 100 | 0 | 0 |
run_2_threads | 2 | 98.77 | 1.23 | 0 |
run_4_threads | 4 | 97.01 | 2.97 | 0.01 |
run_8_threads | 8 | 95.41 | 4.57 | 0.01 |
run_10_threads | 10 | 94.95 | 5.02 | 0.03 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | Pthread (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 297.88 | 297.88 | 0 | 0 |
run_2_threads | 2 | 152.79 | 150.91 | 1.88 | 0 |
run_4_threads | 4 | 77.56 | 75.24 | 2.31 | 0.01 |
run_8_threads | 8 | 39.84 | 38.02 | 1.82 | 0.01 |
run_10_threads | 10 | 32.35 | 30.72 | 1.62 | 0.01 |
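The seconds in this table look consistent with the percentages in the coverage table, assuming each category time is simply total time multiplied by the category's coverage. The sketch below checks that assumption against the reported values. `category_seconds` and `max_diff` are illustrative names, and small differences are expected because the report rounds to two decimals.

```python
# (total_s, binary_pct, omp_pct, binary_s, omp_s) copied from the two tables.
rows = {
    "run_1_thread": (297.88, 100.0, 0.0, 297.88, 0.0),
    "run_2_threads": (152.79, 98.77, 1.23, 150.91, 1.88),
    "run_4_threads": (77.56, 97.01, 2.97, 75.24, 2.31),
    "run_8_threads": (39.84, 95.41, 4.57, 38.02, 1.82),
    "run_10_threads": (32.35, 94.95, 5.02, 30.72, 1.62),
}


def category_seconds(total_s, pct):
    """Assumed relation: time in a category = total time * coverage / 100."""
    return total_s * pct / 100.0


max_diff = 0.0
for run, (total_s, binary_pct, omp_pct, binary_s, omp_s) in rows.items():
    diff_binary = abs(category_seconds(total_s, binary_pct) - binary_s)
    diff_omp = abs(category_seconds(total_s, omp_pct) - omp_s)
    max_diff = max(max_diff, diff_binary, diff_omp)
    print(f"{run}: binary diff = {diff_binary:.3f} s, OMP diff = {diff_omp:.3f} s")

print(f"largest difference: {max_diff:.3f} s")
```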
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.75 |
run_4_threads | 4 | 0.51 |
run_8_threads | 8 | 0.31 |
run_10_threads | 10 | 0.26 |
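This report does not state how the Efficiency column is derived. As a cross-check, the sketch below computes the textbook wall-clock speedup `T1 / Tn` and efficiency `T1 / (n * Tn)` from the Total Time (s) values reported under Time per Category. The results need not match the column above, which may be based on a different quantity. `scaling` is an illustrative helper, not a profiler function.

```python
# Total Time (s) per thread count, copied from the Time per Category table.
total_time_s = {1: 297.88, 2: 152.79, 4: 77.56, 8: 39.84, 10: 32.35}


def scaling(times):
    """Map thread count -> (speedup, efficiency) relative to the 1-thread run."""
    t1 = times[1]
    return {n: (t1 / t, t1 / (n * t)) for n, t in times.items()}


result = scaling(total_time_s)

for n, (speedup, efficiency) in sorted(result.items()):
    print(f"{n:>2} threads: speedup = {speedup:.2f}, efficiency = {efficiency:.3f}")
```

Under this definition, an efficiency near 1 means the wall-clock time shrinks almost in proportion to the thread count.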
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 98.76 | 1.24 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.99 | 3.01 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.37 | 4.63 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88.35 | 11.65 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.24 | 98.76 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3.01 | 96.99 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.63 | 95.37 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11.65 | 88.35 | 0 |
Libraries
- green cell: the library was found while profiling the run
- red cell: the library does not appear in the run's profile
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libomp.so | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/libstdc++.so.6.0.34 | |||||