Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 60.56 | 8.50 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.17 | 0.06 | 33.92 | 3.29 | 39.47 |
▼Node ip-172-31-18-66 | 60.56 | 8.50 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.17 | 0.06 | 33.92 | 3.29 | 39.47 |
▼Process 6438 | 60.56 | 8.50 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.17 | 0.06 | 33.92 | 3.29 | 39.47 |
○OMP # 0 (TID 6438) | 60.56 | 8.50 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.17 | 0.06 | 33.92 | 3.29 | 39.47 |
▼run_2_threads | 60.11 | 8.55 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.14 | 0.08 | 34.45 | 3.22 | 38.95 |
▼Node ip-172-31-18-66 | 60.11 | 8.55 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.14 | 0.08 | 34.45 | 3.22 | 38.95 |
▼Process 6531 | 60.11 | 8.55 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.14 | 0.08 | 34.45 | 3.22 | 38.95 |
○OMP # 0 (TID 6531) | 60.11 | 8.52 | 0.00 | 0.00 | 0.00 | 0.00 | 14.61 | 0.14 | 0.08 | 34.46 | 3.22 | 38.96 |
▼run_4_threads | 60.01 | 8.71 | 0.00 | 0.00 | 0.00 | 0.00 | 13.76 | 0.13 | 0.04 | 34.62 | 3.36 | 39.37 |
▼Node ip-172-31-18-66 | 60.01 | 8.71 | 0.00 | 0.00 | 0.00 | 0.00 | 13.76 | 0.13 | 0.04 | 34.62 | 3.36 | 39.37 |
▼Process 6624 | 60.01 | 8.71 | 0.00 | 0.00 | 0.00 | 0.00 | 13.76 | 0.13 | 0.04 | 34.62 | 3.36 | 39.37 |
○OMP # 0 (TID 6624) | 60.01 | 8.66 | 0.00 | 0.00 | 0.00 | 0.00 | 13.77 | 0.13 | 0.04 | 34.64 | 3.36 | 39.39 |
▼run_8_threads | 60.00 | 8.86 | 0.00 | 0.00 | 0.00 | 0.00 | 14.67 | 0.17 | 0.07 | 34.05 | 3.48 | 38.70 |
▼Node ip-172-31-18-66 | 60.00 | 8.86 | 0.00 | 0.00 | 0.00 | 0.00 | 14.67 | 0.17 | 0.07 | 34.05 | 3.48 | 38.70 |
▼Process 6745 | 60.00 | 8.86 | 0.00 | 0.00 | 0.00 | 0.00 | 14.67 | 0.17 | 0.07 | 34.05 | 3.48 | 38.70 |
○OMP # 0 (TID 6745) | 60.00 | 8.67 | 0.00 | 0.00 | 0.00 | 0.00 | 14.70 | 0.17 | 0.07 | 34.12 | 3.48 | 38.78 |
▼run_16_threads | 60.06 | 9.69 | 0.00 | 0.00 | 0.00 | 0.00 | 13.62 | 0.16 | 0.06 | 34.57 | 3.39 | 38.51 |
▼Node ip-172-31-18-66 | 60.06 | 9.69 | 0.00 | 0.00 | 0.00 | 0.00 | 13.62 | 0.16 | 0.06 | 34.57 | 3.39 | 38.51 |
▼Process 6850 | 60.06 | 9.69 | 0.00 | 0.00 | 0.00 | 0.00 | 13.62 | 0.16 | 0.06 | 34.57 | 3.39 | 38.51 |
○OMP # 0 (TID 6850) | 60.06 | 8.64 | 0.00 | 0.00 | 0.00 | 0.00 | 13.78 | 0.17 | 0.06 | 34.97 | 3.43 | 38.95 |
▼run_32_threads | 59.94 | 12.13 | 0.00 | 0.00 | 0.00 | 0.00 | 14.45 | 0.10 | 0.04 | 32.42 | 2.98 | 37.89 |
▼Node ip-172-31-18-66 | 59.94 | 12.13 | 0.00 | 0.00 | 0.00 | 0.00 | 14.45 | 0.10 | 0.04 | 32.42 | 2.98 | 37.89 |
▼Process 6960 | 59.94 | 12.13 | 0.00 | 0.00 | 0.00 | 0.00 | 14.45 | 0.10 | 0.04 | 32.42 | 2.98 | 37.89 |
○OMP # 0 (TID 6960) | 59.94 | 8.52 | 0.00 | 0.00 | 0.00 | 0.00 | 15.04 | 0.10 | 0.04 | 33.75 | 3.10 | 39.45 |
▼run_48_threads | 59.96 | 16.13 | 0.00 | 0.00 | 0.00 | 0.00 | 13.58 | 0.18 | 0.03 | 31.06 | 3.00 | 36.02 |
▼Node ip-172-31-18-66 | 59.96 | 16.13 | 0.00 | 0.00 | 0.00 | 0.00 | 13.58 | 0.18 | 0.03 | 31.06 | 3.00 | 36.02 |
▼Process 7084 | 59.96 | 16.13 | 0.00 | 0.00 | 0.00 | 0.00 | 13.58 | 0.18 | 0.03 | 31.06 | 3.00 | 36.02 |
○OMP # 0 (TID 7084) | 59.96 | 8.55 | 0.00 | 0.00 | 0.00 | 0.00 | 14.81 | 0.20 | 0.03 | 33.87 | 3.27 | 39.27 |
▼run_64_threads | 60.67 | 34.24 | 0.00 | 0.01 | 0.00 | 0.00 | 10.51 | 0.14 | 0.05 | 24.29 | 2.29 | 28.46 |
▼Node ip-172-31-18-66 | 60.67 | 34.24 | 0.00 | 0.01 | 0.00 | 0.00 | 10.51 | 0.14 | 0.05 | 24.29 | 2.29 | 28.46 |
▼Process 7223 | 60.67 | 34.24 | 0.00 | 0.01 | 0.00 | 0.00 | 10.51 | 0.14 | 0.05 | 24.29 | 2.29 | 28.46 |
○OMP # 0 (TID 7223) | 60.67 | 9.24 | 0.00 | 0.01 | 0.00 | 0.00 | 14.51 | 0.20 | 0.07 | 33.53 | 3.16 | 39.29 |
○OMP # 4 (TID 7298) | 0.62 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 14 (TID 7308) | 0.62 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 21 (TID 7315) | 0.65 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 22 (TID 7316) | 0.63 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 28 (TID 7322) | 0.61 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 29 (TID 7323) | 0.65 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 35 (TID 7329) | 0.63 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 42 (TID 7336) | 0.61 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 43 (TID 7337) | 0.65 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 49 (TID 7343) | 0.62 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 50 (TID 7344) | 0.65 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 63 (TID 7357) | 0.64 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of observed threads | Binary (%) | OMP (%) | System (%) | Pthread (%) | IO (%) | String (%) | Memory (%) | Others (%) |
---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 8.5 | 0 | 14.61 | 0.17 | 0.06 | 33.92 | 3.29 | 39.47 |
run_2_threads | 1 | 8.55 | 0 | 14.61 | 0.14 | 0.08 | 34.45 | 3.22 | 38.95 |
run_4_threads | 1 | 8.71 | 0 | 13.76 | 0.13 | 0.04 | 34.62 | 3.36 | 39.37 |
run_8_threads | 1 | 8.86 | 0 | 14.67 | 0.17 | 0.07 | 34.05 | 3.48 | 38.7 |
run_16_threads | 1 | 9.69 | 0 | 13.62 | 0.16 | 0.06 | 34.57 | 3.39 | 38.51 |
run_32_threads | 1 | 12.13 | 0 | 14.45 | 0.1 | 0.04 | 32.42 | 2.98 | 37.89 |
run_48_threads | 1 | 16.13 | 0 | 13.58 | 0.18 | 0.03 | 31.06 | 3 | 36.02 |
run_64_threads | 13 | 34.24 | 0.01 | 10.51 | 0.14 | 0.05 | 24.29 | 2.29 | 28.46 |
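As a quick sanity check on the table above, the category percentages in each row should add up to roughly 100%. The sketch below is a minimal illustration with the values copied by hand from the table; the names `coverage` and `row_total` are hypothetical helpers and are not part of the profiling tool's output.

```python
# Coverage per category (%), copied from the table above.
# Column order: Binary, OMP, System, Pthread, IO, String, Memory, Others.
coverage = {
    "run_1_thread":   [8.5, 0, 14.61, 0.17, 0.06, 33.92, 3.29, 39.47],
    "run_2_threads":  [8.55, 0, 14.61, 0.14, 0.08, 34.45, 3.22, 38.95],
    "run_4_threads":  [8.71, 0, 13.76, 0.13, 0.04, 34.62, 3.36, 39.37],
    "run_8_threads":  [8.86, 0, 14.67, 0.17, 0.07, 34.05, 3.48, 38.7],
    "run_16_threads": [9.69, 0, 13.62, 0.16, 0.06, 34.57, 3.39, 38.51],
    "run_32_threads": [12.13, 0, 14.45, 0.1, 0.04, 32.42, 2.98, 37.89],
    "run_48_threads": [16.13, 0, 13.58, 0.18, 0.03, 31.06, 3, 36.02],
    "run_64_threads": [34.24, 0.01, 10.51, 0.14, 0.05, 24.29, 2.29, 28.46],
}


def row_total(values):
    """Sum the category percentages of one run (hypothetical helper)."""
    return sum(values)


# Print one total per run so that rounding drift is easy to spot.
for run, values in coverage.items():
    print(f"{run:15s} {row_total(values):7.2f}")
```

Every row lands within a few hundredths of 100, which is consistent with each cell being rounded to two decimals.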
Scalability - Time per Category
Detailed Time per Category
Run | Number of observed threads | Total Time (s) | Binary (s) | System (s) | Pthread (s) | IO (s) | String (s) | Memory (s) | Others (s) |
---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 60.56 | 5.14 | 8.84 | 0.1 | 0.03 | 20.54 | 1.99 | 23.9 |
run_2_threads | 1 | 60.11 | 5.14 | 8.78 | 0.08 | 0.05 | 20.71 | 1.93 | 23.41 |
run_4_threads | 1 | 60.01 | 5.23 | 8.26 | 0.08 | 0.02 | 20.78 | 2.01 | 23.63 |
run_8_threads | 1 | 60.01 | 5.31 | 8.8 | 0.1 | 0.04 | 20.43 | 2.09 | 23.22 |
run_16_threads | 1 | 60.06 | 5.82 | 8.18 | 0.1 | 0.03 | 20.76 | 2.04 | 23.13 |
run_32_threads | 1 | 59.94 | 7.27 | 8.66 | 0.06 | 0.02 | 19.43 | 1.79 | 22.71 |
run_48_threads | 1 | 59.95 | 9.67 | 8.14 | 0.11 | 0.02 | 18.62 | 1.8 | 21.59 |
run_64_threads | 13 | 60.67 | 20.78 | 6.38 | 0.09 | 0.03 | 14.73 | 1.39 | 17.27 |
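The per-category times in the table above appear consistent with multiplying each run's total time by the matching coverage percentage. The sketch below checks that assumption for two rows. The relation is inferred from the numbers rather than stated by the report, and `time_per_category` is a hypothetical helper name.

```python
# Values copied from the coverage and time tables of this report.
# Assumption: time (s) = total time (s) * coverage (%) / 100.
runs = {
    "run_1_thread": {
        "total_s": 60.56,
        "coverage_pct": {"Binary": 8.50, "System": 14.61, "Pthread": 0.17,
                         "IO": 0.06, "String": 33.92, "Memory": 3.29,
                         "Others": 39.47},
        "reported_s": {"Binary": 5.14, "System": 8.84, "Pthread": 0.1,
                       "IO": 0.03, "String": 20.54, "Memory": 1.99,
                       "Others": 23.9},
    },
    "run_64_threads": {
        "total_s": 60.67,
        "coverage_pct": {"Binary": 34.24, "System": 10.51, "Pthread": 0.14,
                         "IO": 0.05, "String": 24.29, "Memory": 2.29,
                         "Others": 28.46},
        "reported_s": {"Binary": 20.78, "System": 6.38, "Pthread": 0.09,
                       "IO": 0.03, "String": 14.73, "Memory": 1.39,
                       "Others": 17.27},
    },
}


def time_per_category(total_s, coverage_pct):
    """Convert coverage percentages into seconds (hypothetical helper)."""
    return {cat: total_s * pct / 100.0 for cat, pct in coverage_pct.items()}


# Compare derived seconds with the values reported in the table.
for name, run in runs.items():
    derived = time_per_category(run["total_s"], run["coverage_pct"])
    for cat, seconds in derived.items():
        diff = abs(seconds - run["reported_s"][cat])
        print(f"{name:15s} {cat:8s} derived={seconds:6.2f} "
              f"reported={run['reported_s'][cat]:6.2f} diff={diff:.3f}")
```

For these two rows the derived values agree with the reported ones to within about 0.01 s, so the small mismatches look like rounding in the displayed percentages.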
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 1 | 1.5 |
run_4_threads | 1 | 2 |
run_8_threads | 1 | 2.41 |
run_16_threads | 1 | 2.69 |
run_32_threads | 1 | 2.86 |
run_48_threads | 1 | 2.92 |
run_64_threads | 13 | 0.23 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 1 | 0 | 0 | 0 | 0.02 | 0 | 0.03 | 0 | 0 | 3.28 | 96.62 | 0.04 |
run_4_threads | 1 | 0 | 0.09 | 0.03 | 0 | 0 | 0 | 0 | 0.22 | 1.75 | 97.9 | 0.01 |
run_8_threads | 1 | 0.22 | 0 | 0.03 | 0 | 0 | 0 | 0 | 1.12 | 3.91 | 94.69 | 0.02 |
run_16_threads | 1 | 1.26 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0 | 3.98 | 94.7 | 0.02 |
run_32_threads | 1 | 4.01 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0.18 | 1.92 | 93.8 | 0.05 |
run_48_threads | 1 | 8.47 | 0 | 0.03 | 0.02 | 0 | 0 | 0 | 0.57 | 0.72 | 90.16 | 0.02 |
run_64_threads | 13 | 34.69 | 65.22 | 0.05 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.04 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 1 | 0 | 0 | 0 | 0.02 | 0 | 0.03 | 0 | 0 | 3.28 | 51.95 | 44.72 | 0 |
run_4_threads | 1 | 0 | 0.09 | 0.03 | 0 | 0 | 0 | 0 | 0.22 | 1.75 | 52.29 | 45.62 | 0 |
run_8_threads | 1 | 0.22 | 0 | 0.03 | 0 | 0 | 0 | 0 | 1.12 | 3.91 | 56.41 | 38.3 | 0 |
run_16_threads | 1 | 1.26 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0 | 3.98 | 39.96 | 54.77 | 0 |
run_32_threads | 1 | 4.01 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0.18 | 1.92 | 6.15 | 87.69 | 0 |
run_48_threads | 1 | 8.47 | 0 | 0.03 | 0.02 | 0 | 0 | 0 | 0.57 | 0.72 | 6.59 | 83.59 | 0 |
run_64_threads | 13 | 28.07 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 0.11 | 71.8 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads |
---|---|---|---|---|---|---|---|---|
/opt/arm/gcc-14.2.0_Ubuntu-20.04/lib64/libgcc_s.so.1 | ||||||||
/opt/arm/gcc-14.2.0_Ubuntu-20.04/lib64/libgomp.so.1.0.0 | ||||||||
/opt/arm/gcc-14.2.0_Ubuntu-20.04/lib64/libstdc++.so.6.0.33 | ||||||||
/usr/lib/aarch64-linux-gnu/ld-linux-aarch64.so.1 | ||||||||
/usr/lib/aarch64-linux-gnu/libc.so.6 | ||||||||
/usr/lib/aarch64-linux-gnu/libdl.so.2 | ||||||||
/usr/lib/aarch64-linux-gnu/libm.so.6 | ||||||||
/usr/lib/aarch64-linux-gnu/libpthread.so.0 |