Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 330.57 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 330.57 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 232491 | 330.57 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 232491) | 330.57 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 217.85 | 98.07 | 0.00 | 1.93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 217.85 | 98.07 | 0.00 | 1.93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 232544 | 217.85 | 98.07 | 0.00 | 1.93 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 232544) | 217.77 | 96.15 | 0.00 | 3.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 232562) | 217.85 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 157.90 | 96.15 | 0.00 | 3.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 157.90 | 96.15 | 0.00 | 3.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 232587 | 157.90 | 96.15 | 0.00 | 3.85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 232587) | 157.83 | 93.47 | 0.00 | 6.53 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 232605) | 157.88 | 95.03 | 0.00 | 4.97 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 232606) | 157.90 | 99.95 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 232607) | 157.82 | 96.16 | 0.00 | 3.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 127.72 | 94.91 | 0.00 | 5.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 127.72 | 94.91 | 0.00 | 5.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 232631 | 127.72 | 94.91 | 0.00 | 5.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 232631) | 127.64 | 92.43 | 0.00 | 7.57 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 232649) | 127.72 | 92.10 | 0.00 | 7.90 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 232650) | 127.72 | 92.03 | 0.00 | 7.97 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 232651) | 127.70 | 95.52 | 0.00 | 4.48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 232652) | 127.70 | 99.70 | 0.00 | 0.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 232653) | 127.72 | 97.78 | 0.00 | 2.22 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 232654) | 127.70 | 97.22 | 0.00 | 2.78 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 232655) | 127.70 | 92.48 | 0.00 | 7.52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 121.76 | 93.93 | 0.00 | 6.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 121.76 | 93.93 | 0.00 | 6.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 232679 | 121.76 | 93.93 | 0.00 | 6.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 232679) | 121.63 | 87.89 | 0.00 | 12.11 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 232697) | 121.68 | 94.97 | 0.00 | 5.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 232698) | 121.71 | 89.94 | 0.00 | 10.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 232699) | 121.71 | 93.39 | 0.00 | 6.61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 232700) | 121.71 | 94.22 | 0.00 | 5.78 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 232701) | 121.73 | 98.22 | 0.00 | 1.78 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 232702) | 121.76 | 99.39 | 0.00 | 0.61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 232703) | 121.73 | 96.04 | 0.00 | 3.96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 232704) | 121.68 | 92.69 | 0.00 | 7.31 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 232705) | 121.64 | 92.59 | 0.00 | 7.41 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) |
---|---|---|---|
run_1_thread | 1 | 100 | 0 |
run_2_threads | 2 | 98.07 | 1.93 |
run_4_threads | 4 | 96.15 | 3.85 |
run_8_threads | 8 | 94.91 | 5.09 |
run_10_threads | 10 | 93.93 | 6.07 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) |
---|---|---|---|---|
run_1_thread | 1 | 330.57 | 330.57 | 0 |
run_2_threads | 2 | 217.85 | 213.66 | 4.2 |
run_4_threads | 4 | 157.9 | 151.83 | 6.07 |
run_8_threads | 8 | 127.72 | 121.21 | 6.51 |
run_10_threads | 10 | 121.76 | 114.37 | 7.39 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.76 |
run_4_threads | 4 | 0.52 |
run_8_threads | 8 | 0.32 |
run_10_threads | 10 | 0.27 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 98.07 | 0 | 0 | 1.93 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 96.15 | 0 | 0 | 0 | 0 | 3.85 |
run_8_threads | 8 | 0 | 0 | 0 | 94.9 | 0 | 0 | 0 | 0 | 0 | 0 | 5.1 |
run_10_threads | 10 | 0 | 0 | 93.93 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.07 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Columns Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.93 | 98.07 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3.85 | 96.15 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.1 | 94.9 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.07 | 93.93 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libgomp.so.1.0.0 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libstdc++.so.6.0.34 |