Run run_2_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_PROC_BIND: closeOMP_NUM_THREADS: 2OMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
---|---|
Run run_4_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_NUM_THREADS: 4OMP_PROC_BIND: closeOMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
Run run_8_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_NUM_THREADS: 8OMP_PROC_BIND: closeOMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
Run run_16_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_NUM_THREADS: 16OMP_PROC_BIND: closeOMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
Run run_26_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_NUM_THREADS: 26OMP_PROC_BIND: closeOMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
Run run_52_threads | Number processes: 1Number nodes: 1Run Command: <executable>MPI Command: Dataset: Run Directory: /home/fmusial/MD_BenchmarksOMP_NUM_THREADS: 52OMP_PROC_BIND: closeOMP_SCHEDULE: staticOMP_PLACES: coresOMP_WAIT_POLICY: active |
Name | Module | Max Thread Time / Walltime run_2_threads (%) | Max Thread Time / Walltime run_4_threads (%) | Max Thread Time / Walltime run_8_threads (%) | Max Thread Time / Walltime run_16_threads (%) | Max Thread Time / Walltime run_26_threads (%) | Max Thread Time / Walltime run_52_threads (%) | Coverage run_2_threads (%) | Coverage run_4_threads (%) | Coverage run_8_threads (%) | Coverage run_16_threads (%) | Coverage run_26_threads (%) | Coverage run_52_threads (%) | Coverage Excluding Loops run_2_threads (%) | Coverage Excluding Loops run_4_threads (%) | Coverage Excluding Loops run_8_threads (%) | Coverage Excluding Loops run_16_threads (%) | Coverage Excluding Loops run_26_threads (%) | Coverage Excluding Loops run_52_threads (%) | Max Inclusive Time Over Threads run_2_threads (s) | Max Inclusive Time Over Threads run_4_threads (s) | Max Inclusive Time Over Threads run_8_threads (s) | Max Inclusive Time Over Threads run_16_threads (s) | Max Inclusive Time Over Threads run_26_threads (s) | Max Inclusive Time Over Threads run_52_threads (s) | Max Exclusive Time Over Threads run_2_threads (s) | Max Exclusive Time Over Threads run_4_threads (s) | Max Exclusive Time Over Threads run_8_threads (s) | Max Exclusive Time Over Threads run_16_threads (s) | Max Exclusive Time Over Threads run_26_threads (s) | Max Exclusive Time Over Threads run_52_threads (s) | Inclusive Time w.r.t. Wall Time run_2_threads (s) | Inclusive Time w.r.t. Wall Time run_4_threads (s) | Inclusive Time w.r.t. Wall Time run_8_threads (s) | Inclusive Time w.r.t. Wall Time run_16_threads (s) | Inclusive Time w.r.t. Wall Time run_26_threads (s) | Inclusive Time w.r.t. Wall Time run_52_threads (s) | Exclusive Time w.r.t. Wall Time run_2_threads (s) | Exclusive Time w.r.t. Wall Time run_4_threads (s) | Exclusive Time w.r.t. Wall Time run_8_threads (s) | Exclusive Time w.r.t. Wall Time run_16_threads (s) | Exclusive Time w.r.t. Wall Time run_26_threads (s) | Exclusive Time w.r.t. Wall Time run_52_threads (s) | Nb Threads run_2_threads | Nb Threads run_4_threads | Nb Threads run_8_threads | Nb Threads run_16_threads | Nb Threads run_26_threads | Nb Threads run_52_threads | Deviation (coverage) run_2_threads | Deviation (coverage) run_4_threads | Deviation (coverage) run_8_threads | Deviation (coverage) run_16_threads | Deviation (coverage) run_26_threads | Deviation (coverage) run_52_threads | Deviation (walltime) run_2_threads | Deviation (walltime) run_4_threads | Deviation (walltime) run_8_threads | Deviation (walltime) run_16_threads | Deviation (walltime) run_26_threads | Deviation (walltime) run_52_threads | Categories run_2_threads | Categories run_4_threads | Categories run_8_threads | Categories run_16_threads | Categories run_26_threads | Categories run_52_threads | Compilation Options | (run_2_threads) Efficiency | (run_2_threads) Potential Speed-Up (%) | (run_4_threads) Efficiency | (run_4_threads) Potential Speed-Up (%) | (run_8_threads) Efficiency | (run_8_threads) Potential Speed-Up (%) | (run_16_threads) Efficiency | (run_16_threads) Potential Speed-Up (%) | (run_26_threads) Efficiency | (run_26_threads) Potential Speed-Up (%) | (run_52_threads) Efficiency | (run_52_threads) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►computeForces(Particles&, CellList const&) [clone ._omp_fn.0] | md-gcc-Ofast | 98.57 | 98.46 | 98.22 | 97.66 | 96.54 | 91.63 | 98.36 | 97.57 | 95.26 | 92.08 | 88.32 | 80.36 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1017.99 | 519.19 | 268.48 | 139.24 | 89.44 | 49.55 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 1015.62 | 514.40 | 260.31 | 131.23 | 81.75 | 43.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 8 | 16 | 26 | 52 | 0.32 | 0.66 | 1.29 | 1.67 | 1.84 | 2.18 | 4.02 | 3.61 | 3.54 | 2.38 | 1.71 | 1.18 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | GNU C++17 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ... | 1 | 0 | 0.99 | 1.25 | 0.98 | 2.34 | 0.97 | 3 | 0.96 | 3.92 | 0.9 | 8.02 |
►Loop 11 - simulation.cpp:145-237 - md-gcc-Ofast [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 98.36 | 97.57 | 95.26 | 92.08 | 88.32 | 80.36 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1019.61 | 521.20 | 269.99 | 140.07 | 89.71 | 50.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1015.62 | 514.40 | 260.31 | 131.23 | 81.75 | 43.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 1 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 12 - simulation.cpp:145-237 - md-gcc-Ofast [...] | 0.03 | 0.03 | 0.03 | 0.04 | 0.05 | 0.05 | 98.36 | 97.57 | 95.26 | 92.08 | 88.32 | 80.36 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 1019.61 | 521.20 | 269.99 | 140.07 | 89.70 | 50.08 | 0.26 | 0.13 | 0.08 | 0.05 | 0.05 | 0.03 | 1015.62 | 514.40 | 260.31 | 131.23 | 81.75 | 43.39 | 0.21 | 0.10 | 0.05 | 0.03 | 0.02 | 0.01 | 2 | 4 | 8 | 16 | 26 | 45 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.07 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0 | 1.06 | -0 | 0.98 | 0 | 0.91 | 0 | 0.8 | 0 | 0.79 | 0 | ||||||||
►Loop 13 - simulation.cpp:158-237 - md-gcc-Ofast [...] | 0.54 | 0.51 | 0.52 | 0.51 | 0.50 | 0.54 | 98.34 | 97.55 | 95.24 | 92.06 | 88.30 | 80.34 | 0.52 | 0.50 | 0.49 | 0.47 | 0.44 | 0.37 | 1019.34 | 521.07 | 269.91 | 140.02 | 89.65 | 50.06 | 5.61 | 2.69 | 1.42 | 0.73 | 0.47 | 0.29 | 1015.41 | 514.30 | 260.25 | 131.20 | 81.73 | 43.38 | 5.42 | 2.64 | 1.34 | 0.67 | 0.41 | 0.20 | 2 | 4 | 8 | 16 | 26 | 52 | 0.03 | 0.02 | 0.02 | 0.03 | 0.04 | 0.07 | 0.28 | 0.08 | 0.06 | 0.05 | 0.04 | 0.04 | 1 | 0 | 1.02 | 0 | 1.01 | 0 | 1.01 | 0 | 1.02 | 0 | 1.03 | 0 | ||||||||
►Loop 14 - simulation.cpp:176-237 - md-gcc-Ofast [...] | 1.25 | 1.37 | 1.36 | 1.42 | 1.32 | 1.10 | 97.82 | 97.05 | 94.75 | 91.59 | 87.86 | 79.96 | 1.23 | 1.34 | 1.31 | 1.24 | 1.14 | 0.93 | 1013.73 | 518.38 | 268.49 | 139.29 | 89.19 | 49.77 | 12.86 | 7.25 | 3.72 | 2.02 | 1.22 | 0.59 | 1009.99 | 511.65 | 258.91 | 130.53 | 81.32 | 43.18 | 12.74 | 7.09 | 3.59 | 1.77 | 1.05 | 0.50 | 2 | 4 | 8 | 16 | 26 | 52 | 0.02 | 0.03 | 0.03 | 0.07 | 0.08 | 0.09 | 0.18 | 0.16 | 0.07 | 0.10 | 0.07 | 0.05 | 1 | 0 | 0.9 | 0.14 | 0.89 | 0.15 | 0.9 | 0.12 | 0.93 | 0.08 | 0.97 | 0.03 | ||||||||
►Loop 15 - stl_vector.h:1119-1283 - md-gcc-Ofast [...] | 46.43 | 46.03 | 46.32 | 46.22 | 44.66 | 40.81 | 96.58 | 95.70 | 93.43 | 90.35 | 86.72 | 79.03 | 46.28 | 45.58 | 44.26 | 42.25 | 40.01 | 34.13 | 1000.87 | 511.13 | 264.77 | 137.27 | 87.97 | 49.17 | 479.49 | 242.75 | 126.61 | 65.90 | 41.38 | 22.06 | 997.25 | 504.56 | 255.32 | 128.76 | 80.27 | 42.67 | 477.83 | 240.32 | 120.95 | 60.21 | 37.03 | 18.43 | 2 | 4 | 8 | 16 | 26 | 52 | 0.23 | 0.32 | 0.86 | 1.14 | 1.04 | 1.19 | 2.66 | 1.74 | 2.37 | 1.63 | 0.96 | 0.63 | 1 | 0 | 0.99 | 0.27 | 0.99 | 0.54 | 0.99 | 0.34 | 0.99 | 0.3 | 1 | 0.09 | ||||||||
○Loop 17 - simulation.cpp:220-220 - md-gcc-Ofast | 9.58 | 9.57 | 9.49 | 9.19 | 9.33 | 8.17 | 9.56 | 9.53 | 9.31 | 8.87 | 8.49 | 7.52 | 9.56 | 9.53 | 9.31 | 8.87 | 8.49 | 7.52 | 98.90 | 50.44 | 25.94 | 13.11 | 8.65 | 4.42 | 98.90 | 50.44 | 25.94 | 13.11 | 8.65 | 4.42 | 98.71 | 50.23 | 25.45 | 12.65 | 7.85 | 4.06 | 98.71 | 50.23 | 25.45 | 12.65 | 7.85 | 4.06 | 2 | 4 | 8 | 16 | 26 | 52 | 0.03 | 0.06 | 0.16 | 0.17 | 0.29 | 0.30 | 0.33 | 0.29 | 0.45 | 0.25 | 0.27 | 0.16 | 1 | 0 | 0.98 | 0.17 | 0.97 | 0.28 | 0.98 | 0.22 | 0.97 | 0.28 | 0.94 | 0.49 | ||||||||
○Loop 19 - simulation.cpp:229-229 - md-gcc-Ofast | 5.15 | 5.28 | 5.47 | 5.33 | 5.36 | 7.40 | 5.14 | 5.15 | 5.28 | 5.09 | 4.91 | 6.39 | 5.14 | 5.15 | 5.28 | 5.09 | 4.91 | 6.39 | 53.15 | 27.82 | 14.96 | 7.61 | 4.97 | 4.00 | 53.15 | 27.82 | 14.96 | 7.61 | 4.97 | 4.00 | 53.02 | 27.18 | 14.44 | 7.26 | 4.54 | 3.45 | 53.02 | 27.18 | 14.44 | 7.26 | 4.54 | 3.45 | 2 | 4 | 8 | 16 | 26 | 52 | 0.02 | 0.09 | 0.12 | 0.12 | 0.17 | 0.60 | 0.21 | 0.47 | 0.32 | 0.17 | 0.16 | 0.32 | 1 | 0 | 0.98 | 0.13 | 0.92 | 0.43 | 0.91 | 0.44 | 0.9 | 0.5 | 0.59 | 2.61 | ||||||||
○Loop 18 - simulation.cpp:224-224 - md-gcc-Ofast | 9.59 | 9.52 | 9.22 | 9.21 | 9.21 | 7.79 | 9.53 | 9.37 | 9.04 | 8.78 | 8.40 | 7.23 | 9.53 | 9.37 | 9.04 | 8.78 | 8.40 | 7.23 | 99.02 | 50.20 | 25.19 | 13.12 | 8.53 | 4.21 | 99.02 | 50.20 | 25.19 | 13.12 | 8.53 | 4.21 | 98.37 | 49.40 | 24.69 | 12.51 | 7.77 | 3.90 | 98.37 | 49.40 | 24.69 | 12.51 | 7.77 | 3.90 | 2 | 4 | 8 | 16 | 26 | 52 | 0.09 | 0.12 | 0.15 | 0.19 | 0.24 | 0.23 | 0.98 | 0.63 | 0.41 | 0.27 | 0.23 | 0.12 | 1 | 0 | 1 | 0.04 | 1 | 0.04 | 0.98 | 0.15 | 0.97 | 0.22 | 0.97 | 0.22 | ||||||||
○Loop 16 - simulation.cpp:216-216 - md-gcc-Ofast | 6.48 | 6.45 | 6.26 | 6.44 | 6.23 | 7.26 | 6.42 | 6.40 | 6.08 | 6.17 | 5.88 | 6.30 | 6.42 | 6.40 | 6.08 | 6.17 | 5.88 | 6.30 | 66.90 | 34.03 | 17.12 | 9.18 | 5.78 | 3.92 | 66.90 | 34.03 | 17.12 | 9.18 | 5.78 | 3.92 | 66.29 | 33.73 | 16.63 | 8.79 | 5.44 | 3.40 | 66.29 | 33.73 | 16.63 | 8.79 | 5.44 | 3.40 | 2 | 4 | 8 | 16 | 26 | 52 | 0.08 | 0.04 | 0.11 | 0.18 | 0.20 | 0.31 | 0.91 | 0.24 | 0.30 | 0.26 | 0.18 | 0.17 | 1 | 0 | 0.98 | 0.11 | 1 | 0.02 | 0.94 | 0.36 | 0.94 | 0.37 | 0.75 | 1.58 | ||||||||
○Loop 20 - simulation.cpp:233-233 - md-gcc-Ofast | 9.84 | 9.96 | 9.92 | 10.02 | 10.14 | 9.88 | 9.80 | 9.79 | 9.61 | 9.66 | 9.49 | 8.96 | 9.80 | 9.79 | 9.61 | 9.66 | 9.49 | 8.96 | 101.65 | 52.52 | 27.12 | 14.29 | 9.39 | 5.34 | 101.65 | 52.52 | 27.12 | 14.29 | 9.39 | 5.34 | 101.21 | 51.63 | 26.27 | 13.77 | 8.78 | 4.84 | 101.21 | 51.63 | 26.27 | 13.77 | 8.78 | 4.84 | 2 | 4 | 8 | 16 | 26 | 52 | 0.07 | 0.15 | 0.16 | 0.19 | 0.26 | 0.30 | 0.68 | 0.78 | 0.43 | 0.27 | 0.24 | 0.16 | 1 | 0 | 0.98 | 0.19 | 0.96 | 0.35 | 0.92 | 0.78 | 0.89 | 1.08 | 0.8 | 1.75 | ||||||||
○Loop 21 - simulation.cpp:237-237 - md-gcc-Ofast | 9.85 | 10.12 | 10.18 | 9.86 | 10.01 | 9.64 | 9.86 | 9.88 | 9.84 | 9.53 | 9.55 | 8.51 | 9.86 | 9.88 | 9.84 | 9.53 | 9.55 | 8.51 | 101.77 | 53.36 | 27.83 | 14.05 | 9.28 | 5.21 | 101.77 | 53.36 | 27.83 | 14.05 | 9.28 | 5.21 | 101.81 | 52.08 | 26.90 | 13.59 | 8.84 | 4.60 | 101.81 | 52.08 | 26.90 | 13.59 | 8.84 | 4.60 | 2 | 4 | 8 | 16 | 26 | 52 | 0.01 | 0.17 | 0.15 | 0.15 | 0.25 | 0.36 | 0.00 | 0.91 | 0.42 | 0.21 | 0.23 | 0.20 | 1 | 0 | 0.98 | 0.22 | 0.95 | 0.53 | 0.94 | 0.6 | 0.89 | 1.09 | 0.85 | 1.26 | ||||||||
►velocityVerlet(Particles&, CellList&, int, int) [clone ._omp_fn.0] | md-gcc-Ofast | 1.14 | 1.14 | 1.10 | 1.09 | 1.07 | 1.25 | 1.14 | 1.12 | 1.08 | 1.04 | 1.00 | 0.95 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 11.81 | 6.01 | 3.01 | 1.55 | 0.99 | 0.67 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 11.80 | 5.92 | 2.96 | 1.48 | 0.93 | 0.51 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.02 | 0.01 | 0.03 | 0.03 | 0.16 | 0.03 | 0.08 | 0.03 | 0.05 | 0.03 | 0.08 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | GNU C++17 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.98 | 0.02 | 0.88 | 0.11 |
►Loop 35 - simulation.cpp:323-335 - md-gcc-Ofast | 0.30 | 0.30 | 0.31 | 0.36 | 0.36 | 0.47 | 1.14 | 1.12 | 1.08 | 1.04 | 1.00 | 0.95 | 0.29 | 0.28 | 0.28 | 0.29 | 0.27 | 0.32 | 12.03 | 6.19 | 3.29 | 1.74 | 1.19 | 0.82 | 3.12 | 1.57 | 0.83 | 0.51 | 0.33 | 0.25 | 11.80 | 5.92 | 2.96 | 1.48 | 0.93 | 0.51 | 3.01 | 1.49 | 0.78 | 0.41 | 0.25 | 0.17 | 2 | 4 | 8 | 16 | 26 | 52 | 0.02 | 0.02 | 0.02 | 0.04 | 0.04 | 0.07 | 0.17 | 0.10 | 0.05 | 0.05 | 0.04 | 0.04 | 1 | 0 | 1.01 | -0 | 0.97 | 0.01 | 0.91 | 0.03 | 0.93 | 0.02 | 0.66 | 0.11 | ||||||||
○Loop 32 - simulation.cpp:333-333 - md-gcc-Ofast | 0.41 | 0.42 | 0.44 | 0.40 | 0.44 | 0.50 | 0.40 | 0.41 | 0.39 | 0.36 | 0.35 | 0.30 | 0.40 | 0.41 | 0.39 | 0.36 | 0.35 | 0.30 | 4.22 | 2.23 | 1.21 | 0.57 | 0.41 | 0.27 | 4.22 | 2.23 | 1.21 | 0.57 | 0.41 | 0.27 | 4.14 | 2.14 | 1.07 | 0.51 | 0.32 | 0.16 | 4.14 | 2.14 | 1.07 | 0.51 | 0.32 | 0.16 | 2 | 4 | 8 | 16 | 26 | 52 | 0.01 | 0.01 | 0.03 | 0.02 | 0.04 | 0.07 | 0.11 | 0.08 | 0.09 | 0.03 | 0.04 | 0.04 | 1 | 0 | 0.97 | 0.01 | 0.97 | 0.01 | 1.01 | -0 | 0.98 | 0.01 | 0.98 | 0.01 | ||||||||
○Loop 34 - simulation.cpp:335-335 - md-gcc-Ofast | 0.30 | 0.29 | 0.30 | 0.29 | 0.30 | 0.35 | 0.30 | 0.29 | 0.27 | 0.25 | 0.24 | 0.21 | 0.30 | 0.29 | 0.27 | 0.25 | 0.24 | 0.21 | 3.14 | 1.55 | 0.83 | 0.41 | 0.28 | 0.19 | 3.14 | 1.55 | 0.83 | 0.41 | 0.28 | 0.19 | 3.10 | 1.51 | 0.73 | 0.36 | 0.23 | 0.12 | 3.10 | 1.51 | 0.73 | 0.36 | 0.23 | 0.12 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.01 | 0.02 | 0.02 | 0.03 | 0.05 | 0.05 | 0.06 | 0.06 | 0.03 | 0.03 | 0.03 | 1 | 0 | 1.03 | 0 | 1.06 | 0 | 1.07 | 0 | 1.05 | 0 | 1.04 | 0 | ||||||||
○Loop 33 - simulation.cpp:334-334 - md-gcc-Ofast | 0.15 | 0.16 | 0.15 | 0.17 | 0.18 | 0.19 | 0.15 | 0.15 | 0.14 | 0.13 | 0.14 | 0.12 | 0.15 | 0.15 | 0.14 | 0.13 | 0.14 | 0.12 | 1.55 | 0.84 | 0.42 | 0.25 | 0.16 | 0.10 | 1.55 | 0.84 | 0.42 | 0.25 | 0.16 | 0.10 | 1.55 | 0.77 | 0.38 | 0.19 | 0.13 | 0.06 | 1.55 | 0.77 | 0.38 | 0.19 | 0.13 | 0.06 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.01 | 0.01 | 0.02 | 0.03 | 0.04 | 0.01 | 0.05 | 0.04 | 0.03 | 0.03 | 0.02 | 1 | 0 | 1.01 | -0 | 1.02 | -0 | 1.02 | -0 | 0.95 | 0.01 | 0.94 | 0.01 | ||||||||
○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 0.46 | 1.44 | 4.02 | 7.12 | 10.11 | 14.08 | 0.23 | 0.94 | 3.03 | 5.64 | 8.41 | 11.66 | 0.23 | 0.94 | 3.03 | 5.64 | 8.41 | 11.66 | 4.77 | 7.58 | 10.99 | 10.14 | 9.37 | 7.61 | 4.77 | 7.58 | 10.99 | 10.14 | 9.37 | 7.61 | 2.42 | 4.96 | 8.27 | 8.04 | 7.79 | 6.30 | 2.42 | 4.96 | 8.27 | 8.04 | 7.79 | 6.30 | 2 | 4 | 8 | 16 | 26 | 52 | 0.32 | 0.67 | 1.29 | 1.66 | 1.86 | 2.22 | 3.33 | 3.55 | 3.52 | 2.37 | 1.72 | 1.20 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 1 | 0 | 0.24 | 0.71 | 0.07 | 2.81 | 0.04 | 5.43 | 0.02 | 8.21 | 0.01 | 11.49 | |
►velocityVerlet(Particles&, CellList&, int, int) [clone ._omp_fn.1] | md-gcc-Ofast | 0.09 | 0.10 | 0.14 | 0.30 | 0.30 | 0.71 | 0.09 | 0.09 | 0.12 | 0.20 | 0.24 | 0.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.97 | 0.54 | 0.38 | 0.43 | 0.28 | 0.38 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.96 | 0.49 | 0.34 | 0.28 | 0.22 | 0.22 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.01 | 0.01 | 0.05 | 0.02 | 0.09 | 0.02 | 0.04 | 0.03 | 0.06 | 0.02 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | GNU C++17 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ... | 1 | 0 | 0.98 | 0 | 0.71 | 0.04 | 0.42 | 0.11 | 0.34 | 0.16 | 0.17 | 0.34 |
○Loop 30 - simulation.cpp:352-354 - md-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 31 - simulation.cpp:352-354 - md-gcc-Ofast | 0.09 | 0.10 | 0.14 | 0.30 | 0.30 | 0.71 | 0.09 | 0.09 | 0.12 | 0.20 | 0.24 | 0.40 | 0.09 | 0.09 | 0.12 | 0.20 | 0.24 | 0.40 | 0.97 | 0.54 | 0.38 | 0.43 | 0.28 | 0.38 | 0.97 | 0.54 | 0.38 | 0.43 | 0.28 | 0.38 | 0.96 | 0.49 | 0.34 | 0.28 | 0.22 | 0.22 | 0.96 | 0.49 | 0.34 | 0.28 | 0.22 | 0.22 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.01 | 0.01 | 0.05 | 0.02 | 0.09 | 0.02 | 0.04 | 0.03 | 0.06 | 0.02 | 0.05 | 1 | 0 | 0.98 | 0 | 0.71 | 0.04 | 0.42 | 0.11 | 0.34 | 0.16 | 0.17 | 0.33 | ||||||||
○gomp_barrier_wait_end | libgomp.so.1.0.0 | 0.11 | 0.23 | 0.44 | 1.01 | 1.99 | 6.72 | 0.05 | 0.16 | 0.38 | 0.89 | 1.86 | 6.28 | 0.05 | 0.16 | 0.38 | 0.89 | 1.86 | 6.28 | 1.10 | 1.19 | 1.21 | 1.43 | 1.84 | 3.63 | 1.10 | 1.19 | 1.21 | 1.43 | 1.84 | 3.63 | 0.55 | 0.87 | 1.05 | 1.27 | 1.72 | 3.39 | 0.55 | 0.87 | 1.05 | 1.27 | 1.72 | 3.39 | 1 | 3 | 7 | 16 | 25 | 51 | 0.00 | 0.01 | 0.01 | 0.24 | 0.04 | 0.13 | 0.00 | 0.03 | 0.01 | 0.34 | 0.03 | 0.07 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 1 | 0 | 0.32 | 0.11 | 0.13 | 0.33 | 0.05 | 0.85 | 0.02 | 1.81 | 0.01 | 6.24 | |
○unknown_function | libc.so.6 | 0.10 | 0.19 | 0.38 | 0.74 | 1.23 | 4.25 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.03 | 0.98 | 1.04 | 1.05 | 1.13 | 2.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.52 | 0.25 | 0.13 | 0.07 | 0.04 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 3 | 2 | 6 | 5 | 14 | 0.07 | 0.11 | 0.27 | 0.30 | 0.55 | 1.16 | 0.72 | 0.56 | 0.74 | 0.43 | 0.50 | 0.61 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 1 | 0 | 1.05 | -0 | 0.99 | 0 | 0.96 | 0 | 0.9 | 0 | 0.44 | 0.05 | |
►assignParticlesToCells(Particles const&, CellList&) [clone ._omp_fn.4] | md-gcc-Ofast | 0.04 | 0.05 | 0.06 | 0.08 | 0.09 | 0.21 | 0.04 | 0.05 | 0.05 | 0.07 | 0.08 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.44 | 0.25 | 0.16 | 0.11 | 0.09 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.43 | 0.24 | 0.14 | 0.10 | 0.07 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.03 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | GNU C++17 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ... | 1 | 0 | 0.89 | 0 | 0.74 | 0.01 | 0.55 | 0.03 | 0.47 | 0.04 | 0.21 | 0.11 |
○Loop 5 - simulation.cpp:114-125 - md-gcc-Ofast [...] | 0.04 | 0.05 | 0.06 | 0.08 | 0.09 | 0.21 | 0.04 | 0.05 | 0.05 | 0.07 | 0.08 | 0.14 | 0.04 | 0.05 | 0.05 | 0.07 | 0.08 | 0.14 | 0.44 | 0.25 | 0.16 | 0.11 | 0.09 | 0.12 | 0.44 | 0.25 | 0.16 | 0.11 | 0.09 | 0.12 | 0.43 | 0.24 | 0.14 | 0.10 | 0.07 | 0.08 | 0.43 | 0.24 | 0.14 | 0.10 | 0.07 | 0.08 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.03 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 1 | 0 | 0.89 | 0 | 0.74 | 0.01 | 0.55 | 0.03 | 0.47 | 0.04 | 0.21 | 0.11 | ||||||||
►assignParticlesToCells(Particles const&, CellList&) [clone ._omp_fn.1] | md-gcc-Ofast | 0.02 | 0.02 | 0.02 | 0.03 | 0.04 | 0.07 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.16 | 0.09 | 0.05 | 0.04 | 0.04 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.16 | 0.08 | 0.04 | 0.03 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | GNU C++17 15.1.1 20250425 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ... | 1 | 0 | 0.98 | 0 | 0.97 | 0 | 0.74 | 0 | 0.59 | 0.01 | 0.27 | 0.03 |
○Loop 6 - simulation.cpp:58-68 - md-gcc-Ofast [...] | 0.02 | 0.02 | 0.02 | 0.03 | 0.04 | 0.07 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.04 | 0.16 | 0.09 | 0.05 | 0.04 | 0.04 | 0.04 | 0.16 | 0.09 | 0.05 | 0.04 | 0.04 | 0.04 | 0.16 | 0.08 | 0.04 | 0.03 | 0.02 | 0.02 | 0.16 | 0.08 | 0.04 | 0.03 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 26 | 52 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0 | 0.98 | 0 | 0.97 | 0 | 0.74 | 0 | 0.59 | 0.01 | 0.27 | 0.03 |