Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 225-225 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 225-225 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 225-225 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
33 | 3.75 | 3.27 | 6.30 | 0 | 12.5 | 33 | 3.66 | 3.28 | 6.34 | 0 | 12.5 | 33 | 3.58 | 3.31 | 6.39 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 33) | Sum on 1 analyzed binary loop (md-icpx-O3 - 33) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 33) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 229-229 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 229-229 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 229-229 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
34 | 3.51 | 3.19 | 6.13 | 0 | 12.5 | 34 | 3.47 | 3.16 | 6.10 | 0 | 12.5 | 34 | 3.48 | 3.18 | 6.13 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 34) | Sum on 1 analyzed binary loop (md-icpx-O3 - 34) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 34) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 221-221 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 221-221 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 221-221 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
32 | 3.31 | 2.94 | 5.66 | 0 | 12.5 | 32 | 3.21 | 2.91 | 5.63 | 0 | 12.5 | 32 | 3.12 | 2.92 | 5.63 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 32) | Sum on 1 analyzed binary loop (md-icpx-O3 - 32) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 32) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 208-208 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 208-208 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 208-208 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
29 | 2.90 | 2.61 | 5.02 | 0 | 12.5 | 29 | 3.05 | 2.69 | 5.21 | 0 | 12.5 | 29 | 2.95 | 2.71 | 5.22 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 29) | Sum on 1 analyzed binary loop (md-icpx-O3 - 29) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 29) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 212-212 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 212-212 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 212-212 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
30 | 3.02 | 2.67 | 5.14 | 0 | 12.5 | 30 | 2.92 | 2.63 | 5.09 | 0 | 12.5 | 30 | 3.05 | 2.61 | 5.03 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 30) | Sum on 1 analyzed binary loop (md-icpx-O3 - 30) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 30) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 216-216 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 216-216 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 216-216 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
31 | 2.89 | 2.54 | 4.89 | 0 | 12.5 | 31 | 2.95 | 2.58 | 4.98 | 0 | 12.5 | 31 | 2.80 | 2.53 | 4.88 | 0 | 12.5 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 31) | Sum on 1 analyzed binary loop (md-icpx-O3 - 31) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 31) |
Analysis | Count | Analysis | Count | Analysis | Count |
Loop Computation Issues | | Loop Computation Issues | | Loop Computation Issues | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
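The six loops above (ASM loops 29 to 34, simulation.cpp lines 208 to 229) carry the same finding in all three builds: fewer than 10% of their FP ADD/SUB/MUL operations use FMA. The report does not show the source at those lines, so the following is only a sketch of the general pattern, not the benchmark's code. The kernel and the names `axpy_mul_add` and `axpy_fma` are hypothetical. It contrasts a multiply followed by an add with an explicit `std::fma`, which computes `a * x + y` with a single rounding.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical kernel (not taken from simulation.cpp): y[i] = a * x[i] + y[i],
// written as a separate multiply and add. Whether the compiler fuses the two
// operations into one FMA instruction depends on its floating-point settings.
void axpy_mul_add(double a, const std::vector<double>& x, std::vector<double>& y) {
    assert(x.size() == y.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = a * x[i] + y[i];
    }
}

// Same kernel with the fusion requested explicitly. std::fma rounds once, so
// results may differ in the last bit from the mul+add version for inputs
// that are not exactly representable.
void axpy_fma(double a, const std::vector<double>& x, std::vector<double>& y) {
    assert(x.size() == y.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = std::fma(a, x[i], y[i]);
    }
}
```

`std::fma` usually compiles to a hardware FMA instruction only when the build targets an architecture that provides one; otherwise it may fall back to a slower library call. Comparing the generated assembly of both variants is the reliable way to see which case applies.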
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_vector.h: 1264-1264
- /home/fmusial/MD_Benchmarks/simulation.cpp: 312-326
| Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 312-326
| Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 312-326
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
43 | 0.17 | 0.13 | 0.24 | 0 | 12.5 | 43 | 0.29 | 0.24 | 0.46 | 100 | 50 | 43 | 0.31 | 0.24 | 0.47 | 91.3 | 46.47 |
| | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (md-icpx-O3 - 43) | Sum on 1 analyzed binary loop (md-icpx-Ofast - 43) |
Analysis | Count | Analysis | Count | Analysis | Count |
| | Loop Computation Issues | | Loop Computation Issues | |
| | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 |
| | Control Flow Issues | | Control Flow Issues | |
| | Presence of calls | 1 | Presence of calls | 1 |
| | Data Access Issues | | Data Access Issues | |
| | More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 |
| | Vectorization Roadblocks | | Vectorization Roadblocks | |
| | Presence of calls | 1 | Presence of calls | 1 |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 341-345 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 341-345 | Loop Source Regions | - /home/fmusial/MD_Benchmarks/simulation.cpp: 341-345 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
44 | 0.27 | 0.19 | 0.36 | 0 | 12.5 | 46 | 0.25 | 0.20 | 0.39 | 100 | 50 | 46 | 0.23 | 0.17 | 0.33 | 100 | 50 |
| | |
Sum on 1 analyzed binary loop (md-icpx-O2 - 44) | Sum on 1 analyzed binary loop (md-icpx-O3 - 46) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count | Analysis | Count |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_vector.h: 1264-1264
- /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 107-109
- /home/fmusial/MD_Benchmarks/simulation.cpp: 118-118
- /home/fmusial/MD_Benchmarks/simulation.cpp: 124-126
| Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_vector.h: 1264-1264
- /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 107-109
- /home/fmusial/MD_Benchmarks/simulation.cpp: 118-118
- /home/fmusial/MD_Benchmarks/simulation.cpp: 124-126
| Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_vector.h: 1264-1264
- /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 107-116
- /home/fmusial/MD_Benchmarks/simulation.cpp: 118-118
- /home/fmusial/MD_Benchmarks/simulation.cpp: 124-126
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
21 | 0.11 | 0.09 | 0.17 | 0 | 6.25 | 21 | 0.12 | 0.08 | 0.16 | 0 | 6.25 | 21 | 0.10 | 0.09 | 0.17 | 0 | 6.25 |
| | 23 | 0.02 | 0.00 | 0.01 | 100 | 41.67 |
| | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count | Analysis | Count |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | | Loop Source Regions | | Loop Source Regions | |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
7 | 0.01 | 0.00 | 0.00 | 0 | 0 | 74 | 0.01 | 0.00 | 0.00 | 0 | 0 | 7 | 0.01 | 0.00 | 0.00 | 0 | 0 |
23 | 0.01 | 0.00 | 0.00 | 0 | 0 | 7 | 0.01 | 0.00 | 0.00 | 0 | 0 | 12 | 0.01 | 0.00 | 0.00 | 0 | 0 |
18 | 0.01 | 0.00 | 0.00 | 0 | 0 | 12 | 0.01 | 0.00 | 0.00 | 0 | 0 | 17 | 0.05 | 0.03 | 0.05 | 0 | 0 |
12 | 0.02 | 0.00 | 0.00 | 0 | 0 | 18 | 0.01 | 0.00 | 0.00 | 0 | 0 | 18 | 0.01 | 0.00 | 0.00 | 0 | 0 |
17 | 0.05 | 0.03 | 0.05 | 0 | 0 | 23 | 0.02 | 0.00 | 0.00 | 0 | 0 | 55 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 17 | 0.04 | 0.02 | 0.05 | 0 | 0 | 74 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| | 61 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count | Analysis | Count |
Run Skylake ICPX O2 | Run Skylake ICPX O3 | Run Skylake ICPX Ofast |
Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 51-53
- /home/fmusial/MD_Benchmarks/simulation.cpp: 62-62
- /home/fmusial/MD_Benchmarks/simulation.cpp: 68-69
| Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 51-53
- /home/fmusial/MD_Benchmarks/simulation.cpp: 62-62
- /home/fmusial/MD_Benchmarks/simulation.cpp: 68-69
| Loop Source Regions | - /usr/lib64/gcc/x86_64-pc-linux-gnu/15.1.1/../../../../include/c++/15.1.1/bits/stl_algobase.h: 239-239
- /home/fmusial/MD_Benchmarks/simulation.cpp: 51-53
- /home/fmusial/MD_Benchmarks/simulation.cpp: 62-62
- /home/fmusial/MD_Benchmarks/simulation.cpp: 68-69
|
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
11 | 0.05 | 0.03 | 0.05 | 0 | 6.25 | 11 | 0.04 | 0.02 | 0.04 | 0 | 6.25 | 11 | 0.05 | 0.03 | 0.05 | 0 | 6.25 |
| | |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count | Analysis | Count |
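The recurring Vector Length Use (%) values in the tables above (6.25, 12.5, 50) can be reproduced with a small calculation. This is a sketch under two assumptions that the report itself does not state: that the metric is the number of bits an instruction actually uses divided by the widest vector register width, and that the widest register on this target is 512 bits. The function name `vector_length_use_percent` is illustrative.

```cpp
#include <cassert>

// Assumed definition (not confirmed by the report): percentage of the widest
// vector register that an instruction actually uses. The 512-bit default is
// also an assumption about the target.
double vector_length_use_percent(int used_bits, int register_bits = 512) {
    assert(used_bits > 0 && register_bits >= used_bits);
    return 100.0 * used_bits / register_bits;
}
```

Under these assumptions, a scalar 64-bit double gives 12.5, a scalar 32-bit float gives 6.25, and a full 256-bit vector gives 50. Non-round values such as 41.67 or 46.47 would then correspond to a mix of instruction widths within one loop.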