Loops
▶mmq.cpp: 1138 - 6.92 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
orig_default | 557 | 1.40 | 1.13 | 6.92 | 0 | 0 | 997.84

Run | Optimizer analysis
---|---
orig_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 557)
aocc_default, gcc_default, icx_10, aocc_5, gcc_3 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.

Analysis (count) | orig_default
---|---
Control Flow Issues |
Vectorization Roadblocks |
Presence of more than 4 paths | 1
▶ggml-cpu.c: 1291 - 3.67 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 97 | 0.41 | 0.21 | 1.42 | 0 | 11.38 | 3.22
gcc_default | 66 | 0.06 | 0.02 | 0.28 | 0 | 11.34 | 28.71
icx_10 | 49 | 0.04 | 0.01 | 0.07 | 0 | 11.32 | 57.99
aocc_5 | 97 | 0.54 | 0.24 | 1.65 | 0 | 11.38 | 1.8
gcc_3 | 66 | 0.05 | 0.02 | 0.24 | 0 | 11.51 | 20.09

Run | Optimizer analysis
---|---
orig_default | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
aocc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 97)
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 66)
icx_10 | Sum on 1 analyzed binary loop (libggml-cpu.so - 49)
aocc_5 | Sum on 1 analyzed binary loop (libggml-cpu.so - 97)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 66)

Analysis (count) | aocc_default | gcc_default | icx_10 | aocc_5 | gcc_3
---|---|---|---|---|---
Control Flow Issues | | | | |
Presence of calls | 1 | 1 | 1 | 1 | 1
Presence of more than 4 paths | 1 | 0 | 1 | 1 | 0
Data Access Issues | | | | |
More than 20% of the loads are accessing the stack | 1 | 1 | 1 | 1 | 1
Vectorization Roadblocks | | | | |
Presence of calls | 1 | 1 | 1 | 1 | 1
Presence of more than 4 paths | 1 | 0 | 1 | 1 | 0
▶sgemm.cpp: 399 - 2.84 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 1843 | 0.18 | 0.12 | 1.58 | 100 | 85.29 | 1943.42
gcc_default | 1845 | 0.03 | 0.00 | 0.04 | 100 | 84.48 | 3243.39
gcc_3 | 1834 | 0.02 | 0.00 | 0.03 | 100 | 84.48 | 3480.96
gcc_3 | 1832 | 0.16 | 0.09 | 1.19 | 100 | 85.29 | 2406.29

Run | Optimizer analysis
---|---
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 1843)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1832)
orig_default, aocc_default, icx_10, aocc_5 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
▶sgemm.cpp: 144 - 2.00 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
orig_default | 2663 | 0.14 | 0.08 | 0.49 | 0 | 0 | 2768.99
aocc_default | 3047 | 0.13 | 0.07 | 0.50 | 0 | 0 | 2974.14
aocc_default | 3045 | 0.01 | 0.00 | 0.01 | 0 | 0 | 6738.45
icx_10 | 2392 | 0.02 | 0.00 | 0.02 | 0 | 0 | 3436.53
icx_10 | 2394 | 0.12 | 0.07 | 0.48 | 0 | 0 | 3214.07
aocc_5 | 3065 | 0.12 | 0.07 | 0.50 | 0 | 0 | 3018.37
aocc_5 | 3063 | 0.01 | 0.00 | 0.01 | 0 | 0 | 5666.7

Run | Optimizer analysis
---|---
orig_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 2663)
aocc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 3047)
icx_10 | Sum on 1 analyzed binary loop (libggml-cpu.so - 2394)
aocc_5 | Sum on 1 analyzed binary loop (libggml-cpu.so - 3065)
gcc_default, gcc_3 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.

Analysis (count) | orig_default | aocc_default | icx_10 | aocc_5
---|---|---|---|---
Control Flow Issues | | | |
Presence of 2 to 4 paths | 1 | 1 | 1 | 1
Vectorization Roadblocks | | | |
Presence of 2 to 4 paths | 1 | 1 | 1 | 1
▶vec.h: 89 - 1.42 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 2015 | 0.06 | 0.03 | 0.18 | 100 | 50 | 16.8
gcc_default | 1407 | 0.09 | 0.04 | 0.59 | 100 | 50 | 14.24
aocc_5 | 2078 | 0.07 | 0.02 | 0.17 | 100 | 50 | 15.13
gcc_3 | 1390 | 0.09 | 0.04 | 0.48 | 100 | 50 | 12.03

Run | Optimizer analysis
---|---
aocc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 2015)
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 1407)
aocc_5 | Sum on 1 analyzed binary loop (libggml-cpu.so - 2078)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1390)
orig_default, icx_10 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.

Analysis (count) | aocc_default | gcc_default | aocc_5 | gcc_3
---|---|---|---|---
Data Access Issues | | | |
More than 10% of the vector loads instructions are unaligned | 1 | 1 | 1 | 1
▶ops.cpp: 5552 - 1.35 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 1387 | 0.10 | 0.05 | 0.70 | 0 | 12.5 | 3.31
icx_10 | 1644 | 0.09 | 0.04 | 0.30 | 0 | 12.5 | 4.11
gcc_3 | 1370 | 0.06 | 0.03 | 0.35 | 0 | 12.5 | 2.02

Run | Optimizer analysis
---|---
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 1387)
icx_10 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1644)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1370)
orig_default, aocc_default, aocc_5 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.

Analysis (count) | gcc_default | icx_10 | gcc_3
---|---|---|---
Control Flow Issues | | |
Presence of calls | 1 | 1 | 1
Data Access Issues | | |
Presence of constant non-unit stride data access | 1 | 1 | 1
Presence of indirect access | 1 | 0 | 1
More than 20% of the loads are accessing the stack | 1 | 1 | 1
Vectorization Roadblocks | | |
Presence of calls | 1 | 1 | 1
Presence of constant non-unit stride data access | 1 | 1 | 1
Presence of indirect access | 1 | 0 | 1
▶<unknown>: 0 - 1.18 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1195 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2750 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2750 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1958 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1208 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2999 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2134 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2485 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2807 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2105 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2804 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 3074 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2405 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1126 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1453 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2194 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1354 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1419 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2301 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2640 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 3308 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2197 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2792 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2994 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2516 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1272 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2909 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1055 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2644 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2897 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2299 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2628 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 3184 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2217 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 3014 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 3109 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
2304 | 0.02 | 0.00 | 0.00 | 0 | 0 | 0 | 2638 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2096 | 0.00 | 0.00 | 0.00 | 0 | 0 | 148.67 | 2226 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2906 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 3108 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
681 | 0.06 | 0.02 | 0.14 | 100 | 50 | 16.61 | 2741 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 55 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2106 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2903 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2088 | 0.00 | 0.00 | 0.00 | 0 | 0 | 399 |
11 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 1 | 0.00 | 0.00 | 0.00 | 0 | 0 | 303.66 | 1757 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 595 | 0.05 | 0.02 | 0.12 | 100 | 50 | 0.9 | 2611 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 55 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 |
700 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 317 | 0.00 | 0.00 | 0.00 | 0 | 0 | 25.3 | 512 | 0.01 | 0.00 | 0.01 | 0 | 0 | 3171.62 | 316 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 0 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 90 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA |
365 | 0.00 | 0.00 | 0.00 | 0 | 0 | 169.5 | 838 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 91 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 8 | 0.01 | 0.00 | 0.00 | 0 | 0 | 141.47 | 1 | 0.01 | 0.00 | 0.00 | 0 | 0 | 3.45 | 2085 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA |
1747 | 0.02 | 0.00 | 0.01 | 0 | 0 | 263.17 | 0 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 591 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 1515 | 0.01 | 0.00 | 0.01 | 0 | 0 | 773.98 | 2619 | 0.01 | 0.00 | 0.00 | 0 | 0 | 151.63 | 17 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA |
451 | 0.02 | 0.00 | 0.02 | 0 | 0 | 107.29 | 1 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2099 | 0.01 | 0.00 | 0.00 | 0 | 0 | 487.82 | 46 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 379 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 430 | 0.01 | 0.00 | 0.00 | 0 | 0 | 49.87 |
2301 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 2600 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 580 | 0.01 | 0.00 | 0.01 | 0 | 0 | 422.03 | 280 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 107 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | |||||||
812 | 0.03 | 0.01 | 0.05 | 0 | 6.25 | 65.33 | 107 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 11 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 58 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2032 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2.11 | |||||||
1750 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2617.84 | 695 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2294.29 | 140 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 57 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 697 | 0.01 | 0.00 | 0.00 | 0 | 0 | 3563.21 | |||||||
1350 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 531 | 0.01 | 0.00 | 0.03 | 0 | 0 | 72.3 | 24 | 0.00 | 0.00 | 0.00 | 0 | 0 | 167.25 | 93 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | ||||||||||||||
403 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 93 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 211 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2063 | 0.01 | 0.00 | 0.00 | 0 | 0 | 703.5 | ||||||||||||||
108 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 108 | 0.01 | 0.00 | 0.00 | 0 | 0 | 18.98 | 2069 | 0.01 | 0.00 | 0.01 | 0 | 0 | 1701.05 | |||||||||||||||||||||
122 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 2105 | 0.01 | 0.00 | 0.01 | 0 | 0 | 991.83 | 1488 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | |||||||||||||||||||||
6 | 0.03 | 0.00 | 0.02 | 100 | 100 | 0 | 1421 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 108 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | |||||||||||||||||||||
1720 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 407 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | 409 | 0.00 | 0.00 | 0.00 | 0 | 0 | NA | |||||||||||||||||||||
546 | 0.03 | 0.01 | 0.07 | 0 | 0 | 742.55 | 1968 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 339 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | |||||||||||||||||||||
1836 | 0.04 | 0.01 | 0.09 | 15.56 | 9.17 | 664.83 | 1969 | 0.01 | 0.00 | 0.01 | 0 | 0 | 2.23 | 1 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | |||||||||||||||||||||
1572 | 0.01 | 0.00 | 0.01 | 0 | 0 | 277.5 | 377 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | ||||||||||||||||||||||||||||
1811 | 0.02 | 0.00 | 0.01 | 0 | 0 | 381.16 | 2005 | 0.01 | 0.00 | 0.01 | 0 | 0 | 1859.89 | ||||||||||||||||||||||||||||
113 | 0.03 | 0.01 | 0.07 | 0 | 0 | 666.61 | 2000 | 0.01 | 0.00 | 0.00 | 0 | 0 | 863.52 | ||||||||||||||||||||||||||||
2924 | 1.38 | 0.01 | 0.04 | 0 | 0 | 745.31 | |||||||||||||||||||||||||||||||||||
1577 | 0.04 | 0.01 | 0.08 | 0 | 7.81 | 67.59 | |||||||||||||||||||||||||||||||||||
2661 | 0.02 | 0.00 | 0.02 | 0 | 0 | 3133.39 | |||||||||||||||||||||||||||||||||||
1749 | 0.03 | 0.01 | 0.05 | 0 | 6.25 | 329.17 | |||||||||||||||||||||||||||||||||||
698 | 0.06 | 0.01 | 0.07 | 0 | 6.25 | 47.74 | |||||||||||||||||||||||||||||||||||
1126 | 0.05 | 0.01 | 0.08 | 0 | 0 | 1420.41 | |||||||||||||||||||||||||||||||||||
1131 | 0.02 | 0.00 | 0.02 | 0 | 0 | 5437.12 | |||||||||||||||||||||||||||||||||||
1755 | 0.03 | 0.01 | 0.06 | 100 | 100 | 47.8 | |||||||||||||||||||||||||||||||||||
Sum on 6 analyzed binary loops (exec - 681, libggml-cpu.so - 1836, libggml-cpu.so - 1577, libggml-cpu.so - 1749, libggml-cpu.so - 698, libggml-cpu.so - 1755) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (exec - 595) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | |||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||||
Out of user code | Out of user code | 1 |
▶ops.cpp: 4325 - 0.75 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 1758 | 0.04 | 0.02 | 0.11 | 0 | 7.81 | 63.08
gcc_default | 1322 | 0.04 | 0.02 | 0.25 | 0 | 7.81 | 57.8
icx_10 | 1517 | 0.04 | 0.01 | 0.09 | 0 | 7.81 | 71.63
aocc_5 | 1820 | 0.03 | 0.01 | 0.07 | 100 | 31.25 | 85.83
gcc_3 | 1303 | 0.05 | 0.02 | 0.23 | 100 | 39.29 | 48.62

Run | Optimizer analysis
---|---
orig_default | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.
aocc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 1758)
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 1322)
icx_10 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1517)
aocc_5 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1820)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 1303)

Analysis (count) | aocc_default | gcc_default | icx_10 | aocc_5 | gcc_3
---|---|---|---|---|---
Loop Computation Issues | | | | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | 1 | 1 | 1 | 1
Data Access Issues | | | | |
Presence of constant non-unit stride data access | | | | | 1
More than 10% of the vector loads instructions are unaligned | | | | | 1
Presence of special instructions executing on a single port | | | | | 1
Vectorization Roadblocks | | | | |
Presence of constant non-unit stride data access | | | | | 1
Inefficient Vectorization | | | | |
Presence of special instructions executing on a single port | | | | | 1
▶ops.cpp: 6210 - 0.74 %
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 996 | 0.06 | 0.03 | 0.38 | 19.61 | 9.8 | 773.26
aocc_5 | 2135 | 0.04 | 0.01 | 0.09 | 3.33 | 6.88 | 708.86
gcc_3 | 983 | 0.06 | 0.02 | 0.27 | 7.55 | 7.84 | 786.09

Run | Optimizer analysis
---|---
gcc_default | Sum on 1 analyzed binary loop (libggml-cpu.so - 996)
aocc_5 | Sum on 1 analyzed binary loop (libggml-cpu.so - 2135)
gcc_3 | Sum on 1 analyzed binary loop (libggml-cpu.so - 983)
orig_default, aocc_default, icx_10 | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count.

Analysis (count) | gcc_default | aocc_5 | gcc_3
---|---|---|---
Loop Computation Issues | | |
Presence of expensive FP instructions | 1 | 1 | 1
Presence of a large number of scalar integer instructions | 1 | 0 | 1
Low iteration count | 0 | 0 | 1
Control Flow Issues | | |
Presence of calls | 1 | 1 | 1
Presence of more than 4 paths | 1 | 0 | 1
Low iteration count | 0 | 0 | 1
Data Access Issues | | |
Presence of special instructions executing on a single port | 1 | 0 | 0
More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||||||||||||||||||||
Presence of calls | 1 | Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||||||||||||||||||||||||
Presence of more than 4 paths | 1 | Presence of more than 4 paths | 0 | Presence of more than 4 paths | 1 | ||||||||||||||||||||||||||||||||||||
Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 0 | Presence of special instructions executing on a single port | |||||||||||||||||||||||||||||||||||||
Use of masked instructions | 0 | Use of masked instructions | 1 | Use of masked instructions |
▶vec.h: 677 - 0.66 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2012 | 0.04 | 0.01 | 0.06 | 100 | 100 | 38.78 | 1401 | 0.01 | 0.00 | 0.01 | 100 | 100 | 634.32 | 1672 | 0.03 | 0.01 | 0.06 | 100 | 100 | 69.79 | 2075 | 0.02 | 0.00 | 0.03 | 100 | 100 | 113.21 | 1384 | 0.01 | 0.00 | 0.00 | 100 | 100 | 854.99 | |||||||
1759 | 0.02 | 0.00 | 0.02 | 100 | 100 | 115.9 | 1321 | 0.02 | 0.00 | 0.04 | 100 | 100 | 219.66 | 1664 | 0.01 | 0.00 | 0.00 | 100 | 100 | 426.31 | 1821 | 0.02 | 0.00 | 0.02 | 100 | 100 | 195.36 | 1389 | 0.05 | 0.02 | 0.23 | 100 | 100 | 36.66 | |||||||
1406 | 0.04 | 0.01 | 0.14 | 100 | 100 | 46.04 | 1302 | 0.03 | 0.00 | 0.06 | 100 | 100 | 97.46 | ||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1406) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1389) | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | |||||||||||||||||||||||||||||||||||||||
More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | |||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access |
▶mmq.cpp: 303 - 0.48 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
638 | 0.06 | 0.02 | 0.11 | 90.91 | 38.76 | 0 | 384 | 0.05 | 0.02 | 0.12 | 90.91 | 38.76 | 0 | 540 | 0.06 | 0.02 | 0.12 | 100 | 41.39 | 0 | 386 | 0.05 | 0.02 | 0.13 | 90.91 | 38.76 | 0 | ||||||||||||||
Sum on 1 analyzed binary loop (libggml-cpu.so - 638) | Sum on 1 analyzed binary loop (libggml-cpu.so - 384) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 540) | Sum on 1 analyzed binary loop (libggml-cpu.so - 386) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||||||||||||||
More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 0 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||||||||||
Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | ||||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 |
▶binary-ops.cpp: 18 - 0.43 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
914 | 0.04 | 0.01 | 0.05 | 0 | 6.25 | 77.1 | 716 | 0.04 | 0.01 | 0.15 | 100 | 50 | 48.96 | 695 | 0.03 | 0.01 | 0.05 | 0 | 6.25 | 70.9 | 913 | 0.02 | 0.01 | 0.04 | 0 | 6.25 | 81.43 | 703 | 0.04 | 0.01 | 0.15 | 100 | 50 | 41.52 | |||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 716) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 703) | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||||||||||||||||
More than 10% of the vector loads instructions are unaligned | 1 | More than 10% of the vector loads instructions are unaligned | 1 |
▶binary-ops.cpp: 10 - 0.42 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
816 | 0.06 | 0.01 | 0.07 | 0 | 6.25 | 51.92 | 634 | 0.04 | 0.01 | 0.07 | 100 | 50 | 92.28 | 561 | 0.04 | 0.01 | 0.05 | 0 | 6.25 | 70.05 | 815 | 0.04 | 0.01 | 0.07 | 0 | 6.25 | 43.83 | 619 | 0.04 | 0.01 | 0.15 | 100 | 50 | 38.53 | |||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 816) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 815) | Sum on 1 analyzed binary loop (libggml-cpu.so - 619) | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||||||||||||||||||||
More than 10% of the vector loads instructions are unaligned | More than 10% of the vector loads instructions are unaligned | More than 10% of the vector loads instructions are unaligned | 1 |
▶vec.h: 1444 - 0.40 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2001 | 0.03 | 0.01 | 0.06 | 0 | 6.25 | 285.05 | 1396 | 0.03 | 0.01 | 0.11 | 33.33 | 12.5 | 234.94 | 1666 | 0.04 | 0.01 | 0.07 | 0 | 6.25 | 102.65 | 2064 | 0.03 | 0.01 | 0.06 | 0 | 6.25 | 259.77 | 1379 | 0.03 | 0.01 | 0.11 | 0 | 6.25 | 290.58 | |||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1666) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶vec.h: 1084 - 0.39 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1236 | 0.05 | 0.02 | 0.11 | 98 | 98.13 | 1184.97 | 1095 | 0.04 | 0.01 | 0.10 | 100 | 100 | 1324.38 | 1299 | 0.05 | 0.01 | 0.10 | 98 | 98.13 | 1336.35 | |||||||||||||||||||||
1238 | 0.03 | 0.00 | 0.03 | 88.58 | 75.12 | 4978.02 | 1104 | 0.04 | 0.00 | 0.03 | 90.63 | 77.73 | 5266.58 | 1303 | 0.02 | 0.00 | 0.02 | 88.58 | 75.12 | 6153.04 | |||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1236) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1095) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1299) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||||||||||||||||||||
Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||||||||||||||||||||
Control Flow Issues | Control Flow Issues | Control Flow Issues | |||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | Presence of special instructions executing on a single port | 1 | |||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 0 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||||||||||||
Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 0 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||||||||||||||||
Use of masked instructions | 1 | Use of masked instructions | 1 | Use of masked instructions | 1 |
▶ggml-impl.h: 346 - 0.36 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
4 | 0.09 | 0.03 | 0.19 | 7.69 | 8.17 | 2.29 | 4 | 0.09 | 0.03 | 0.18 | 9.09 | 8.52 | 2.21 | ||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 4) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 4) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 0 | ||||||||||||||||||||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||||||||||||||||||
Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 |
▶quants.c: 298 - 0.35 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3307 | 1.31 | 0.01 | 0.05 | 58.33 | 28.75 | 790.31 | 2214 | 1.43 | 0.01 | 0.10 | 60 | 28.75 | 707.09 | 2662 | 1.16 | 0.01 | 0.04 | 60.7 | 29.66 | 889.34 | 3325 | 1.32 | 0.01 | 0.05 | 60.7 | 29.66 | 776.01 | 2203 | 1.57 | 0.01 | 0.11 | 59.65 | 29.28 | 643.58 | |||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶vec.cpp: 372 - 0.34 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
963 | 0.05 | 0.01 | 0.16 | 100 | 100 | 1581.66 | 951 | 0.05 | 0.01 | 0.18 | 100 | 100 | 1406.99 | ||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 963) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 951) | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||||||||||||||||||||||
Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||||
Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||||||||||||||||||||||
Inefficient Vectorization | Inefficient Vectorization | ||||||||||||||||||||||||||||||||||||||||
Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||||||||||||||||||
Use of masked instructions | 1 | Use of masked instructions | 1 |
▶sgemm.cpp: 425 - 0.27 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 1844 | 0.01 | 0.00 | 0.01 | 90.91 | 33.81 | 2951.71
gcc_default | 1842 | 0.03 | 0.01 | 0.11 | 90.91 | 33.81 | 4816.6
gcc_3 | 1833 | 0.01 | 0.00 | 0.01 | 90.48 | 34.23 | 2210.89
gcc_3 | 1831 | 0.04 | 0.01 | 0.15 | 90.48 | 34.23 | 2769.41
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 822 - 0.25 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 455 | 0.04 | 0.01 | 0.14 | 78.57 | 69.64 | 0
gcc_3 | 446 | 0.04 | 0.01 | 0.11 | 52.38 | 47.77 | 0
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 455) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | |||||||||||||||||||||||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||||||||||||||||||||
Control Flow Issues | |||||||||||||||||||||||||||||||||||||||||
Presence of calls | 1 | ||||||||||||||||||||||||||||||||||||||||
Data Access Issues | |||||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | |||||||||||||||||||||||||||||||||||||||||
Presence of calls | 1 | ||||||||||||||||||||||||||||||||||||||||
Presence of constant non-unit stride data access | 1 |
▶ops.cpp: 6220 - 0.24 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 2113 | 0.05 | 0.02 | 0.14 | 1.96 | 6.62 | 678.54
icx_10 | 1724 | 0.04 | 0.02 | 0.10 | 2.17 | 6.66 | 629.74
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 2113) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1724) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||||||||||||||||
Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||||||||||||||||||
Control Flow Issues | Control Flow Issues | ||||||||||||||||||||||||||||||||||||||||
Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||||||||||||||||||||||||||
Data Access Issues | Data Access Issues | ||||||||||||||||||||||||||||||||||||||||
More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||||||||||||||||||||||
Presence of calls | 1 | Presence of calls | 1 |
▶vec.cpp: 411 - 0.19 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 965 | 0.03 | 0.01 | 0.07 | 90.45 | 73.75 | 3755.59
gcc_3 | 952 | 0.04 | 0.01 | 0.12 | 90.47 | 73.78 | 2305.39
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ops.cpp: 6446 - 0.14 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 990 | 0.02 | 0.00 | 0.04 | 69.23 | 36.54 | 252.08
icx_10 | 1712 | 0.02 | 0.00 | 0.02 | 0 | 6.25 | 255.7
aocc_5 | 2128 | 0.03 | 0.01 | 0.03 | 0 | 6.25 | 186.89
gcc_3 | 977 | 0.03 | 0.00 | 0.04 | 69.23 | 36.54 | 290.53
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ggml-cpu.c: 3204 - 0.14 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 5 | 0.02 | 0.00 | 0.01 | 100 | 100 | 0
gcc_default | 5 | 0.02 | 0.00 | 0.04 | 100 | 83.33 | 0
icx_10 | 6 | 0.02 | 0.00 | 0.03 | 100 | 100 | 0
aocc_5 | 5 | 0.02 | 0.00 | 0.03 | 100 | 100 | 0
gcc_3 | 5 | 0.02 | 0.00 | 0.03 | 100 | 83.33 | 0
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ops.cpp: 5915 - 0.10 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 1404 | 0.01 | 0.00 | 0.03 | 100 | 50 | 726.26
icx_10 | 1667 | 0.02 | 0.00 | 0.03 | 0 | 6.25 | 574.02
gcc_3 | 1387 | 0.02 | 0.00 | 0.04 | 100 | 50 | 446.2
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 2068 - 0.10 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_default | 710 | 0.03 | 0.01 | 0.05 | 0 | 0 | 45.5
aocc_5 | 712 | 0.03 | 0.01 | 0.05 | 0 | 0 | 44.84
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 1597 - 0.10 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 408 | 0.02 | 0.00 | 0.05 | 87.67 | 85.7 | 74.87
gcc_3 | 399 | 0.02 | 0.00 | 0.05 | 87.67 | 85.7 | 60.75
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 1140 - 0.05 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 511 | 0.01 | 0.00 | 0.01 | 100 | 100 | 1772.6
gcc_default | 514 | 0.01 | 0.00 | 0.01 | 100 | 100 | 3122.06
gcc_default | 513 | 0.01 | 0.00 | 0.00 | 100 | 100 | 5672.31
gcc_3 | 499 | 0.01 | 0.00 | 0.01 | 100 | 100 | 2504.81
gcc_3 | 498 | 0.01 | 0.00 | 0.01 | 100 | 100 | 2406.99
gcc_3 | 501 | 0.01 | 0.00 | 0.00 | 100 | 100 | 5023.08
gcc_3 | 500 | 0.01 | 0.00 | 0.01 | 100 | 100 | 3921.71
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 1141 - 0.05 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
icx_10 | 457 | 0.04 | 0.01 | 0.05 | 0 | 0 | 2553.11
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ggml-cpu.c: 1664 - 0.04 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
gcc_default | 71 | 0.01 | 0.00 | 0.02 | 0 | 8.82 | 9.73
gcc_3 | 71 | 0.01 | 0.00 | 0.02 | 0 | 9.38 | 5.75
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 1570 - 0.03 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
aocc_5 | 533 | 0.02 | 0.00 | 0.03 | 0 | 0 | 86.34
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 1573 - 0.03 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions
Run | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s
---|---|---|---|---|---|---|---
icx_10 | 321 | 0.01 | 0.00 | 0.03 | 0 | 0 | 47.43
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶traits.cpp: 13 - 0.02 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
331 | 0.01 | 0.00 | 0.02 | 0 | 12.5 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶mmq.cpp: 2488 - 0.01 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
561 | 0.01 | 0.00 | 0.01 | 0 | 6.82 | 468.39 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶basic_string.h: 1077 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2836 | 0.02 | 0.00 | 0.00 | 0 | 11.81 | 0 | |||||||||||||||||||||||||||||||||||
3412 | 0.01 | 0.00 | 0.00 | 0 | 12.02 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ggml-impl.h: 355 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
474 | 0.01 | 0.00 | 0.00 | 85.72 | 20.16 | 14.87 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶hashtable.h: 2627 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3041 | 0.01 | 0.00 | 0.00 | 0 | 12.5 | 0 | 3132 | 0.01 | 0.00 | 0.00 | 0 | 12.5 | 0 | ||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶ops.cpp: 1168 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1031 | 0.01 | 0.00 | 0.00 | 0 | 12.5 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶hashtable_policy.h: 386 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2810 | 0.01 | 0.00 | 0.00 | 0 | 11.46 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶hashtable.h: 2386 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2108 | 0.02 | 0.00 | 0.00 | 0 | 11.88 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶unique_ptr.h: 93 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
9 | 0.01 | 0.00 | 0.00 | 3.64 | 10.97 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶llama-vocab.cpp: 2148 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2914 | 0.01 | 0.00 | 0.00 | 0 | 8.19 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶hashtable.h: 1840 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2802 | 0.02 | 0.00 | 0.00 | 0 | 11.33 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
▶hashtable_policy.h: 434 - 0.00 %
Run orig_default | Run aocc_default | Run gcc_default | Run icx_10 | Run aocc_5 | Run gcc_3 | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2109 | 0.01 | 0.00 | 0.00 | 0 | 10.94 | 0 | |||||||||||||||||||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |