Global Metrics
Total Time (s): 30.13
Max (Thread Active Time) (s): 14.68
Average Active Time (s): 14.49
Activity Ratio (%): 94.9
Average number of active threads: 92.354
Affinity Stability (%): 98.2
GFLOPS: 134.009
Time in analyzed loops (%): 3.47
Time in analyzed innermost loops (%): 3.34
Time in user code (%): 17.4
Compilation Options Score (%): 99.9
Array Access Efficiency (%): 89.4
Potential Speedups
Perfect Flow Complexity: 1.01
Perfect OpenMP/MPI/Pthread/TBB: 3.45
Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution: 5.44
No Scalar Integer
  Potential Speedup: 1.00
  Nb Loops to get 80%: 2
FP Vectorised
  Potential Speedup: 1.00
  Nb Loops to get 80%: 4
Fully Vectorised
  Potential Speedup: 1.02
  Nb Loops to get 80%: 5
FP Arithmetic Only
  Potential Speedup: 1.02
  Nb Loops to get 80%: 2
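
The figures above can be put in perspective with a little arithmetic. The sketch below is illustrative only and is not MAQAO's own computation. It assumes that each potential speedup applies to the reported total time of 30.13 s, and that Amdahl's law is a fair upper bound on what optimizing the analyzed loops (3.47% of the time) could bring. The function names `projected_time` and `amdahl_speedup` are hypothetical helpers introduced for this example.

```python
import math


def projected_time(total_time_s, speedup):
    """Run time expected if the given whole-application speedup were achieved."""
    return total_time_s / speedup


def amdahl_speedup(fraction, local_speedup):
    """Amdahl's law: overall speedup when `fraction` of the run time
    is accelerated by `local_speedup` and the rest is unchanged."""
    return 1.0 / ((1.0 - fraction) + fraction / local_speedup)


# Values copied from the report above.
TOTAL_TIME_S = 30.13
TIME_IN_ANALYZED_LOOPS = 0.0347  # 3.47 %

# Assumption: the reported speedups apply to the total time.
for label, speedup in [
    ("Perfect OpenMP/MPI/Pthread/TBB", 3.45),
    ("Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution", 5.44),
]:
    print(f"{label}: {projected_time(TOTAL_TIME_S, speedup):.2f} s")

# Upper bound if the analyzed loops took no time at all.
bound = amdahl_speedup(TIME_IN_ANALYZED_LOOPS, math.inf)
print(f"Amdahl bound for analyzed loops: {bound:.3f}x")
```

Under these assumptions, the bound for loop-level optimization stays close to 1, which is consistent with the reported loop-level potential speedups of 1.00 to 1.02, while the threading-related speedups dominate.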
CQA Potential Speedups Summary
Average Active Threads Count
FLOPS Breakdown
Loop Based Profile
Innermost Loop Based Profile
Application Categorization
Compilation Options
Source Object: Issue
libllama.so
  hashtable.h
  llama-vocab.cpp
libggml-cpu.so
  binary-ops.cpp
  vec.cpp
  mmq.cpp
  sgemm.cpp
  ops.cpp
  common.h
  ggml-cpu.c
  quants.c
libggml-base.so
(unnamed module)
  (unnamed source object): -g is missing for some functions (possibly ones added by the compiler); it is needed for more accurate reports. Other recommended flags are -O2/-O3 and -march=(target).