Help is available by hovering the cursor over any symbol or by consulting the MAQAO website.

| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 1.74 E3 | 871.18 | 435.86 | 218.28 | 109.66 | 73.07 | 55.10 | 44.08 | 37.11 | 31.96 | 27.93 | |
| Max (Thread Active Time) (s) | 1.74 E3 | 871.05 | 435.78 | 218.23 | 109.62 | 73.03 | 55.06 | 44.05 | 37.07 | 31.94 | 27.88 | |
| Average Active Time (s) | 1.74 E3 | 871.01 | 435.70 | 218.14 | 109.47 | 72.97 | 54.97 | 43.96 | 36.94 | 31.77 | 27.77 | |
| Activity Ratio (%) | 100.0 | 100.0 | 100.0 | 99.9 | 99.8 | 99.9 | 99.8 | 99.8 | 99.6 | 99.5 | 99.5 | |
| Average number of active threads | 1.000 | 2.000 | 3.999 | 7.995 | 15.972 | 23.969 | 31.925 | 39.896 | 47.780 | 55.678 | 63.634 | |
| Affinity Stability (%) | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | |
| GFLOPS | 15.510 | 30.999 | 61.963 | 123.731 | 246.323 | 369.733 | 490.400 | 613.046 | 728.390 | 845.382 | 968.483 | |
| Time in analyzed loops (%) | 100.0 | 99.9 | 99.8 | 99.7 | 99.3 | 99.5 | 99.1 | 99.0 | 98.2 | 97.8 | 98.0 | |
| Time in analyzed innermost loops (%) | 100.0 | 99.9 | 99.8 | 99.7 | 99.3 | 99.5 | 99.1 | 99.0 | 98.2 | 97.8 | 98.0 | |
| Time in user code (%) | 100 | 99.9 | 99.8 | 99.8 | 99.3 | 99.5 | 99.1 | 99.0 | 98.2 | 97.8 | 98.1 | |
| Compilation Options Score (%) | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Array Access Efficiency (%) | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | 56.2 | |
| Potential Speedups | | | | | | | | | | | | |
| Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.01 | 1.02 | 1.03 | 1.02 | |
| Scalability - Gap | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.01 | 1.02 | 1.03 | 1.03 | |
| No Scalar Integer: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| No Scalar Integer: Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| FP Vectorised: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| FP Vectorised: Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Fully Vectorised: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Fully Vectorised: Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Only FP Arithmetic: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Only FP Arithmetic: Nb Loops to get 80% | 1 | 1 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | |

| Source Object | Issue |
|---|---|
| exec | |
| exec: Step10_orig.c | |
| exec: main.c | |


| | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Application | /home/eoseret/qaas/qaas_runs/178-162-9706/intel/HACCmk/run/oneview_runs/defaults/orig/exec | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2026-06-16 19:41:27 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | MPI; OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
| Machine | ip-172-31-38-240.ec2.internal | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | | | | | | | | | | | |
| Cache Size | | | | | | | | | | | |
| Number of Cores | | | | | | | | | | | |
| Maximal Frequency | 0.00 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.1.170-213.321.amzn2023.aarch64 #1 SMP Thu May 14 12:18:13 UTC 2026 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | exec: Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm -store-to-load-forwarding-conflict-detection=0 -I /home/eoseret/qaas/qaas_runs/178-162-9706/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas/qaas_runs/178-162-9706/intel/HACCmk/build/build -O3 -mcpu=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -ffast-math -MD -MT CMakeFiles/HACCmk.dir/src/Step10_orig.c.o -MF CMakeFiles/HACCmk.dir/src/Step10_orig.c.o.d -o CMakeFiles/HACCmk.dir/src/Step10_orig.c.o -c /home/eoseret/qaas/qaas_runs/178-162-9706/intel/HACCmk/build/HACCmk/src/Step10_orig.c | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 24 | 32 | 40 | 48 | 56 | 64 |
| Frequency Driver | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | OV scalability run using armclang | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
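
The scaling behaviour in the tables above can be summarised as a speedup and a parallel efficiency per run. The sketch below is not part of the MAQAO output. It copies the "Total Time (s)", "GFLOPS" and "Number of threads observed" rows by hand and uses r0 as the baseline. The helper name `scaling` and the variable names are illustrative. The r0 total time is reported only to three significant digits (1.74 E3), so the time-based figures are approximate; the GFLOPS-based figures serve as a cross-check.

```python
# Values copied from the "Total Time (s)", "GFLOPS" and
# "Number of threads observed" rows of the report (runs r0..r10).
threads = [1, 2, 4, 8, 16, 24, 32, 40, 48, 56, 64]
total_time_s = [1.74e3, 871.18, 435.86, 218.28, 109.66, 73.07,
                55.10, 44.08, 37.11, 31.96, 27.93]
gflops = [15.510, 30.999, 61.963, 123.731, 246.323, 369.733,
          490.400, 613.046, 728.390, 845.382, 968.483]


def scaling(values, thread_counts, higher_is_better):
    """Return (speedups, efficiencies) relative to the first run.

    For a time-like metric (lower is better) the speedup is base / value;
    for a rate-like metric (higher is better) it is value / base.
    Efficiency is the speedup divided by the increase in thread count.
    """
    base = values[0]
    base_threads = thread_counts[0]
    speedups = [(v / base) if higher_is_better else (base / v) for v in values]
    efficiencies = [s * base_threads / n for s, n in zip(speedups, thread_counts)]
    return speedups, efficiencies


time_speedup, time_eff = scaling(total_time_s, threads, higher_is_better=False)
rate_speedup, rate_eff = scaling(gflops, threads, higher_is_better=True)

for i, n in enumerate(threads):
    print(f"r{i:<2} threads={n:2d}  "
          f"time speedup={time_speedup[i]:6.2f} (eff {time_eff[i]:6.1%})  "
          f"GFLOPS speedup={rate_speedup[i]:6.2f} (eff {rate_eff[i]:6.1%})")
```

With these inputs, both estimates give a speedup of roughly 62 on 64 threads, which is about 97% parallel efficiency relative to the single-thread run.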