| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 | r11 | r12 | r13 | r14 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 295.90 | 162.05 | 89.47 | 55.87 | 41.14 | 37.74 | 37.45 | 37.59 | 37.82 | 37.46 | 37.86 | 38.21 | 38.99 | 39.91 | 40.61 | |
| Max (Thread Active Time) (s) | 295.77 | 161.95 | 89.33 | 55.67 | 40.66 | 37.03 | 36.24 | 37.00 | 36.68 | 33.81 | 35.51 | 35.34 | 35.25 | 35.49 | 35.86 | |
| Average Active Time (s) | 295.77 | 157.53 | 82.54 | 47.54 | 31.69 | 27.64 | 26.69 | 26.26 | 25.87 | 24.75 | 24.56 | 24.35 | 24.50 | 24.71 | 24.79 | |
| Activity Ratio (%) | 100.0 | 97.2 | 92.3 | 85.1 | 77.1 | 73.3 | 71.4 | 70.0 | 68.5 | 66.2 | 65.0 | 63.9 | 63.0 | 62.0 | 61.2 | |
| Average number of active threads | 1.000 | 1.944 | 3.690 | 6.807 | 12.325 | 17.576 | 22.811 | 27.948 | 32.831 | 37.004 | 41.516 | 45.887 | 50.272 | 54.482 | 58.608 | |
| Affinity Stability (%) | 100.0 | 99.5 | 98.6 | 97.3 | 96.3 | 95.9 | 95.6 | 95.5 | 95.5 | 95.6 | 95.7 | 95.7 | 95.6 | 95.7 | 95.8 | |
| GFLOPS | 2.108 | 3.920 | 7.106 | 11.405 | 15.615 | 17.143 | 17.512 | 17.147 | 17.293 | 18.762 | 17.868 | 17.962 | 18.006 | 17.887 | 17.708 | |
| Time in analyzed loops (%) | 91.7 | 90.0 | 88.3 | 84.2 | 75.1 | 69.3 | 66.2 | 64.2 | 61.3 | 58.6 | 55.6 | 53.7 | 51.9 | 50.6 | 49.7 | |
| Time in analyzed innermost loops (%) | 74.8 | 68.5 | 67.6 | 65.9 | 60.8 | 57.8 | 56.8 | 56.3 | 54.2 | 52.1 | 49.6 | 48.2 | 46.7 | 45.7 | 45.0 | |
| Time in user code (%) | 92.0 | 90.3 | 88.5 | 84.3 | 75.2 | 69.4 | 66.2 | 64.3 | 61.3 | 58.6 | 55.6 | 53.7 | 51.9 | 50.6 | 49.7 | |
| Compilation Options Score (%) | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| Array Access Efficiency (%) | 63.8 | 62.8 | 63.0 | 63.5 | 64.3 | 64.9 | 65.3 | 65.6 | 65.6 | 65.8 | 65.8 | 65.9 | 66.0 | 66.0 | 66.1 | |
| Potential Speedups | ||||||||||||||||
| Perfect Flow Complexity | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.00 | 1.01 | 1.02 | 1.05 | 1.07 | 1.06 | 1.12 | 1.12 | 1.02 | 1.13 | 1.13 | 1.12 | 1.12 | 1.13 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.04 | 1.11 | 1.24 | 1.42 | 1.53 | 1.56 | 1.62 | 1.64 | 1.59 | 1.69 | 1.70 | 1.69 | 1.69 | 1.70 | |
| Scalability - Gap | 1.00 | 0.55 | 0.30 | 0.19 | 0.14 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.13 | 0.14 | |
| No Scalar Integer: Potential Speedup | 1.11 | 1.12 | 1.12 | 1.10 | 1.08 | 1.06 | 1.05 | 1.04 | 1.04 | 1.03 | 1.03 | 1.03 | 1.03 | 1.02 | 1.02 | |
| No Scalar Integer: Nb Loops to get 80% | 11 | 11 | 11 | 12 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | |
| FP Vectorised: Potential Speedup | 1.22 | 1.24 | 1.23 | 1.23 | 1.21 | 1.21 | 1.21 | 1.21 | 1.20 | 1.19 | 1.18 | 1.18 | 1.17 | 1.17 | 1.17 | |
| FP Vectorised: Nb Loops to get 80% | 4 | 6 | 6 | 6 | 5 | 5 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | |
| Fully Vectorised: Potential Speedup | 1.56 | 1.59 | 1.57 | 1.51 | 1.41 | 1.35 | 1.31 | 1.29 | 1.27 | 1.25 | 1.23 | 1.22 | 1.21 | 1.21 | 1.20 | |
| Fully Vectorised: Nb Loops to get 80% | 26 | 28 | 28 | 27 | 25 | 23 | 21 | 19 | 17 | 17 | 16 | 16 | 15 | 15 | 15 | |
| Only FP Arithmetic: Potential Speedup | 1.31 | 1.30 | 1.29 | 1.25 | 1.19 | 1.15 | 1.12 | 1.10 | 1.09 | 1.08 | 1.07 | 1.07 | 1.06 | 1.06 | 1.06 | |
| Only FP Arithmetic: Nb Loops to get 80% | 28 | 28 | 28 | 28 | 28 | 27 | 26 | 26 | 26 | 26 | 26 | 26 | 25 | 25 | 25 | |
| Source Object | Issue |
|---|---|
| exec | |
| IJVector_parcsr.c | |
| amg.c | |
| csr_matrix.c | |
| par_strength.c | |
| random.c | |
| par_lr_interp.c | |
| vector.c | |
| par_multi_interp.c | |
| csr_matvec.c | |
| IJMatrix_parcsr.c | |
| par_coarsen.c | |
| csr_matop.c | |
| par_csr_matop.c | |
| par_coarse_parms.c | |
| par_interp.c | |
| ams.c | |
| Parameter | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 | r11 | r12 | r13 | r14 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Application | /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/run/oneview_runs/defaults/orig/exec | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2026-06-16 19:01:22 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | MPI; OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
| Machine | ip-172-31-9-132.ec2.internal | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | |||||||||||||||
| Cache Size | |||||||||||||||
| Number of Cores | |||||||||||||||
| Maximal Frequency | 0.00 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.1.170-213.321.amzn2023.aarch64 #1 SMP Thu May 14 12:18:13 UTC 2026 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | exec: Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm -store-to-load-forwarding-conflict-detection=0 -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/utilities -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/IJ_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/krylov -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/seq_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG -O3 -mcpu=native -Wno-error=implicit-function-declaration -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/seq_mv.dir/AMG/seq_mv/csr_matvec.c.o -MF CMakeFiles/seq_mv.dir/AMG/seq_mv/csr_matvec.c.o.d -o CMakeFiles/seq_mv.dir/AMG/seq_mv/csr_matvec.c.o -c /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/seq_mv/csr_matvec.c -I /home/eoseret/tools/mpi/openmpi-armclang-22.1/include | exec: Arm Toolchain for Linux 22.1.0 clang version 22.1.0 (https://github.com/arm/arm-toolchain.git c95792353373404441df364b5a762338e5642230) /opt/arm/arm-toolchain-for-linux/bin/clang-22 -frtlib-add-rpath -fveclib=ArmPL -mllvm -gvn-add-phi-translation=1 -mllvm -store-to-load-forwarding-conflict-detection=0 -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/utilities -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/parcsr_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/parcsr_ls -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/IJ_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/krylov -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/seq_mv -I /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG -O3 -mcpu=native -Wno-error=implicit-function-declaration -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-command-line -fopenmp=libomp -D TIMER_USE_MPI -D HYPRE_USING_OPENMP -D HYPRE_HOPSCOTCH -D HYPRE_USING_PERSISTENT_COMM -D HYPRE_BIGINT -MD -MT CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -MF CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o.d -o CMakeFiles/parcsr_ls.dir/AMG/parcsr_ls/ams.c.o -c /home/eoseret/qaas/qaas_runs/178-162-9307/intel/AMG/build/AMG/AMG/parcsr_ls/ams.c -I /home/eoseret/tools/mpi/openmpi-armclang-22.1/include | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 24 | 32 | 40 | 48 | 56 | 64 | 72 | 80 | 88 | 96 |
| Frequency Driver | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | OV scalability run using armclang | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
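
The "Total Time (s)" and "Number of threads observed" rows can be combined into the usual strong-scaling figures. The sketch below is not part of the MAQAO output: it copies those two rows by hand and computes speedup and parallel efficiency relative to r0. The function name `scaling_summary` and the tuple layout are illustrative assumptions. On this data, the inverse of the computed speedup appears to agree with the "Scalability - Gap" row to two decimals, though whether MAQAO defines that row this way is an assumption.

```python
# Values copied by hand from the report tables above.
total_time_s = [295.90, 162.05, 89.47, 55.87, 41.14, 37.74, 37.45, 37.59,
                37.82, 37.46, 37.86, 38.21, 38.99, 39.91, 40.61]
threads = [1, 2, 4, 8, 16, 24, 32, 40, 48, 56, 64, 72, 80, 88, 96]


def scaling_summary(times, thread_counts):
    """Return (run, threads, speedup, efficiency) tuples relative to the first run.

    speedup    = T(first run) / T(run)
    efficiency = speedup / (threads(run) / threads(first run))
    """
    baseline_time = times[0]
    baseline_threads = thread_counts[0]
    rows = []
    for run, (t, n) in enumerate(zip(times, thread_counts)):
        speedup = baseline_time / t
        efficiency = speedup / (n / baseline_threads)
        rows.append((f"r{run}", n, speedup, efficiency))
    return rows


summary = scaling_summary(total_time_s, threads)

# Run with the highest speedup, i.e. the shortest total time.
best = max(summary, key=lambda row: row[2])

for run, n, speedup, efficiency in summary:
    print(f"{run:>3}  threads={n:>2}  speedup={speedup:5.2f}  "
          f"efficiency={efficiency:6.1%}")
print(f"best run: {best[0]} with {best[1]} threads")
```

Reading the output alongside the "Activity Ratio (%)" row helps separate two effects: time lost because threads sit idle, and time that does not shrink even while threads are active.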