Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|---|
Total Time (s) | 92.36 | 196.16 | 404.04 | 810.03 | 1.65 E3 | 3.43 E3 | 5.08 E3 | |
Profiled Time (s) | 41.29 | 45.14 | 45.83 | 45.90 | 46.70 | 48.49 | 47.92 | |
Time in analyzed loops (%) | 40.4 | 41.3 | 37.3 | 36.7 | 36.9 | 38.6 | 37.9 | |
Time in analyzed innermost loops (%) | 40.2 | 41.1 | 37.1 | 36.5 | 36.7 | 38.5 | 37.7 | |
Time in user code (%) | 40.6 | 41.6 | 37.6 | 37.1 | 37.2 | 39.0 | 38.3 | |
Compilation Options Score (%) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
Array Access Efficiency (%) | 93.5 | 93.1 | 93.1 | 92.9 | 93.0 | 93.0 | 93.0 | |
Potential Speedups | ||||||||
Perfect Flow Complexity | 1.02 | 1.02 | 1.02 | 1.02 | 1.02 | 1.02 | 1.02 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.01 | 1.05 | 1.02 | 1.02 | 1.03 | 1.10 | 1.12 | |
No Scalar Integer | Potential Speedup | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 |
Nb Loops to get 80% | 3 | 3 | 3 | 3 | 3 | 3 | 3 | |
FP Vectorised | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Nb Loops to get 80% | 3 | 4 | 4 | 4 | 3 | 3 | 4 | |
Fully Vectorised | Potential Speedup | 1.05 | 1.05 | 1.05 | 1.05 | 1.05 | 1.05 | 1.05 |
Nb Loops to get 80% | 4 | 4 | 4 | 4 | 4 | 4 | 4 | |
Only FP Arithmetic | Potential Speedup | 1.04 | 1.04 | 1.04 | 1.04 | 1.04 | 1.04 | 1.04 |
Nb Loops to get 80% | 7 | 7 | 7 | 6 | 7 | 6 | 7 |
Source Object | Issue |
---|---|
▼[vdso] | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libqmcwfs.so | |
▼WaveFunction.cpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼BsplineAllocator.hpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼OhmmsVector.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼TwoBodyJastrowRef.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼DiracDeterminantRef.cpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼einspline_spo_ref.hpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼SPOSet.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼OneBodyJastrowRef.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼DelayedUpdate.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼DiracMatrix.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼BsplineFunctor.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libplatform_cpu_LA.so | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼exec | |
▼miniqmc.cpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libqmcparticle.so | |
▼ParticleSet.cpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libqmcutil.so | |
▼NewTimer.cpp | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libqmcparticle_omptarget.so | |
▼ParticleBConds3DSoa.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼SoaDistanceTableAAOMPTarget.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼SoaDistanceTableABOMPTarget.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼OhmmsVector.h | |
○ | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|
Application | /home/kcamus/qaas_runs/170-254-9426/intel/miniqmc/run/binaries/clang_14/exec | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2023-12-14 18:51:44 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | MPI; OpenMP; | same as r5 |
Machine | ip-172-31-68-94 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ZEN_V4 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | AMD EPYC 9R14 96-Core Processor | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Cache Size | 1024 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of Cores | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Maximal Frequency | 3.701953 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 6.2.0-1017-aws #17~22.04.1-Ubuntu SMP Fri Nov 17 21:07:13 UTC 2023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ZEN_V4 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | libqmcwfs.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcutil.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) + [vdso]: N/A libqmcparticle_omptarget.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcparticle.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libplatform_cpu_LA.so: N/A libqmcwfs.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcutil.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) + [vdso]: N/A libqmcparticle_omptarget.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcparticle.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libplatform_cpu_LA.so: N/A libqmcwfs.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcutil.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) + [vdso]: N/A libqmcparticle_omptarget.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcparticle.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | same as r1 | libplatform_cpu_LA.so: N/A libqmcwfs.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcutil.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) + [vdso]: N/A libqmcparticle.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcparticle_omptarget.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libplatform_cpu_LA.so: N/A libqmcwfs.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcutil.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) libqmcparticle.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) + [vdso]: N/A libqmcparticle_omptarget.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | same as r5 |
Number of processes observed | 4 | 8 | 16 | 32 | 64 | 128 | 192 |
Number of threads observed | 4 | 8 | 16 | 32 | 64 | 137 | 210 |
Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.18.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | 535ba85dde2e553c697e5c5ade9cf3d067862478::20231208-181925 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | - | - | - | - | - | - | - |