Help is available by moving the cursor above any
symbol or by checking MAQAO website.
- r0: ../champ/tests/CI_test/VMC-C4H6-ci1010_pVTZ-15000-dets/test_ov1_o3/
- r1: /home/kcamus/Trex/champ/champ_july2023/champ/tests/CI_test/VMC-C4H6-ci1010_pVTZ-15000-dets/champ_ifort_ov1_o3_o1m1_15kfull/
| Metric | r0 | r1 |
|---|
| Total Time (s) | 62.96 | 66.62 |
| Profiled Time (s) | 61.79 | 65.59 |
| GFLOPS | 0.0 | Not Implemented Yet |
| Time in analyzed loops (%) | 73.6 | 74.3 |
| Time in analyzed innermost loops (%) | 57.6 | 58.4 |
| Time in user code (%) | 78.9 | 79.0 |
| Compilation Options Score (%) | 100 | 100 |
| Array Access Efficiency (%) | 72.4 | 72.0 |
|
| Potential Speedups |
| Perfect Flow Complexity | 1.00 | 1.00 |
| Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 |
| No Scalar Integer | Potential Speedup | 1.18 | 1.19 |
| Nb Loops to get 80% | 18 | 19 |
| FP Vectorised | Potential Speedup | 1.23 | 1.23 |
| Nb Loops to get 80% | 26 | 25 |
| Fully Vectorised | Potential Speedup | 2.45 | 2.47 |
| Nb Loops to get 80% | 41 | 41 |
| Only FP Arithmetic | Potential Speedup | 1.50 | 1.56 |
| Nb Loops to get 80% | 39 | 38 |
| Source Object | Issue |
| ▼vmc.mov1– | |
| ▼nonloc.f– | |
| ○ | |
| ▼jassav.f– | |
| ○ | |
| ▼matinv.f90– | |
| ○ | |
| ▼optorb.f– | |
| ○ | |
| ▼optjas.f– | |
| ○ | |
| ▼determinant_psit.f– | |
| ○ | |
| ▼orbitals.f– | |
| ○ | |
| ▼optwf_sr_more.f– | |
| ○ | |
| ▼deriv_nonlpsi.f– | |
| ○ | |
| ▼gammai.f– | |
| ○ | |
| ▼hpsie.f– | |
| ○ | |
| ▼splfit.f– | |
| ○ | |
| ▼get_norbterm.f90– | |
| ○ | |
| ▼distances.f– | |
| ○ | |
| ▼optwf_sr.f90– | |
| ○ | |
| ▼jastrow4e.f– | |
| ○ | |
| ▼optci.f– | |
| ○ | |
| ▼multideterminante.f– | |
| ○ | |
| ▼multiply_slmi_mderiv.f– | |
| ○ | |
| ▼readps_gauss.f– | |
| ○ | |
| ▼deriv_nonloc.f– | |
| ○ | |
| ▼determinante.f– | |
| ○ | |
| ▼metrop_mov1_slat.f– | |
| ○ | |
| ▼jastrowe.f– | |
| ○ | |
| ▼pot_local.f– | |
| ○ | |
| ▼determinant.f– | |
| ○ | |
| ▼acuest.f– | |
| ○ | |
| ▼hpsi.f– | |
| ○ | |
| ▼scale_dist.f– | |
| ○ | |
| ▼detsav.f– | |
| ○ | |
| ▼deriv_jastrow4.f90– | |
| ○ | |
| ▼jastrow4.f– | |
| ○ | |
| ▼set_input_data.f90– | |
| ○ | |
| ▼slm.f90– | |
| ○ | |
| ▼rotqua.f– | |
| ○ | |
| ▼basis_fns.f– | |
| ○ | |
| ▼nonlpsi.f– | |
| ○ | |
| ▼determinante_psit.f– | |
| ○ | |
| ▼bxmatrices.f– | |
| ○ | |
| ▼optx_jas_ci.f– | |
| ○ | |
| ▼multideterminant.f– | |
| ○ | |
| Source Object | Issue |
| ▼vmc.mov1– | |
| ▼nonloc.f– | |
| ○ | |
| ▼jassav.f– | |
| ○ | |
| ▼matinv.f90– | |
| ○ | |
| ▼optorb.f– | |
| ○ | |
| ▼optjas.f– | |
| ○ | |
| ▼determinant_psit.f– | |
| ○ | |
| ▼orbitals.f– | |
| ○ | |
| ▼optwf_sr_more.f– | |
| ○ | |
| ▼scale_dist.f– | |
| ○ | |
| ▼gammai.f– | |
| ○ | |
| ▼hpsie.f– | |
| ○ | |
| ▼splfit.f– | |
| ○ | |
| ▼get_norbterm.f90– | |
| ○ | |
| ▼distances.f– | |
| ○ | |
| ▼optwf_sr.f90– | |
| ○ | |
| ▼jastrow4e.f– | |
| ○ | |
| ▼optci.f– | |
| ○ | |
| ▼multideterminante.f– | |
| ○ | |
| ▼multiply_slmi_mderiv.f– | |
| ○ | |
| ▼deriv_nonloc.f– | |
| ○ | |
| ▼readps_gauss.f– | |
| ○ | |
| ▼deriv_nonlpsi.f– | |
| ○ | |
| ▼bxmatrices.f– | |
| ○ | |
| ▼jastrowe.f– | |
| ○ | |
| ▼nonlpsi.f– | |
| ○ | |
| ▼hpsi.f– | |
| ○ | |
| ▼metrop_mov1_slat.f– | |
| ○ | |
| ▼basis_fns.f– | |
| ○ | |
| ▼deriv_jastrow4.f90– | |
| ○ | |
| ▼jastrow4.f– | |
| ○ | |
| ▼set_input_data.f90– | |
| ○ | |
| ▼pot_local.f– | |
| ○ | |
| ▼vmc.f– | |
| ○ | |
| ▼determinante.f– | |
| ○ | |
| ▼detsav.f– | |
| ○ | |
| ▼determinante_psit.f– | |
| ○ | |
| ▼determinant.f– | |
| ○ | |
| ▼slm.f90– | |
| ○ | |
| ▼multideterminant.f– | |
| ○ | |
| r0 | r1 |
| Application | ../../../bin/vmc.mov1 | ./../../../bin/vmc.mov1 |
| Timestamp | 2023-08-31 18:12:52 | 2023-07-03 16:33:01 |
| Experiment Type | Sequential | MPI; |
| Machine | skylake | same as r0 |
| Architecture | x86_64 | same as r0 |
| Micro Architecture | SKYLAKE | same as r0 |
| Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | same as r0 |
| Cache Size | 36608 KB | same as r0 |
| Number of Cores | 26 | same as r0 |
| Maximal Frequency | 2.1 GHz | same as r0 |
| OS Version | Linux 6.4.1-arch2-1 #1 SMP PREEMPT_DYNAMIC Tue, 04 Jul 2023 08:39:40 +0000 | Linux 6.2.12-arch1-1 #1 SMP PREEMPT_DYNAMIC Thu, 20 Apr 2023 16:11:55 +0000 |
| Architecture used during static analysis | x86_64 | same as r0 |
| Micro Architecture used during static analysis | SKYLAKE | same as r0 |
| Compilation Options |
vmc.mov1: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.7.0 Build 20220726_000000 -I/home/kcamus/comparative/champ/champ/buildO3/src/module -I/home/kcamus/comparative/champ/champ/buildO3/src/parser -I/home/kcamus/intel/oneapi/mpi/2021.7.0//include -I/home/kcamus/intel/oneapi/mpi/2021.7.0/include -DTARGET_ARCHITECTURE=\"avx512\" -DVECTORIZATION=\"avx512\" -xCORE-AVX512 -O3 -fPIC -implicitnone -finline -ip -align array64byte -fma -ftz -fno-omit-frame-pointer -g -no-pie -fpp -mcmodel=small -shared-intel -dyncom=grid3d_data,orbital_num_spl,orbital_num_lag,orbital_num_spl2,grid3d_data -D_MPI_ -DCLUSTER -fixed -132 -c -o CMakeFiles/shared_objects.dir/multideterminant.f.o | vmc.mov1: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I/home/kcamus/Trex/champ/champ_july2023/champ/buildifort/src/module -I/home/kcamus/Trex/champ/champ_july2023/champ/buildifort/src/parser -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -DTARGET_ARCHITECTURE=\"avx512\" -DVECTORIZATION=\"avx512\" -xCORE-AVX512 -O3 -fPIC -implicitnone -finline -ip -align array64byte -fma -ftz -fno-omit-frame-pointer -g -fpp -mcmodel=small -shared-intel -dyncom=grid3d_data,orbital_num_spl,orbital_num_lag,orbital_num_spl2,grid3d_data -D_MPI_ -DCLUSTER -fixed -132 -c -o CMakeFiles/shared_objects.dir/multideterminant.f.o |
| Number of processes observed | 1 | same as r0 |
| Number of threads observed | 1 | same as r0 |
| Frequency Driver | intel_cpufreq | same as r0 |
| Frequency Governor | schedutil | same as r0 |
| Huge Pages | always | same as r0 |
| Hyperthreading | off | same as r0 |
| Number of sockets | 2 | same as r0 |
| Number of cores per socket | 26 | same as r0 |
| MAQAO version | 2.17.8 | 2.17.4 |
| MAQAO build | 0639d6ed13e6e77a0ec82d15a3f0912eac9390b5::20230829-171632 | c4bfa955d5e47d9b8b38aac6a834dea51884fbad::20230627-084729-0700 |
| Comments | - | - |