* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 645735)miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 384
Tile size = 384
Number of tiles = 1
Number of electrons = 768
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 196608000 bytes (187.5 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.1018 0.1018 1 0.101776123
Total 0.6550 0.0001 1 0.655049086
Diffusion 0.2426 0.0053 5 0.048517799
Accept move 0.0015 0.0015 1913 0.000000763
Complete Updates 0.0022 0.0000 5 0.000437403
DeterminantRef::update 0.0022 0.0022 10 0.000218225
Current Gradient 0.0183 0.0019 3840 0.000004754
DeterminantRef::ratio 0.0153 0.0153 3840 0.000003982
OneBodyJastrowRef 0.0006 0.0006 3840 0.000000161
TwoBodyJastrowRef 0.0004 0.0004 3840 0.000000114
Kinetic Energy 0.0053 0.0053 5 0.001064825
OneBodyJastrowRef 0.0000 0.0000 5 0.000005436
TwoBodyJastrowRef 0.0000 0.0000 5 0.000003624
Make move 0.0174 0.0174 3840 0.000004541
New Gradient 0.1324 0.0023 3840 0.000034491
DeterminantRef::ratio 0.0037 0.0037 3840 0.000000973
DeterminantRef::spovgl 0.1147 0.0076 3840 0.000029862
Single-Particle Orbitals 0.1071 0.1071 3840 0.000027894
OneBodyJastrowRef 0.0019 0.0019 3840 0.000000484
TwoBodyJastrowRef 0.0099 0.0099 3840 0.000002577
Set active 0.0190 0.0190 3840 0.000004953
Update 0.0412 0.0012 1913 0.000021536
DeterminantRef::update 0.0311 0.0311 1913 0.000016256
OneBodyJastrowRef 0.0003 0.0003 1913 0.000000180
TwoBodyJastrowRef 0.0086 0.0086 1913 0.000004491
Initialization 0.0719 0.0285 1 0.071857929
DeterminantRef::inverse 0.0127 0.0127 2 0.006327510
DeterminantRef::spovgl 0.0282 0.0022 2 0.014075041
Single-Particle Orbitals 0.0259 0.0259 768 0.000033737
OneBodyJastrowRef 0.0003 0.0003 1 0.000308990
TwoBodyJastrowRef 0.0023 0.0023 1 0.002251148
Pseudopotential 0.3405 0.0057 5 0.068096447
Make move 0.0720 0.0720 15792 0.000004558
Value 0.2628 0.0095 15792 0.000016639
DeterminantRef::ratio 0.0036 0.0036 15792 0.000000227
DeterminantRef::spoval 0.2302 0.0045 15792 0.000014577
Single-Particle Orbitals 0.2257 0.2257 15792 0.000014295
OneBodyJastrowRef 0.0043 0.0043 15792 0.000000273
TwoBodyJastrowRef 0.0151 0.0151 15792 0.000000959
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 6.91528e+08
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.86729e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.73232e+06
* Info: Process finished (host skylake, process 645735)
Your experiment path is /home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0
To display your profiling results:
##############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-390-4082/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1693907620/tools/lprof_npsu_run_0 #
##############################################################################################################################################################################################