* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 825857)miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.7570 0.7570 1 0.757001877
Total 67.3967 0.0004 1 67.396748066
Diffusion 36.8496 0.0400 5 7.369915581
Accept move 0.1245 0.1245 15371 0.000008102
Complete Updates 0.2642 0.0000 5 0.052840042
DeterminantRef::update 0.2642 0.2642 10 0.026418519
Current Gradient 1.5523 0.0254 30720 0.000050531
DeterminantRef::ratio 1.5117 1.5117 30720 0.000049210
OneBodyJastrowRef 0.0084 0.0084 30720 0.000000273
TwoBodyJastrowRef 0.0068 0.0068 30720 0.000000223
Kinetic Energy 0.3721 0.3717 5 0.074417400
OneBodyJastrowRef 0.0002 0.0002 5 0.000049257
TwoBodyJastrowRef 0.0002 0.0002 5 0.000032616
Make move 1.9712 1.9712 30720 0.000064165
New Gradient 10.6941 0.0339 30720 0.000348116
DeterminantRef::ratio 0.2325 0.2325 30720 0.000007570
DeterminantRef::spovgl 9.4078 0.4918 30720 0.000306243
Single-Particle Orbitals 8.9160 8.9160 30720 0.000290235
OneBodyJastrowRef 0.1087 0.1087 30720 0.000003540
TwoBodyJastrowRef 0.9111 0.9111 30720 0.000029658
Set active 2.0080 2.0080 30720 0.000065364
Update 19.8232 0.0167 15371 0.001289648
DeterminantRef::update 18.8940 18.8940 15371 0.001229196
OneBodyJastrowRef 0.0040 0.0040 15371 0.000000258
TwoBodyJastrowRef 0.9085 0.9085 15371 0.000059106
Initialization 4.3910 1.1318 1 4.391011953
DeterminantRef::inverse 1.2219 1.2219 2 0.610953569
DeterminantRef::spovgl 1.8606 0.1260 2 0.930297971
Single-Particle Orbitals 1.7346 1.7346 6144 0.000282317
OneBodyJastrowRef 0.0191 0.0191 1 0.019071102
TwoBodyJastrowRef 0.1576 0.1576 1 0.157649040
Pseudopotential 26.1557 0.1208 5 5.231142759
Make move 7.8382 7.8382 122580 0.000063944
Value 18.1967 0.1013 122580 0.000148447
DeterminantRef::ratio 0.3323 0.3323 122580 0.000002711
DeterminantRef::spoval 16.9324 0.2151 122580 0.000138134
Single-Particle Orbitals 16.7173 16.7173 122580 0.000136379
OneBodyJastrowRef 0.0996 0.0996 122580 0.000000813
TwoBodyJastrowRef 0.7310 0.7310 122580 0.000005963
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.44124e+09
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 6.29392e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.44323e+06
* Info: Process finished (host skylake, process 825857)
Your experiment path is /home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0
To display your profiling results:
#####################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#####################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0 #
#####################################################################################################################################################################################