* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 854583)miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.7474 0.7474 1 0.747354031
Total 60.2357 0.0004 1 60.235708952
Diffusion 34.6383 0.0559 5 6.927662039
Accept move 0.1292 0.1292 15371 0.000008408
Complete Updates 0.2596 0.0000 5 0.051922417
DeterminantRef::update 0.2596 0.2596 10 0.025960112
Current Gradient 1.5672 0.0263 30720 0.000051016
DeterminantRef::ratio 1.5252 1.5252 30720 0.000049647
OneBodyJastrowRef 0.0094 0.0094 30720 0.000000305
TwoBodyJastrowRef 0.0064 0.0064 30720 0.000000207
Kinetic Energy 0.3826 0.3821 5 0.076518440
OneBodyJastrowRef 0.0002 0.0002 5 0.000042582
TwoBodyJastrowRef 0.0002 0.0002 5 0.000047207
Make move 1.1054 1.1054 30720 0.000035984
New Gradient 9.7099 0.0325 30720 0.000316076
DeterminantRef::ratio 0.2399 0.2399 30720 0.000007810
DeterminantRef::spovgl 8.5806 0.4953 30720 0.000279318
Single-Particle Orbitals 8.0853 8.0853 30720 0.000263194
OneBodyJastrowRef 0.0869 0.0869 30720 0.000002830
TwoBodyJastrowRef 0.7699 0.7699 30720 0.000025062
Set active 1.9370 1.9370 30720 0.000063054
Update 19.4915 0.0150 15371 0.001268067
DeterminantRef::update 18.6384 18.6384 15371 0.001212568
OneBodyJastrowRef 0.0045 0.0045 15371 0.000000293
TwoBodyJastrowRef 0.8336 0.8336 15371 0.000054229
Initialization 4.1711 1.1345 1 4.171133041
DeterminantRef::inverse 1.2190 1.2190 2 0.609495044
DeterminantRef::spovgl 1.6756 0.1286 2 0.837776065
Single-Particle Orbitals 1.5469 1.5469 6144 0.000251777
OneBodyJastrowRef 0.0144 0.0144 1 0.014414787
TwoBodyJastrowRef 0.1277 0.1277 1 0.127665997
Pseudopotential 21.4258 0.0976 5 4.285166025
Make move 4.3783 4.3783 122580 0.000035718
Value 16.9499 0.1061 122580 0.000138276
DeterminantRef::ratio 0.3332 0.3332 122580 0.000002718
DeterminantRef::spoval 15.8337 0.2100 122580 0.000129171
Single-Particle Orbitals 15.6237 15.6237 122580 0.000127457
OneBodyJastrowRef 0.0911 0.0911 122580 0.000000743
TwoBodyJastrowRef 0.5858 0.5858 122580 0.000004779
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.85034e+09
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 6.69571e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.76183e+06
* Info: Process finished (host skylake, process 854583)
Your experiment path is /home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0
To display your profiling results:
##############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0 #
##############################################################################################################################################################################################