* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 854702)miniqmc not built from git repository
number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.8025 0.8025 1 0.802459272
Total 98.0507 0.0004 1 98.050725156
Diffusion 47.5266 0.0369 5 9.505317380
Accept move 0.1245 0.1245 15371 0.000008099
Complete Updates 0.2647 0.0000 5 0.052947227
DeterminantRef::update 0.2647 0.2647 10 0.026472354
Current Gradient 1.6974 0.0216 30720 0.000055253
DeterminantRef::ratio 1.6653 1.6653 30720 0.000054208
OneBodyJastrowRef 0.0062 0.0062 30720 0.000000202
TwoBodyJastrowRef 0.0043 0.0043 30720 0.000000139
Kinetic Energy 0.5543 0.5539 5 0.110864296
OneBodyJastrowRef 0.0003 0.0003 5 0.000054058
TwoBodyJastrowRef 0.0002 0.0002 5 0.000031585
Make move 6.9111 6.9111 30720 0.000224971
New Gradient 11.0330 0.0299 30720 0.000359148
DeterminantRef::ratio 0.4206 0.4206 30720 0.000013690
DeterminantRef::spovgl 8.3793 0.4516 30720 0.000272762
Single-Particle Orbitals 7.9276 7.9276 30720 0.000258061
OneBodyJastrowRef 0.2118 0.2118 30720 0.000006895
TwoBodyJastrowRef 1.9915 1.9915 30720 0.000064829
Set active 6.9355 6.9355 30720 0.000225764
Update 19.9691 0.0142 15371 0.001299143
DeterminantRef::update 18.7965 18.7965 15371 0.001222856
OneBodyJastrowRef 0.0023 0.0023 15371 0.000000149
TwoBodyJastrowRef 1.1561 1.1561 15371 0.000075214
Initialization 5.2883 2.0802 1 5.288272575
DeterminantRef::inverse 1.2411 1.2411 2 0.620569486
DeterminantRef::spovgl 1.6055 0.1162 2 0.802763216
Single-Particle Orbitals 1.4893 1.4893 6144 0.000242397
OneBodyJastrowRef 0.0364 0.0364 1 0.036377624
TwoBodyJastrowRef 0.3250 0.3250 1 0.325003089
Pseudopotential 45.2354 0.0792 5 9.047088632
Make move 27.5685 27.5685 122580 0.000224902
Value 17.5878 0.0893 122580 0.000143480
DeterminantRef::ratio 0.7658 0.7658 122580 0.000006248
DeterminantRef::spoval 15.6640 0.1962 122580 0.000127786
Single-Particle Orbitals 15.4678 15.4678 122580 0.000126186
OneBodyJastrowRef 0.1033 0.1033 122580 0.000000843
TwoBodyJastrowRef 0.9654 0.9654 122580 0.000007876
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.36539e+09
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 4.87997e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 834495
* Info: Process finished (host skylake, process 854702)
Your experiment path is /home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0
To display your profiling results:
##############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
##############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/gcc_5/oneview_results_1694529111/tools/lprof_npsu_run_0 #
##############################################################################################################################################################################################