options

Executable Output


* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 825857)miniqmc not built from git repository

number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 
Stack timer profile
Timer                             Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                0.7570     0.7570              1       0.757001877
Total                               67.3967     0.0004              1      67.396748066
  Diffusion                         36.8496     0.0400              5       7.369915581
    Accept move                      0.1245     0.1245          15371       0.000008102
    Complete Updates                 0.2642     0.0000              5       0.052840042
      DeterminantRef::update         0.2642     0.2642             10       0.026418519
    Current Gradient                 1.5523     0.0254          30720       0.000050531
      DeterminantRef::ratio          1.5117     1.5117          30720       0.000049210
      OneBodyJastrowRef              0.0084     0.0084          30720       0.000000273
      TwoBodyJastrowRef              0.0068     0.0068          30720       0.000000223
    Kinetic Energy                   0.3721     0.3717              5       0.074417400
      OneBodyJastrowRef              0.0002     0.0002              5       0.000049257
      TwoBodyJastrowRef              0.0002     0.0002              5       0.000032616
    Make move                        1.9712     1.9712          30720       0.000064165
    New Gradient                    10.6941     0.0339          30720       0.000348116
      DeterminantRef::ratio          0.2325     0.2325          30720       0.000007570
      DeterminantRef::spovgl         9.4078     0.4918          30720       0.000306243
        Single-Particle Orbitals     8.9160     8.9160          30720       0.000290235
      OneBodyJastrowRef              0.1087     0.1087          30720       0.000003540
      TwoBodyJastrowRef              0.9111     0.9111          30720       0.000029658
    Set active                       2.0080     2.0080          30720       0.000065364
    Update                          19.8232     0.0167          15371       0.001289648
      DeterminantRef::update        18.8940    18.8940          15371       0.001229196
      OneBodyJastrowRef              0.0040     0.0040          15371       0.000000258
      TwoBodyJastrowRef              0.9085     0.9085          15371       0.000059106
  Initialization                     4.3910     1.1318              1       4.391011953
    DeterminantRef::inverse          1.2219     1.2219              2       0.610953569
    DeterminantRef::spovgl           1.8606     0.1260              2       0.930297971
      Single-Particle Orbitals       1.7346     1.7346           6144       0.000282317
    OneBodyJastrowRef                0.0191     0.0191              1       0.019071102
    TwoBodyJastrowRef                0.1576     0.1576              1       0.157649040
  Pseudopotential                   26.1557     0.1208              5       5.231142759
    Make move                        7.8382     7.8382         122580       0.000063944
    Value                           18.1967     0.1013         122580       0.000148447
      DeterminantRef::ratio          0.3323     0.3323         122580       0.000002711
      DeterminantRef::spoval        16.9324     0.2151         122580       0.000138134
        Single-Particle Orbitals    16.7173    16.7173         122580       0.000136379
      OneBodyJastrowRef              0.0996     0.0996         122580       0.000000813
      TwoBodyJastrowRef              0.7310     0.7310         122580       0.000005963

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.44124e+09
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 6.29392e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.44323e+06


* Info: Process finished (host skylake, process 825857)

Your experiment path is /home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0

To display your profiling results:
#####################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                      COMMAND                                                                       #
#####################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/orig/oneview_results_1694512752/tools/lprof_npsu_run_0  #
#####################################################################################################################################################################################

×