Executable Output


* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting

* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 854583)
miniqmc not built from git repository

number of ranks : 1, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 1
OpenMP threads = 1
Number of walkers per rank = 1

SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow, 
determinant update, and distance table + einspline of the 
reference implementation 
================================== 
Stack timer profile
Timer                             Inclusive_time  Exclusive_time  Calls       Time_per_call
Setup                                0.7474     0.7474              1       0.747354031
Total                               60.2357     0.0004              1      60.235708952
  Diffusion                         34.6383     0.0559              5       6.927662039
    Accept move                      0.1292     0.1292          15371       0.000008408
    Complete Updates                 0.2596     0.0000              5       0.051922417
      DeterminantRef::update         0.2596     0.2596             10       0.025960112
    Current Gradient                 1.5672     0.0263          30720       0.000051016
      DeterminantRef::ratio          1.5252     1.5252          30720       0.000049647
      OneBodyJastrowRef              0.0094     0.0094          30720       0.000000305
      TwoBodyJastrowRef              0.0064     0.0064          30720       0.000000207
    Kinetic Energy                   0.3826     0.3821              5       0.076518440
      OneBodyJastrowRef              0.0002     0.0002              5       0.000042582
      TwoBodyJastrowRef              0.0002     0.0002              5       0.000047207
    Make move                        1.1054     1.1054          30720       0.000035984
    New Gradient                     9.7099     0.0325          30720       0.000316076
      DeterminantRef::ratio          0.2399     0.2399          30720       0.000007810
      DeterminantRef::spovgl         8.5806     0.4953          30720       0.000279318
        Single-Particle Orbitals     8.0853     8.0853          30720       0.000263194
      OneBodyJastrowRef              0.0869     0.0869          30720       0.000002830
      TwoBodyJastrowRef              0.7699     0.7699          30720       0.000025062
    Set active                       1.9370     1.9370          30720       0.000063054
    Update                          19.4915     0.0150          15371       0.001268067
      DeterminantRef::update        18.6384    18.6384          15371       0.001212568
      OneBodyJastrowRef              0.0045     0.0045          15371       0.000000293
      TwoBodyJastrowRef              0.8336     0.8336          15371       0.000054229
  Initialization                     4.1711     1.1345              1       4.171133041
    DeterminantRef::inverse          1.2190     1.2190              2       0.609495044
    DeterminantRef::spovgl           1.6756     0.1286              2       0.837776065
      Single-Particle Orbitals       1.5469     1.5469           6144       0.000251777
    OneBodyJastrowRef                0.0144     0.0144              1       0.014414787
    TwoBodyJastrowRef                0.1277     0.1277              1       0.127665997
  Pseudopotential                   21.4258     0.0976              5       4.285166025
    Make move                        4.3783     4.3783         122580       0.000035718
    Value                           16.9499     0.1061         122580       0.000138276
      DeterminantRef::ratio          0.3332     0.3332         122580       0.000002718
      DeterminantRef::spoval        15.8337     0.2100         122580       0.000129171
        Single-Particle Orbitals    15.6237    15.6237         122580       0.000127457
      OneBodyJastrowRef              0.0911     0.0911         122580       0.000000743
      TwoBodyJastrowRef              0.5858     0.5858         122580       0.000004779
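
As a reading aid for the stack timer profile above: each timer's inclusive time should equal its own exclusive time plus the inclusive times of its direct children. The sketch below checks this for two rows, using values copied from the table. The dictionary layout and the helper name `rebuilt_inclusive` are illustrative assumptions, not part of miniqmc or MAQAO.

```python
# Values copied from the stack timer profile (seconds).
# Each entry: (inclusive, exclusive, [inclusive times of direct children]).
timers = {
    # children: DeterminantRef::ratio, DeterminantRef::spovgl,
    #           OneBodyJastrowRef, TwoBodyJastrowRef
    "New Gradient": (9.7099, 0.0325, [0.2399, 8.5806, 0.0869, 0.7699]),
    # children: DeterminantRef::update, OneBodyJastrowRef, TwoBodyJastrowRef
    "Update": (19.4915, 0.0150, [18.6384, 0.0045, 0.8336]),
}

def rebuilt_inclusive(exclusive, children):
    """Hypothetical helper: exclusive time plus the children's inclusive times."""
    return exclusive + sum(children)

for name, (inclusive, exclusive, children) in timers.items():
    rebuilt = rebuilt_inclusive(exclusive, children)
    # The table is rounded to 4 decimals, so allow a small discrepancy.
    print(f"{name}: reported {inclusive:.4f}, rebuilt {rebuilt:.4f}")
```

The same relation explains why `Total` has an exclusive time of only 0.0004 s: nearly all of its 60.2357 s is accounted for by `Diffusion`, `Initialization`, and `Pseudopotential`.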

========== Throughput ============ 

Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.85034e+09
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 6.69571e+09
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.76183e+06
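
The three throughput figures above can be recomputed from the formulas printed with them and the values reported earlier in this output (1 walker per rank, 6144 electrons, and the inclusive times for Total, Diffusion, and Pseudopotential). The following is a minimal sketch of that arithmetic. The function name `throughput` is an illustrative assumption, and because the timer values are rounded to four decimals, the results agree with the reported figures only to about five significant digits.

```python
# Values copied from the run output above.
n_walkers = 1                    # "Number of walkers per rank"
n_elec = 6144                    # "Number of electrons"
total_time = 60.2357             # inclusive time of "Total" (s)
diffusion_time = 34.6383         # inclusive time of "Diffusion" (s)
pseudopotential_time = 21.4258   # inclusive time of "Pseudopotential" (s)

def throughput(walkers, electrons, power, seconds):
    """Hypothetical helper: N_walkers * N_elec^power / time."""
    return walkers * electrons ** power / seconds

total_tp = throughput(n_walkers, n_elec, 3, total_time)
diffusion_tp = throughput(n_walkers, n_elec, 3, diffusion_time)
pseudo_tp = throughput(n_walkers, n_elec, 2, pseudopotential_time)

print(f"Total throughput           = {total_tp:.5e}")
print(f"Diffusion throughput       = {diffusion_tp:.5e}")
print(f"Pseudopotential throughput = {pseudo_tp:.5e}")
```

Note that the pseudopotential figure scales with N_elec^2 rather than N_elec^3, so it is not directly comparable with the other two.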


* Info: Process finished (host skylake, process 854583)

Your experiment path is /home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0

To display your profiling results:
##############################################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                                           COMMAND                                                                           #
##############################################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-451-1869/intel/miniqmc/run/oneview_runs/unicore/icx_6/oneview_results_1694528831/tools/lprof_npsu_run_0  #
##############################################################################################################################################################################################
