* Info: Detected 1 Lprof instances in skylake: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 672229)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.8.0/bin/mpiicpc
Compiler Flags: "-march=native -O3 -xSKYLAKE-AVX512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 1 threads on rank 0
0-> 0
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 64
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 1
Spatial decomp: 1 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 32 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 1 1 / 1
(Rx,Ry,Rz) R in XYZ: 1x1x1 1x1x1 / 1x1x1
(PQR) TOTAL: 1 16 / 16
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 61440 0.469
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 1572864 12.000
j_plane 1572864 12.000
k_plane 1572864 12.000
mixelem_to_fraction 4352 0.033
phi 6553600 50.000
phi_out 6553600 50.000
psi 25165824 192.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 25165824 192.000
sigt_zonal 262144 2.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 68494704 522.573
Generation Complete!
Steady State Solve
==================
iter 0: particle count=7.487488e+07, change=1.000000e+00
iter 1: particle count=1.125855e+08, change=3.349511e-01
iter 2: particle count=1.313924e+08, change=1.431351e-01
iter 3: particle count=1.407381e+08, change=6.640521e-02
iter 4: particle count=1.453680e+08, change=3.184924e-02
iter 5: particle count=1.476542e+08, change=1.548355e-02
iter 6: particle count=1.487795e+08, change=7.563193e-03
iter 7: particle count=1.493316e+08, change=3.697158e-03
iter 8: particle count=1.496017e+08, change=1.805479e-03
iter 9: particle count=1.497334e+08, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.01387
LPlusTimes 10 3.03619
LTimes 10 3.36880
Population 10 0.18320
Scattering 10 30.43471
Solve 1 41.75041
Source 10 0.03391
SweepSolver 10 4.07554
SweepSubdomain 160 3.94863
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.013874,3.036187,3.368797,0.183204,30.434708,41.750408,0.033911,4.075543,3.948633
Figures of Merit
================
Throughput: 6.027683e+06 [unknowns/(second/iteration)]
Grind time : 1.659012e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 96.88606 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 25165824
END
* Info: Process finished (host skylake, process 672229)
Your experiment path is /home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/169-391-8990/intel/Kripke/run/oneview_runs/unicore/icx_1/oneview_results_1693930893/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################