- r0: run_0
- r1: n2
- r2: n4
- r3: n8
Metric | r0 | r1 | r2 | r3 |
---|---|---|---|---|
Total Time (s) | 80.13 | 44.27 | 23.73 | 15.25 |
Max (Thread Active Time) (s) | 76.37 | 38.76 | 20.75 | 12.38 |
Average Active Time (s) | 76.21 | 38.52 | 20.64 | 12.28 |
Activity Ratio (%) | 95.1 | 89.6 | 87.0 | 80.7 |
Average number of active threads | 34.238 | 62.640 | 125.247 | 231.890 |
Affinity Stability (%) | 96.6 | 91.4 | 89.2 | 83.0 |
Time in analyzed loops (%) | 82.2 | 80.3 | 75.1 | 64.4 |
Time in analyzed innermost loops (%) | 53.3 | 51.6 | 48.2 | 41.7 |
Time in user code (%) | 80.1 | 78.2 | 73.2 | 62.9 |
Compilation Options Score (%) | 100 | 100 | 100 | 100 |
Array Access Efficiency (%) | 80.3 | 80.3 | 80.4 | 80.5 |

Potential Speedups

Metric | r0 | r1 | r2 | r3 |
---|---|---|---|---|
Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 |
Perfect OpenMP + MPI + Pthread | 1.03 | 1.05 | 1.08 | 1.10 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.06 | 1.07 | 1.11 | 1.26 |
Scalability - Gap | 1.00 | 1.11 | 1.18 | 1.52 |
No Scalar Integer: Potential Speedup | 1.24 | 1.23 | 1.21 | 1.17 |
No Scalar Integer: Nb Loops to get 80% | 25 | 25 | 25 | 25 |
FP Vectorised: Potential Speedup | 1.18 | 1.18 | 1.17 | 1.14 |
FP Vectorised: Nb Loops to get 80% | 34 | 34 | 34 | 33 |
Fully Vectorised: Potential Speedup | 2.34 | 2.29 | 2.12 | 1.82 |
Fully Vectorised: Nb Loops to get 80% | 41 | 41 | 41 | 41 |
Only FP Arithmetic: Potential Speedup | 1.72 | 1.69 | 1.61 | 1.48 |
Only FP Arithmetic: Nb Loops to get 80% | 41 | 41 | 41 | 41 |
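The scaling behaviour implied by the Total Time (s) row can be checked by hand. The sketch below is not part of the MAQAO output: it assumes that all four runs solve the same case (strong scaling), takes r0 as the baseline, and uses the textbook definitions speedup = T(r0) / T(run) and efficiency = speedup / (processes(run) / processes(r0)). The process counts are those listed under "Number of processes observed" in the configuration table; the function name `scaling_table` is purely illustrative.

```python
# Values copied from the "Total Time (s)" and
# "Number of processes observed" rows of this report.
total_time_s = {"r0": 80.13, "r1": 44.27, "r2": 23.73, "r3": 15.25}
processes = {"r0": 36, "r1": 72, "r2": 144, "r3": 288}


def scaling_table(times, procs, baseline="r0"):
    """Return {run: (speedup, efficiency)} relative to the baseline run.

    Assumes strong scaling: every run solves the same problem.
    """
    t0, p0 = times[baseline], procs[baseline]
    table = {}
    for run in times:
        speedup = t0 / times[run]      # how much faster than the baseline
        ideal = procs[run] / p0        # speedup expected from perfect scaling
        table[run] = (speedup, speedup / ideal)
    return table


scaling = scaling_table(total_time_s, processes)
for run, (speedup, efficiency) in scaling.items():
    print(f"{run}: speedup {speedup:.2f}, efficiency {efficiency:.1%}, "
          f"1/efficiency {1 / efficiency:.2f}")
```

Under these assumptions the speedups over r0 come out at roughly 1.81, 3.38 and 5.25 for r1, r2 and r3, that is, parallel efficiencies of about 90.5%, 84.4% and 65.7%. The reciprocal of the efficiency lands close to the Scalability - Gap row (1.11, 1.18, 1.52), which suggests, without confirming, that the gap is defined along these lines.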
Source Object | Issue |
▼AVBP_V7_dev.KRAKEN– | |
▼cons_tens.f90– | |
○ | |
▼get_Y.f90– | |
○ | |
▼gradqen.f90– | |
○ | |
▼laxwe.f90– | |
○ | |
▼grad_4obj.f90– | |
○ | |
▼nsflux_les.f90– | |
○ | |
▼mod_adj_graph.f90– | |
○ | |
▼ns_timestep.f90– | |
○ | |
▼euler_timestep.f90– | |
○ | |
▼mod_pmesh_transfer.f90– | |
○ | |
▼avis_lp_rre.f90– | |
○ | |
▼specsource_cell.f90– | |
○ | |
▼add_source.f90– | |
○ | |
▼specflux_visc_c_nv.f90– | |
○ | |
▼heatflux_nv2.f90– | |
○ | |
▼update_rho.f90– | |
○ | |
▼temperature.f90– | |
○ | |
▼mod_pmesh_scatter_add.f90– | |
○ | |
▼div.f90– | |
○ | |
▼FE_add_dw.f90– | |
○ | |
▼specflux_invc.f90– | |
○ | |
▼ave.f90– | |
○ | |
▼scatter_o_sub.f90– | |
○ | |
▼scatter_grad.f90– | |
○ | |
▼compute_FE_implicit_residual.f90– | |
○ | |
▼efcy_dyn.f90– | |
○ | |
▼update.f90– | |
○ | |
▼rot_2delta.f90– | |
○ | |
▼dlongc.f90– | |
○ | |
▼rrate_cell.f90– | |
○ | |
▼scatter_o_add.f90– | |
○ | |
▼savis_Colin_spec.f90– | |
○ | |
▼wale_cell.f90– | |
○ | |
▼mass_product.f90– | |
○ | |
▼gather_o_cpy.f90– | |
○ | |
▼calc_visc_eff.f90– | |
○ | |
▼specsource_ener.f90– | |
○ | |
▼thermo_variables.f90– | |
○ | |
▼prebound.f90– | |
○ | |
▼central.f90– | |
○ | |
▼central_nv.f90– | |
○ | |
▼scheme.f90– | |
○ | |
▼savis_Colin_NS.f90– | |
○ | |
▼scale.f90– | |
○ | |
▼boxe_2delta.f90– | |
○ | |
▼calc_diffus.f90– | |
○ | |
▼correct_central_bnd_generic.f90– | |
○ | |
▼stress_nv2.f90– | |
○ | |
▼eflux.f90– | |
○ | |
▼wtowp.f90– | |
○ | |
▼avis_lp.f90– | |
○ | |
▼mod_copy.f90– | |
○ | |
▼get_uvwT.f90– | |
○ | |
▼velocity_group.f90– | |
○ | |
▼cons_tens_cell.f90– | |
○ | |
▼scatter_add.f90– | |
○ | |
▼compute_diffus_max.f90– | |
○ | |
 | r0 | r1 | r2 | r3 |
---|---|---|---|---|
Experiment Name | | | | |
Application | /home/exter/camus/avbp-dev/HOST/KRAKEN/BIN/AVBP_V7_dev.KRAKEN | same as r0 | same as r0 | same as r0 |
Timestamp | 2025-02-06 15:40:11 | same as r0 | same as r0 | same as r0 |
Experiment Type | MPI; | same as r0 | same as r0 | same as r0 |
Machine | node177,node182,node178,node183,node179,node180,node184,node181 | same as r0 | same as r0 | same as r0 |
Architecture | x86_64 | same as r0 | same as r0 | same as r0 |
Micro Architecture | SKYLAKE | same as r0 | same as r0 | same as r0 |
Model Name | Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz | same as r0 | same as r0 | same as r0 |
Cache Size | 25344 KB | same as r0 | same as r0 | same as r0 |
Number of Cores | 18 | same as r0 | same as r0 | same as r0 |
Maximal Frequency | 3.7 GHz | same as r0 | same as r0 | same as r0 |
OS Version | Linux 4.18.0-553.el8_10.x86_64 #1 SMP Fri May 24 13:05:10 UTC 2024 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | SKYLAKE | same as r0 | same as r0 | same as r0 |
Compilation Options | AVBP_V7_dev.KRAKEN: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/softs/local_intel/phdf5/1.8.20/include -I/softs/local_intel/parmetis/403_64/include -I/softs/local_intel/ptscotch/6.0.5a/include -I. -I../SOURCES/GENERIC/ -IAMR_INTERFACE/ -IBNDY/ -ICFD/ -ICHEM/ -ICHEM/ANALYTIC/ -ICHEM/ANALYTIC/LIB/ -ICHEM/HYB/ -ICHEM/NOX/ -ICHEM/SOOT_ANALYTIC/ -ICOMMON/ -ICOUPLING/ -IGENERIC/ -IIO/ -ILAGRANGE/ -ILAGRANGE/SOOT_EL/ -ILES/ -IMAIN/ -IMAIN/COMPUTE/ -IMAIN/SLAVE/ -INUMERICS/ -IPARSER/ -IPLASMA/ -IPLASMA/CHEMISTRY/ -IPLASMA/CHEMISTRY/CUSTOM_KINETICS_LIB/ -IPLASMA/DRIFTDIFFUSION/ -IPLASMA/DRIFTDIFFUSION/SCHEMES/ -IPLASMA/ELECTROMAG/ -IPLASMA/EULER/ -IPLASMA/FREEZE/ -IPLASMA/PHOTO/ -IPLASMA/THERMO/ -IPMESH/generic/ -IPMESH/interf_avbp/ -IPMESH/interp_tree_search/ -IPMESH/pmeshlib/ -IPMESH/pproc/ -ISMOOTH/ -ITTC/ -ITTC/LES/ -I/softs/intel/oneapi/mpi/2021.10.0//include -I/softs/intel/oneapi/mpi/2021.10.0/include -g -O3 -fpp -traceback -fno-alias -ip -assume byterecl -convert big_endian -align -march=core-avx2 -fma -axCORE-AVX2 -DHAS_PMETIS -DPARMETIS4 -DMETIS5 -DHAS_PTSCOTCH -c -o GENERIC/gather_o_cpy.o | same as r0 | same as r0 | same as r0 |
Number of processes observed | 36 | 72 | 144 | 288 |
Number of threads observed | 36 | 72 | 144 | 288 |
Frequency Driver | intel_cpufreq | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 |
Huge Pages | always | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 18 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.21.1 | same as r0 | same as r0 | same as r0 |
MAQAO build | 5485021ea6c10887b73ecb44ccd8bc21f8bac10a::20250204-111307 | same as r0 | same as r0 | same as r0 |
Comments | | same as r0 | same as r0 | same as r0 |