start addr | function name | source location | level | ancestor thread num | invoker | parallel or teams | run_52_threads: requested parallelism | run_52_threads: walltime sum (s) | run_52_threads: nb instances | run_52_threads: any sync average per thread time (s) | run_52_threads: any wait average per thread time (s) | run_52_threads: parallelism overhead (%) | run_52_threads: local speedup if perfectly balanced | run_52_threads: global speedup if perfectly balanced |
md-icpx-Ofast:0x4034a9 | velocityVerlet(Particles&, CellList&, int, int) | simulation.cpp:140 | 0 | 0 | runtime | parallel | 52 | 47.465 | 500 | 5.429 | 5.429 | 11.4 | 1.129 | 1.119 |
md-icpx-Ofast:0x4033f4 | velocityVerlet(Particles&, CellList&, int, int) | simulation.cpp:356 | 0 | 0 | runtime | parallel | 52 | 0.335 | 500 | 28.4 E-3 | 28.4 E-3 | 8.50 | 1.093 | 1.001 |
md-icpx-Ofast:0x4029c1 | main | main.cpp:47 | 0 | 0 | runtime | parallel | 52 | 0.218 | 500 | 13.6 E-3 | 13.6 E-3 | 6.24 | 1.067 | 1.000 |
md-icpx-Ofast:0x4032e2 | computeForces(Particles&, CellList const&) | simulation.cpp:140 | 0 | 0 | runtime | parallel | 52 | 93.3 E-3 | 1.00 | 9.62 E-3 | 9.62 E-3 | 10.3 | 1.115 | 1.000 |
md-icpx-Ofast:0x403087 | assignParticlesToCells(Particles const&, CellList&) | stl_vector.h:802 | 0 | 0 | runtime | parallel | 52 | 88.2 E-3 | 51.0 | 5.42 E-3 | 5.41 E-3 | 6.14 | 1.065 | 1.000 |
md-icpx-Ofast:0x403019 | assignParticlesToCells(Particles const&, CellList&) | simulation.cpp:76 | 0 | 0 | runtime | parallel | 52 | 34.2 E-3 | 51.0 | 8.91 E-3 | 8.90 E-3 | 26.0 | 1.352 | 1.000 |
md-icpx-Ofast:0x402f08 | assignParticlesToCells(Particles const&, CellList&) | simulation.cpp:41 | 0 | 0 | runtime | parallel | 52 | 26.3 E-3 | 51.0 | 2.96 E-3 | 2.96 E-3 | 11.3 | 1.127 | 1.000 |
md-icpx-Ofast:0x402e17 | assignParticlesToCells(Particles const&, CellList&) | stl_vector.h:1119 | 0 | 0 | runtime | parallel | 52 | 7.16 E-3 | 51.0 | 495 E-6 | 477 E-6 | 6.91 | 1.074 | 1.000 |
md-icpx-Ofast:0x403062 | assignParticlesToCells(Particles const&, CellList&) | simulation.cpp:89 | 0 | 0 | runtime | parallel | 52 | 3.76 E-3 | 51.0 | 493 E-6 | 487 E-6 | 13.1 | 1.151 | 1.000 |
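
The derived columns can be cross-checked against the raw timings. The sketch below assumes, purely from the figures in this table and not from any tool documentation, that "parallelism overhead (%)" is the average per-thread sync time divided by the walltime sum, and that "local speedup if perfectly balanced" is 1 / (1 - overhead). The function names and the `rows` list are hypothetical helpers introduced for illustration; the numbers are copied from three of the rows above.

```python
# Cross-check of the derived columns, under the ASSUMED relations:
#   overhead (%)  = 100 * sync_avg_per_thread / walltime_sum
#   local speedup = 1 / (1 - overhead / 100)
# These relations are inferred from the table values, not from tool docs.

def parallelism_overhead_pct(sync_avg_s: float, walltime_sum_s: float) -> float:
    """Share of the region's walltime spent in synchronization, in percent."""
    return 100.0 * sync_avg_s / walltime_sum_s


def local_speedup_if_balanced(overhead_pct: float) -> float:
    """Speedup of the region if the synchronization time were removed."""
    return 1.0 / (1.0 - overhead_pct / 100.0)


# (source location, walltime sum (s), sync avg (s), reported overhead (%), reported local speedup)
rows = [
    ("simulation.cpp:140", 47.465, 5.429, 11.4, 1.129),
    ("simulation.cpp:356", 0.335, 28.4e-3, 8.50, 1.093),
    ("simulation.cpp:76", 34.2e-3, 8.91e-3, 26.0, 1.352),
]

for location, walltime, sync, reported_pct, reported_speedup in rows:
    pct = parallelism_overhead_pct(sync, walltime)
    speedup = local_speedup_if_balanced(pct)
    print(f"{location}: overhead {pct:.2f}% (reported {reported_pct}), "
          f"local speedup {speedup:.3f} (reported {reported_speedup})")
```

Under these assumptions the recomputed values match the reported columns to within the rounding of the three-significant-digit inputs. The global speedup column would additionally need the total application time, which this table does not list.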