options

Loops Index

Columns Filter

Level Max Thread Time / Walltime run_2_threads (%) Max Thread Time / Walltime run_4_threads (%) Max Thread Time / Walltime run_8_threads (%) Max Thread Time / Walltime run_16_threads (%) Max Thread Time / Walltime run_32_threads (%) Max Thread Time / Walltime run_64_threads (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_16_threads (%) Exclusive Coverage run_32_threads (%) Exclusive Coverage run_64_threads (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_16_threads (%) Inclusive Coverage run_32_threads (%) Inclusive Coverage run_64_threads (%) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_64_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_16_threads Speedup If Perfect Load Balancing run_32_threads Speedup If Perfect Load Balancing run_64_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%) Level Max Thread Time / Walltime Exclusive Coverage Inclusive Coverage Max Exclusive Time Over Threads Max Inclusive Time Over Threads Exclusive Time w.r.t. Wall Time Inclusive Time w.r.t. Wall Time Nb Threads Vectorization Ratio Vector Length Use Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency Efficiency Potential Speed-Up
Run 1 Run 2 Run 3 Run 4 Run 5 Run 6
Loop idSource LocationSource FunctionLevelMax Thread Time / Walltime run_2_threads (%)Max Thread Time / Walltime run_4_threads (%)Max Thread Time / Walltime run_8_threads (%)Max Thread Time / Walltime run_16_threads (%)Max Thread Time / Walltime run_32_threads (%)Max Thread Time / Walltime run_64_threads (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_16_threads (%)Exclusive Coverage run_32_threads (%)Exclusive Coverage run_64_threads (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_16_threads (%)Inclusive Coverage run_32_threads (%)Inclusive Coverage run_64_threads (%)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Nb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_64_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_16_threadsSpeedup If Perfect Load Balancing run_32_threadsSpeedup If Perfect Load Balancing run_64_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)
19md-acfl-O3 - simulation.cpp:142-193 [...]computeForces(Particles&, CellList const&) [clone .omp_outlined]InBetween39.8839.1839.0639.3938.1134.5239.8038.9737.7736.2133.6230.0541.1440.3639.1737.5334.8331.07206.58104.7254.1728.8315.147.55213.43108.2556.1929.9315.667.85206.12104.0952.3226.4513.316.47213.08107.8054.2727.4113.796.692481632640255.242211.011.041.111.171.2NANANANANA0.00100.990.380.980.570.970.940.971.0710.12
26md-acfl-O3 - simulation.cpp:142-229 [...]computeForces(Particles&, CellList const&) [clone .omp_outlined]Outermost12.4412.0911.7510.9910.039.2112.3012.0411.4510.569.648.7198.9298.4096.9894.2590.1286.0364.4632.3116.308.043.992.01513.28264.67137.1472.2738.8720.6063.6832.1715.867.713.821.88512.31262.84134.3568.8435.6718.522481632640251.67221.011.011.031.061.071.11NANANANANA0.00100.990.12101.0301.0401.060
20md-acfl-O3 - simulation.cpp:208-208computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost10.6910.4210.2610.059.588.9610.6410.2510.129.758.908.3610.6410.2510.129.758.908.3655.3627.8614.227.363.811.9655.3627.8614.227.363.811.9655.1027.3714.027.123.521.8055.1027.3714.027.123.521.8024816326402512.9141.011.021.021.051.111.1210000100.00101.0100.980.180.970.320.980.20.960.36
24md-acfl-O3 - simulation.cpp:225-225computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost7.868.328.518.929.319.747.818.268.338.488.599.067.818.268.338.488.599.0640.7322.2311.796.533.702.1340.7322.2311.796.533.702.1340.4522.0711.556.203.401.9540.4522.0711.556.203.401.9524816326402512.9141.011.011.031.071.121.1210000100.00100.920.690.881.040.821.560.742.210.653.19
25md-acfl-O3 - simulation.cpp:229-229computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost7.227.667.998.559.1910.087.207.597.858.278.629.147.207.597.858.278.629.1437.4220.4911.096.263.652.2137.4220.4911.096.263.652.2137.2820.2810.886.043.411.9737.2820.2810.886.043.411.9724816326402512.91411.011.031.051.11.1510000100.00100.920.610.861.120.771.890.682.730.593.73
23md-acfl-O3 - simulation.cpp:221-221computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost7.197.497.988.108.689.637.157.477.857.978.359.017.157.477.857.978.359.0137.2520.0211.075.933.452.1137.2520.0211.075.933.452.1137.0119.9510.885.823.301.9437.0119.9510.885.823.301.9424816326402512.9141.011.011.021.031.071.1210000100.00100.930.540.851.170.81.630.72.50.63.64
22md-acfl-O3 - simulation.cpp:216-216computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost6.466.356.396.186.196.156.436.236.155.965.625.376.436.236.155.965.625.3733.4716.968.864.522.461.3533.4716.968.864.522.461.3533.3216.638.524.352.231.1633.3216.638.524.352.231.1624816326402512.9141.011.021.051.061.141.210000100.0010100.980.140.960.250.940.360.90.53
21md-acfl-O3 - simulation.cpp:212-212computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost6.266.246.236.095.986.086.256.206.045.735.565.296.256.206.045.735.565.2932.4416.698.644.452.381.3332.4416.698.644.452.381.3332.3916.578.374.192.201.1432.3916.578.374.192.201.1424816326402512.91411.011.041.081.111.210000100.00100.980.140.970.20.970.190.920.450.890.59
18md-acfl-O3 - simulation.cpp:142-179 [...]computeForces(Particles&, CellList const&) [clone .omp_outlined]InBetween0.931.051.131.191.081.070.920.971.030.970.880.711.341.391.401.321.221.034.822.811.570.870.430.237.053.892.101.100.580.314.772.601.430.710.350.156.963.711.940.960.480.222481632640251141.011.091.111.251.271.58NANANANANA0.00100.920.080.840.170.840.150.860.130.970.02
17md-acfl-O3 - simulation.cpp:142-176 [...]computeForces(Particles&, CellList const&) [clone .omp_outlined]InBetween0.450.470.400.410.500.550.420.420.370.350.340.310.420.420.370.350.340.312.341.260.560.300.200.122.341.260.560.300.200.122.191.120.520.250.130.072.191.120.520.250.130.07248163264022.37112.641.071.131.091.21.541.84NANANANANA0.00100.980.011.0601.0701.0201.020
37md-acfl-O3 - simulation.cpp:313-325velocityVerlet(Particles&, CellList&, int, int) [clone .omp_outlined]Single0.080.080.100.110.110.110.080.060.070.060.050.050.080.060.070.060.050.050.420.200.140.080.050.030.420.200.140.080.050.030.410.160.100.040.020.010.410.160.100.040.020.0124816295810093.271111.011.291.421.92.332.031000910.00101.2901.0701.201.4301.110
39md-acfl-O3 - simulation.cpp:342-344velocityVerlet(Particles&, CellList&, int, int) [clone .omp_outlined.11]Single0.080.090.140.150.310.530.070.080.120.110.260.360.070.080.120.110.260.360.400.250.200.110.130.120.400.250.200.110.130.120.390.220.160.080.100.080.390.220.160.080.100.082481632641001001111.021.151.251.331.271.510006050.00100.890.010.60.050.580.050.240.190.150.31
16md-acfl-O3 - simulation.cpp:107-125 [...]assignParticlesToCells(Particles const&, CellList&) [clone .omp_outlined.8]Single0.050.050.060.100.110.140.040.050.060.050.090.090.040.050.060.050.090.090.230.140.090.070.040.030.230.140.090.070.040.030.220.120.080.040.040.020.220.120.080.040.040.02248163264018.758.39221.061.171.112.021.311.584000357.14100.9300.680.020.790.010.390.050.360.06
10md-acfl-O3 - simulation.cpp:51-68 [...]assignParticlesToCells(Particles const&, CellList&) [clone .omp_outlined.2]Single0.030.020.040.080.060.070.030.020.030.060.030.040.030.020.030.060.030.040.160.060.050.060.030.020.160.060.050.060.030.020.150.060.040.050.010.010.150.060.040.050.010.01248163258019.125.99221.11.081.231.212.221.694000180.00101.2100.8900.390.040.780.010.550.02
27md-acfl-O3 - simulation.cpp:142-142computeForces(Particles&, CellList const&) [clone .omp_outlined]Innermost0.000.000.000.000.000.020.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000001023.21114.8000001NANANANANA0.0010
×