options

Loops Index

Columns Filter

Level Max Thread Time / Walltime run_2_threads (%) Max Thread Time / Walltime run_4_threads (%) Max Thread Time / Walltime run_8_threads (%) Max Thread Time / Walltime run_16_threads (%) Max Thread Time / Walltime run_32_threads (%) Max Thread Time / Walltime run_64_threads (%) Max Thread Time / Walltime run_96_threads (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_16_threads (%) Exclusive Coverage run_32_threads (%) Exclusive Coverage run_64_threads (%) Exclusive Coverage run_96_threads (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_16_threads (%) Inclusive Coverage run_32_threads (%) Inclusive Coverage run_64_threads (%) Inclusive Coverage run_96_threads (%) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_32_threads (s) Max Exclusive Time Over Threads run_64_threads (s) Max Exclusive Time Over Threads run_96_threads (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_32_threads (s) Max Inclusive Time Over Threads run_64_threads (s) Max Inclusive Time Over Threads run_96_threads (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_32_threads (s) Exclusive Time w.r.t. Wall Time run_64_threads (s) Exclusive Time w.r.t. Wall Time run_96_threads (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_32_threads (s) Inclusive Time w.r.t. Wall Time run_64_threads (s) Inclusive Time w.r.t. Wall Time run_96_threads (s) Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_32_threads Nb Threads run_64_threads Nb Threads run_96_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_16_threads Speedup If Perfect Load Balancing run_32_threads Speedup If Perfect Load Balancing run_64_threads Speedup If Perfect Load Balancing run_96_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_32_threads) Efficiency (run_32_threads) Potential Speed-Up (%) (run_64_threads) Efficiency (run_64_threads) Potential Speed-Up (%) (run_96_threads) Efficiency (run_96_threads) Potential Speed-Up (%) Level Max Thread Time / Walltime Exclusive Coverage Inclusive Coverage Max Exclusive Time Over Threads Max Inclusive Time Over Threads Exclusive Time w.r.t. Wall Time Inclusive Time w.r.t. Wall Time Nb Threads Vectorization Ratio Vector Length Use Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency Efficiency Potential Speed-Up
Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7
Loop idSource LocationSource FunctionLevelMax Thread Time / Walltime run_2_threads (%)Max Thread Time / Walltime run_4_threads (%)Max Thread Time / Walltime run_8_threads (%)Max Thread Time / Walltime run_16_threads (%)Max Thread Time / Walltime run_32_threads (%)Max Thread Time / Walltime run_64_threads (%)Max Thread Time / Walltime run_96_threads (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_16_threads (%)Exclusive Coverage run_32_threads (%)Exclusive Coverage run_64_threads (%)Exclusive Coverage run_96_threads (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_16_threads (%)Inclusive Coverage run_32_threads (%)Inclusive Coverage run_64_threads (%)Inclusive Coverage run_96_threads (%)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_32_threads (s)Max Exclusive Time Over Threads run_64_threads (s)Max Exclusive Time Over Threads run_96_threads (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_32_threads (s)Max Inclusive Time Over Threads run_64_threads (s)Max Inclusive Time Over Threads run_96_threads (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_32_threads (s)Exclusive Time w.r.t. Wall Time run_64_threads (s)Exclusive Time w.r.t. Wall Time run_96_threads (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_32_threads (s)Inclusive Time w.r.t. Wall Time run_64_threads (s)Inclusive Time w.r.t. Wall Time run_96_threads (s)Nb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_32_threadsNb Threads run_64_threadsNb Threads run_96_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_16_threadsSpeedup If Perfect Load Balancing run_32_threadsSpeedup If Perfect Load Balancing run_64_threadsSpeedup If Perfect Load Balancing run_96_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_32_threads) Efficiency(run_32_threads) Potential Speed-Up (%)(run_64_threads) Efficiency(run_64_threads) Potential Speed-Up (%)(run_96_threads) Efficiency(run_96_threads) Potential Speed-Up (%)
23md-gcc-O3 - simulation.cpp:179-227 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]InBetween43.1642.6142.3942.0339.6737.3134.7142.8842.3140.6338.4235.2032.0329.7896.9996.1794.0791.5287.6984.5381.68206.83104.2453.8328.0214.277.254.82466.25237.46123.1664.6534.7418.4812.90205.46103.5051.5825.6112.656.214.12464.75235.22119.4261.0031.5216.3911.3024816326496050.915.911.821.741.011.011.041.091.131.171.17NANANANANA0.00100.990.3110.17101.0201.0301.040
17md-gcc-O3 - simulation.cpp:206-206 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost19.8819.5518.9818.2918.3617.2516.6119.7819.2218.5117.5416.1414.9613.9719.7819.2218.5117.5416.1414.9613.9795.2547.8424.1012.206.613.352.3195.2547.8424.1012.206.613.352.3194.8047.0023.4911.695.802.901.9394.8047.0023.4911.695.802.901.932481632649605011.8211.021.031.041.141.161.1910000100.00101.0101.0101.0101.0201.0201.020
20md-gcc-O3 - simulation.cpp:219-219 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost8.528.759.069.6410.4411.5912.228.458.608.789.209.7810.3710.688.458.608.789.209.7810.3710.6840.8121.4011.506.423.752.251.7040.8121.4011.506.423.752.251.7040.5021.0311.156.133.522.011.4840.5021.0311.156.133.522.011.482481632649605011.821.011.021.031.051.071.121.1510000100.00100.960.320.910.810.831.610.722.740.633.850.574.58
21md-gcc-O3 - simulation.cpp:223-223 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost7.237.577.888.319.0210.0410.607.217.427.587.878.208.708.857.217.427.587.878.208.708.8534.6518.5110.015.543.241.951.4734.6518.5110.015.543.241.951.4734.5418.169.635.242.951.691.2334.5418.169.635.242.951.691.232481632649605011.8211.021.041.061.11.161.210000100.00100.950.360.90.780.821.390.732.190.643.140.593.65
22md-gcc-O3 - simulation.cpp:227-227 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost6.717.167.437.978.819.7310.996.686.907.217.608.098.678.916.686.907.217.608.098.678.9132.1417.529.435.313.171.891.5232.1417.529.435.313.171.891.5232.0316.889.155.072.911.681.2332.0316.889.155.072.911.681.232481632649605011.8211.041.031.051.091.121.2410000100.00100.950.350.880.90.791.60.692.520.63.510.544.09
19md-gcc-O3 - simulation.cpp:214-214 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost6.096.106.196.026.375.925.986.075.985.825.575.294.954.866.075.985.825.575.294.954.8629.1714.937.864.012.291.150.8329.1714.937.864.012.291.150.8329.0914.637.393.711.900.960.6729.0914.637.393.711.900.960.672481632649605011.8211.021.061.081.21.21.2410000100.00100.990.040.980.090.980.110.960.230.950.260.90.48
18md-gcc-O3 - simulation.cpp:210-210 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]Innermost5.955.815.675.865.625.925.515.915.735.545.314.994.834.645.915.735.545.314.994.834.6428.5314.207.203.902.021.150.7728.5314.207.203.902.021.150.7728.3214.037.033.541.800.940.6428.3214.037.033.541.800.940.642481632649605011.821.011.011.021.11.131.231.1910000100.00101.0101.010100.990.070.940.270.920.38
24md-gcc-O3 - simulation.cpp:176-227 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]InBetween1.091.231.281.261.101.030.941.081.201.221.100.870.740.6198.0797.3695.2992.6288.5685.2782.305.213.011.620.840.400.200.13471.39240.47124.6365.3535.0918.6313.015.172.931.550.730.310.140.09469.92238.15120.9761.7331.8316.5311.3924816326496059.38111.131.011.031.041.151.271.391.53NANANANANA0.00100.880.140.830.210.880.131.0401.1301.270
38md-gcc-O3 - simulation.cpp:313-325velocityVerlet(Particles&, CellList&, int, int) [clone ._omp_fn.0]Innermost0.500.510.500.520.610.800.790.500.490.460.450.450.470.420.500.490.460.450.450.470.422.411.250.630.350.220.160.112.411.250.630.350.220.160.112.401.200.580.300.160.090.062.401.200.580.300.160.090.062481632649605011.0421.011.041.091.171.351.711.881003062.5010101.030100.920.040.820.080.850.06
16md-gcc-O3 - simulation.cpp:158-227 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]InBetween0.180.240.230.240.330.330.360.170.210.200.180.170.170.1498.2497.5795.4992.7988.7285.4482.430.880.590.290.160.120.060.05472.27241.06124.9065.4635.1518.6613.030.840.500.250.120.060.030.02470.76238.65121.2361.8531.8916.5711.4124816326496036.671121.051.171.141.3622.012.6NANANANANA0.00100.830.030.820.040.890.020.870.020.810.030.910.01
36md-gcc-O3 - simulation.cpp:342-344 [...]velocityVerlet(Particles&, CellList&, int, int) [clone ._omp_fn.1]Innermost0.090.090.150.150.260.360.470.080.090.120.100.150.210.280.080.090.120.100.150.210.280.430.230.190.100.090.070.060.430.230.190.100.090.070.060.400.220.150.070.050.040.040.400.220.150.070.050.040.04248163264960501121.081.041.241.441.771.691.67000000.00100.920.010.670.040.720.030.470.080.30.150.210.22
4md-gcc-O3 - simulation.cpp:114-125 [...]assignParticlesToCells(Particles const&, CellList&) [clone ._omp_fn.4]Innermost0.030.040.050.090.070.230.180.030.030.040.060.040.050.080.030.030.040.060.040.050.080.160.100.070.060.020.050.030.160.100.070.060.020.050.030.140.080.050.040.010.010.010.140.080.050.040.010.010.0124816325285033.3310.49221.121.291.321.521.83.552.071000325.00100.8800.720.010.450.030.640.010.430.030.280.06
15md-gcc-O3 - simulation.cpp:145-227 [...]computeForces(Particles&, CellList const&) [clone ._omp_fn.0]InBetween0.020.020.040.030.040.080.110.020.010.020.010.020.020.0198.2697.5895.5292.8188.7485.4582.450.090.050.050.020.020.010.02472.33241.07124.9465.4932.4416.9711.650.080.030.030.010.010.000.00470.84238.68121.2661.8631.8916.5711.412481421303044.4461.111.0811.021.131.821.671.811.82.052.31NANANANANA0.00101.4500.670.011.0300.9100.7300.820
10md-gcc-O3 - simulation.cpp:58-68 [...]assignParticlesToCells(Particles const&, CellList&) [clone ._omp_fn.1]Innermost0.010.020.020.040.040.080.110.010.010.010.020.010.020.020.010.010.010.020.010.020.020.070.050.030.030.010.020.020.070.050.030.030.010.020.020.070.030.020.010.000.000.000.070.030.020.010.000.000.0024815202944032.8910.492211.641.62.2722.022.321000150.00101.1800.8700.7900.8700.60.010.460.01
66md-gcc-O3 - simulation.cpp:140-346 [...]velocityVerlet(Particles&, CellList&, int, int)Single0.000.000.000.000.000.000.040.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000000001048.811110000001NANANANANA0.0010
35md-gcc-O3 - simulation.cpp:338-344 [...]velocityVerlet(Particles&, CellList&, int, int) [clone ._omp_fn.1]Outermost0.000.000.000.000.010.030.040.000.000.000.000.000.000.000.000.000.000.000.150.210.280.000.000.000.000.000.010.000.000.000.000.000.020.050.060.000.000.000.000.000.000.000.000.000.000.000.050.040.040000135045.971.241.111.670000111NANANANANA0.00101010
×