Loops Index

Loop id  Source Location                     Source Function                                                                Level
4        kmeans-gcc-O3-all - main.cpp:61-67  k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0]  Innermost
15       kmeans-gcc-O3-all - main.cpp:81-84  k_means(int, point_t*, point_t*, int*, point_t*, int, int)                     Innermost
5        kmeans-gcc-O3-all - main.cpp:60-67  k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0]  InBetween

Run-independent metrics

Metric                           Loop 4  Loop 15   Loop 5
Vectorization Ratio (%)            8.33     7.69    11.76
Vector Length Use (%)             49.31    47.44    47.79
Speedup If No Scalar Integer        3.6     1.57     4.59
Speedup If FP Vectorized            1.5     1.14     1.61
Speedup If Fully Vectorized        1.67     1.76     1.41
Stride 0                             NA        0       NA
Stride 1                             NA        2       NA
Stride n                             NA        0       NA
Stride Unknown                       NA        1       NA
Stride Indirect                      NA        3       NA
Array Access Efficiency            0.00    41.67     0.00

Per-run metrics. Columns are the runs run_1_thread, run_2_threads, run_4_threads, run_8_threads, run_16_threads, run_32_threads, run_48_threads, run_64_threads, run_80_threads, and run_96_threads.

Loop 4 (main.cpp:61-67, Innermost)

Metric / run (threads)                     1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                 85.44   85.70   85.90   85.77   85.49   85.97   85.67   85.34   85.80   85.65
Inclusive Coverage (%)                 85.44   85.70   85.90   85.77   85.49   85.97   85.67   85.34   85.80   85.65
Max Exclusive Time Over Threads (s)    81.11   40.85   20.69   10.39    5.22    2.66    1.78    1.39    1.15    1.02
Max Inclusive Time Over Threads (s)    81.11   40.85   20.69   10.39    5.22    2.66    1.78    1.39    1.15    1.02
Exclusive Time w.r.t. Wall Time (s)    81.11   43.97   25.28   15.85   11.09    8.78    7.98    7.61    7.42    7.25
Inclusive Time w.r.t. Wall Time (s)    81.11   43.97   25.28   15.85   11.09    8.78    7.98    7.61    7.42    7.25
Nb Threads                                 1       2       4       8      16      32      48      64      80      96
GFLOPS                                  8.36   15.49   26.99   43.14   61.56   77.14   85.11   89.70   63.00   52.93
Speedup If Perfect Load Balancing          1       1    1.01    1.01    1.02    1.02    1.03    1.04    1.04    1.08
Efficiency                                 1    0.92     0.8    0.64    0.46    0.29    0.21    0.17    0.14    0.12
Potential Speed-Up (%)                     0    6.66      17    30.9    46.4   61.15   67.52   71.13   74.07   75.67

Loop 15 (main.cpp:81-84, Innermost)

Metric / run (threads)                     1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                  7.78    7.77    7.74    7.71    7.69    7.68    7.68    7.49    7.24    6.99
Inclusive Coverage (%)                  7.78    7.77    7.74    7.71    7.69    7.68    7.68    7.49    7.24    6.99
Max Exclusive Time Over Threads (s)     7.39    7.40    7.40    7.39    7.40    7.42    7.45    7.47    7.44    7.42
Max Inclusive Time Over Threads (s)     7.39    7.40    7.40    7.39    7.40    7.42    7.45    7.47    7.44    7.42
Exclusive Time w.r.t. Wall Time (s)     7.39    3.99    2.28    1.42    1.00    0.78    0.72    0.67    0.63    0.59
Inclusive Time w.r.t. Wall Time (s)     7.39    3.99    2.28    1.42    1.00    0.78    0.72    0.67    0.63    0.59
Nb Threads                                 1       1       1       1       1       1       1       1       1       1
GFLOPS                                  0.68    1.25    2.19    3.51    5.01    6.38    6.99    7.49    7.99    8.45
Speedup If Perfect Load Balancing          1       1       1       1       1       1       1       1       1       1
Efficiency                                 1    0.93    0.81    0.65    0.46    0.29    0.22    0.17    0.15    0.13
Potential Speed-Up (%)                     0    0.57    1.47    2.71    4.13    5.42    6.03    6.19    6.17    6.08

Loop 5 (main.cpp:60-67, InBetween)

Metric / run (threads)                     1       2       4       8      16      32      48      64      80      96
Exclusive Coverage (%)                  6.78    6.53    6.33    6.48    6.72    6.22    6.39    6.86    6.53    6.82
Inclusive Coverage (%)                 92.22   92.23   92.23   92.25   92.21   92.19   92.06   92.20   92.33   92.47
Max Exclusive Time Over Threads (s)     6.43    3.17    1.62    0.89    0.50    0.24    0.18    0.16    0.14    0.12
Max Inclusive Time Over Threads (s)    87.55   43.91   22.05   11.09    5.57    2.80    1.87    1.45    1.22    1.06
Exclusive Time w.r.t. Wall Time (s)     6.43    3.35    1.86    1.20    0.87    0.63    0.60    0.61    0.56    0.58
Inclusive Time w.r.t. Wall Time (s)    87.55   47.32   27.14   17.05   11.96    9.41    8.57    8.22    7.98    7.83
Nb Threads                                 1       2       4       8      16      32      48      64      80      96
GFLOPS                                 12.37   22.80   40.32   61.63   86.02  115.11  122.89  118.20   85.46   67.12
Speedup If Perfect Load Balancing          1    1.02    1.07    1.15    1.25    1.28    1.39     1.5    1.73    1.59
Efficiency                                 1    0.96    0.86    0.67    0.46    0.32    0.23    0.16    0.14    0.12
Potential Speed-Up (%)                     0    0.26    0.87    2.13    3.62    4.25    4.95    5.73     5.6    6.03