Loops Index

1 loop has been discarded from the report because its ratio ((Max Inclusive Time Over Threads * 100) / Max Thread Active Time) is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0.00% of the application. To include it, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additional parameter --force-static-analysis
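The discard rule in the message above can be sketched in a few lines. This is only an illustration of the stated formula, not the tool's actual implementation: the function names and the sample timings (including the 370.0 s Max Thread Active Time) are hypothetical, and only the formula and the 0.01% default come from the message.

```python
def coverage_ratio(max_inclusive_time_s, max_thread_active_time_s):
    """Ratio (%) as written in the message:
    (Max Inclusive Time Over Threads * 100) / Max Thread Active Time."""
    if max_thread_active_time_s <= 0:
        raise ValueError("Max Thread Active Time must be positive")
    return max_inclusive_time_s * 100.0 / max_thread_active_time_s


def is_discarded(max_inclusive_time_s, max_thread_active_time_s,
                 object_coverage_threshold=0.01):
    """True when the loop's ratio is lower than the threshold (both in %)."""
    ratio = coverage_ratio(max_inclusive_time_s, max_thread_active_time_s)
    return ratio < object_coverage_threshold


# Hypothetical timings in seconds, chosen only to show both outcomes.
print(is_discarded(0.02, 370.0))  # ratio is about 0.0054% -> True
print(is_discarded(3.97, 370.0))  # ratio is about 1.07%   -> False
```

Under this reading, lowering object_coverage_threshold makes the comparison harder to satisfy, so fewer loops are dropped from the report.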

Loop id | Source Location | Source Function | Level
1 | kmeans-gcc-Ofast - main.cpp:117-123 | k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | Innermost
2 | kmeans-gcc-Ofast - main.cpp:116-123 | k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | InBetween
0 | kmeans-gcc-Ofast - main.cpp:114-123 | k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | Outermost
7 | kmeans-gcc-Ofast - main.cpp:140-145 | k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.1] | Outermost
6 | kmeans-gcc-Ofast - main.cpp:143-145 | k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.1] | Innermost

Max Thread Time / Walltime (%)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 83.69 | 45.85 | 29.75 | 8.30 | 5.88
2 | 2.96 | 1.54 | 2.12 | 1.07 | 0.99
0 | 1.27 | 1.14 | 0.42 | 0.17 | 0.13
7 | 0.00 | 0.00 | 0.00 | 0.09 | 0.10
6 | 0.00 | 0.00 | 0.00 | 0.98 | 0.98

Exclusive Coverage (%)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 84.06 | 51.25 | 36.44 | 10.23 | 7.43
2 | 2.97 | 1.49 | 2.59 | 1.22 | 1.11
0 | 1.28 | 1.25 | 0.39 | 0.15 | 0.10
7 | 0.00 | 0.00 | 0.00 | 0.08 | 0.07
6 | 0.00 | 0.00 | 0.00 | 1.17 | 1.21

Inclusive Coverage (%)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 84.06 | 51.25 | 36.44 | 10.23 | 7.43
2 | 87.04 | 52.74 | 39.03 | 11.44 | 8.55
0 | 88.32 | 53.99 | 39.42 | 11.59 | 8.64
7 | 0.00 | 0.00 | 0.00 | 1.25 | 1.28
6 | 0.00 | 0.00 | 0.00 | 1.17 | 1.21

Max Exclusive Time Over Threads (s)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 313.74 | 193.82 | 109.36 | 30.57 | 21.70
2 | 11.09 | 6.53 | 7.81 | 3.93 | 3.63
0 | 4.78 | 4.80 | 1.56 | 0.63 | 0.49
7 | 0.00 | 0.00 | 0.00 | 0.34 | 0.38
6 | 0.00 | 0.00 | 0.00 | 3.63 | 3.62

Max Inclusive Time Over Threads (s)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 313.74 | 193.82 | 109.36 | 30.57 | 21.70
2 | 324.83 | 200.35 | 117.15 | 34.04 | 25.33
0 | 329.61 | 205.15 | 118.39 | 34.43 | 25.67
7 | 0.00 | 0.00 | 0.00 | 3.97 | 3.91
6 | 0.00 | 0.00 | 0.00 | 3.63 | 3.62

Exclusive Time w.r.t. Wall Time (s)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 313.74 | 193.95 | 110.12 | 29.82 | 21.57
2 | 11.09 | 5.64 | 7.81 | 3.55 | 3.23
0 | 4.78 | 4.73 | 1.17 | 0.44 | 0.28
7 | 0.00 | 0.00 | 0.00 | 0.23 | 0.22
6 | 0.00 | 0.00 | 0.00 | 3.42 | 3.50

Inclusive Time w.r.t. Wall Time (s)
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 313.74 | 193.95 | 110.12 | 29.82 | 21.57
2 | 324.83 | 199.59 | 117.93 | 33.37 | 24.80
0 | 329.61 | 204.32 | 119.10 | 33.81 | 25.08
7 | 0.00 | 0.00 | 0.00 | 3.66 | 3.71
6 | 0.00 | 0.00 | 0.00 | 3.42 | 3.50

Nb Threads
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 1 | 2 | 4 | 8 | 10
2 | 1 | 2 | 4 | 8 | 10
0 | 1 | 2 | 4 | 8 | 10
7 | 0 | 0 | 0 | 8 | 10
6 | 0 | 0 | 0 | 8 | 10

GFLOPS
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 3.19 | 6.36 | 12.62 | 25.13 | 31.40
2 | 0.45 | 1.12 | 0.79 | 1.25 | 1.18
0 | 1.11 | 1.60 | 6.16 | 7.08 | 12.74
7 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00
6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00

Vectorization
Loop id | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized
1 | 58.57 | 19.38 | 1.18 | 2.18 | 5
2 | 25 | 15.63 | 2.25 | 1 | 8
0 | 0 | 9.67 | 3 | 1 | 13.4
7 | 0 | 10.71 | 1 | 1 | 11.73
6 | 0 | 11.61 | 1.3 | 1.08 | 10.4

Speedup If Perfect Load Balancing
Loop id | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads
1 | 1 | 1.06 | 1.05 | 1.05 | 1.03
2 | 1 | 1.21 | 1.05 | 1.14 | 1.15
0 | 1 | 1.07 | 1.4 | 1.47 | 1.78
7 | 0 | 0 | 0 | 1.5 | 1.8
6 | 0 | 0 | 0 | 1.09 | 1.06

Strides and Array Access Efficiency
Loop id | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency
1 | 0.5 | 1 | 0 | 0 | 0 | 100.00
2 | 0 | 2 | 0 | 0 | 0 | 100.00
0 | NA | NA | NA | NA | NA | 75.00
7 | 0 | 1 | 0 | 1 | 3 | 30.00
6 | 0 | 2 | 0 | 0 | 3 | 40.00

Efficiency and Potential Speed-Up (%)
Loop id | (run_1_thread) Efficiency | (run_1_thread) Potential Speed-Up (%) | (run_2_threads) Efficiency | (run_2_threads) Potential Speed-Up (%) | (run_4_threads) Efficiency | (run_4_threads) Potential Speed-Up (%) | (run_8_threads) Efficiency | (run_8_threads) Potential Speed-Up (%) | (run_10_threads) Efficiency | (run_10_threads) Potential Speed-Up (%)
1 | 1 | 0 | 0.81 | 9.8 | 0.71 | 10.49 | 1.32 | 0 | 1.45 | 0
2 | 1 | 0 | 0.98 | 0.03 | 0.35 | 1.67 | 0.39 | 0.74 | 0.34 | 0.73
0 | 1 | 0 | 0.5 | 0.62 | 1.02 | 0 | 1.35 | 0 | 1.71 | 0
7 | 1, 0, 1, 0 (only four values are present for this loop; which runs they belong to cannot be recovered from the source)
6 | 1, 0, 1, 0 (only four values are present for this loop; which runs they belong to cannot be recovered from the source)