options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_16_threads (%) Coverage run_26_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_16_threads (%) Coverage Excluding Loops run_26_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_16_threads (s) Max Inclusive Time Over Threads run_26_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_16_threads (s) Max Exclusive Time Over Threads run_26_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_16_threads (s) Inclusive Time w.r.t. Wall Time run_26_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_16_threads (s) Exclusive Time w.r.t. Wall Time run_26_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_16_threads Nb Threads run_26_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_16_threads Deviation (coverage) run_26_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_16_threads Deviation (walltime) run_26_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_16_threads Categories run_26_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_16_threads GFLOPS run_26_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_16_threads) Efficiency (run_16_threads) Potential Speed-Up (%) (run_26_threads) Efficiency (run_26_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_16_threads (%)Coverage run_26_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_16_threads (%)Coverage Excluding Loops run_26_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_16_threads (s)Max Inclusive Time Over Threads run_26_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_16_threads (s)Max Exclusive Time Over Threads run_26_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_16_threads (s)Inclusive Time w.r.t. Wall Time run_26_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_16_threads (s)Exclusive Time w.r.t. Wall Time run_26_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_16_threadsNb Threads run_26_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_16_threadsDeviation (coverage) run_26_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_16_threadsDeviation (walltime) run_26_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_16_threadsCategories run_26_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_16_threadsGFLOPS run_26_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_16_threads) Efficiency(run_16_threads) Potential Speed-Up (%)(run_26_threads) Efficiency(run_26_threads) Potential Speed-Up (%)
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0]+kmeans-gcc-O3-funroll93.2992.9692.3091.1188.7586.030.000.000.000.000.000.00151.8575.9737.9919.009.505.860.000.000.000.000.000.00151.8580.6945.0927.2218.0814.390.000.000.000.000.000.00124816260.008.3310.3811.8611.9611.080.000.050.020.010.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.127.7513.8622.9634.5643.41GNU C++14 14.2.1 20240910 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ...100.945.490.8414.590.727.570.5242.170.4151.12
Loop 2 - main.cpp:60-67 - kmeans-gcc-O3-funroll+93.2992.9692.3091.1188.7586.035.895.725.535.845.725.65151.8576.0138.3219.169.656.039.594.692.511.330.670.47151.8580.6945.0927.2218.0814.399.594.962.701.741.170.95124816260.000.470.880.880.950.990.000.030.170.070.050.044.087.7814.1721.8632.9740.61100.970.20.890.630.691.830.512.780.393.45
Loop 1 - main.cpp:61-67 - kmeans-gcc-O3-funroll87.4187.2486.7685.2783.0380.3787.4187.2486.7685.2783.0380.37142.2671.3235.8117.838.985.57142.2671.3235.8117.838.985.57142.2675.7342.3925.4716.9213.45142.2675.7342.3925.4716.9213.45124816260.007.869.6011.0511.1310.390.000.080.150.070.050.054.127.7413.8423.0434.6743.61100.945.290.8413.970.725.740.5339.390.4147.67
k_means(int, point_t*, point_t*, int*, point_t*, int, int)+kmeans-gcc-O3-funroll6.696.656.616.516.356.160.000.000.000.000.000.0010.8910.8710.8710.8610.8810.880.000.000.000.000.000.0010.895.783.231.951.291.030.000.000.000.000.000.001111110.000.000.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.460.871.552.573.864.85GNU C++14 14.2.1 20240910 -march=skylake-avx512 -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi ...100.940.380.841.030.71.950.533.010.413.66
Loop 4 - main.cpp:56-95 - kmeans-gcc-O3-funroll [...]+6.696.656.616.516.356.160.000.000.000.000.000.0010.8910.8710.8710.8610.8810.880.000.000.000.000.000.0010.895.783.231.951.291.030.000.000.000.000.000.000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main.cpp:81-84 - kmeans-gcc-O3-funroll6.696.656.616.516.356.166.696.656.616.516.356.1610.8910.8710.8710.8610.8810.8810.8910.8710.8710.8610.8810.8810.895.783.231.951.291.0310.895.783.231.951.291.031111110.000.000.000.000.000.000.000.000.000.000.000.000.460.871.552.573.864.85100.940.380.841.030.71.950.533.010.413.66
Loop 5 - main.cpp:86-92 - kmeans-gcc-O3-funroll0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
unknown_kernel_regionkernel0.010.010.010.030.000.010.000.000.000.000.000.000.020.010.010.020.000.000.000.000.000.000.000.000.020.010.010.010.000.000.000.000.000.000.000.001226120.000.010.000.020.000.000.000.000.000.000.000.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.000.000.000.001.120.000.00
gomp_barrier_wait_endlibgomp.so.1.0.00.000.320.962.244.687.560.000.320.962.244.687.560.000.520.540.540.550.560.000.520.540.540.550.560.000.280.470.670.951.270.000.280.470.670.951.27013715250.000.000.030.020.110.180.000.000.010.000.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.00101010101010
gomp_team_barrier_wait_endlibgomp.so.1.0.00.000.060.120.110.220.240.000.060.120.110.220.240.000.060.080.040.040.040.000.060.080.040.040.040.000.050.060.030.040.040.000.050.060.030.040.04024816260.000.040.070.070.080.160.000.020.030.010.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.00101010101010
×