Functions and Loops

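Functions, modules and compilation options

| Name | Module | Compilation Options |
|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | kmeans-gcc-Ofast | GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -g -Ofast -std=c++14 -fno-omit-frame-pointer -fopenmp |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | | |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | | |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | | |
| do_lookup_x | ld-linux-aarch64.so.1 | |
| gomp_team_barrier_wait_end | libgomp.so.1.0.0 | |
| __aarch64_ldadd4_acq_rel | libgomp.so.1.0.0 | |
| gomp_barrier_wait_end | libgomp.so.1.0.0 | |
| gomp_team_start | libgomp.so.1.0.0 | |

Max Thread Time / Walltime (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 75.13 | 61.60 | 45.57 | 30.01 | 17.95 | 10.60 | 5.03 | 2.28 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 0.57 | 0.50 | 0.41 | 0.27 | 0.18 | 0.13 | 0.09 | 0.05 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 74.56 | 61.10 | 45.21 | 29.75 | 17.83 | 10.48 | 4.99 | 2.28 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 |
| gomp_team_barrier_wait_end | 0.00 | 1.60 | 2.56 | 2.72 | 2.44 | 2.60 | 1.78 | 2.06 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.06 | 0.66 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 |

Coverage (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 100.00 | 98.70 | 96.82 | 95.16 | 93.60 | 87.22 | 77.50 | 41.64 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 100.00 | 98.70 | 96.82 | 95.16 | 93.60 | 87.22 | 77.50 | 41.64 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 100.00 | 98.70 | 96.82 | 95.16 | 93.60 | 87.22 | 77.50 | 41.64 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 99.24 | 97.94 | 96.01 | 94.44 | 92.96 | 86.56 | 76.90 | 41.30 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 1.30 | 3.17 | 4.72 | 6.01 | 11.06 | 17.70 | 43.76 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.38 | 11.25 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Coverage Excluding Loops (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 0.76 | 0.76 | 0.81 | 0.72 | 0.65 | 0.66 | 0.60 | 0.34 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 99.24 | 97.94 | 96.01 | 94.44 | 92.96 | 86.56 | 76.90 | 41.30 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 1.30 | 3.17 | 4.72 | 6.01 | 11.06 | 17.70 | 43.76 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.38 | 11.25 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Max Inclusive Time Over Threads (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 132.29 | 67.14 | 34.23 | 17.35 | 8.81 | 4.74 | 2.14 | 0.96 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 132.29 | 67.14 | 34.27 | 17.36 | 8.83 | 4.75 | 2.16 | 0.98 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 132.29 | 67.14 | 34.27 | 17.36 | 8.83 | 4.75 | 2.16 | 0.98 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 131.29 | 66.59 | 33.96 | 17.20 | 8.74 | 4.69 | 2.12 | 0.96 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 1.74 | 1.93 | 1.58 | 1.20 | 1.16 | 0.75 | 0.87 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.03 | 0.28 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Max Exclusive Time Over Threads (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 1.00 | 0.55 | 0.31 | 0.16 | 0.09 | 0.06 | 0.04 | 0.02 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 131.29 | 66.59 | 33.96 | 17.20 | 8.74 | 4.69 | 2.12 | 0.96 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 1.74 | 1.93 | 1.58 | 1.20 | 1.16 | 0.75 | 0.87 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.03 | 0.28 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Inclusive Time w.r.t. Wall Time (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 132.29 | 66.28 | 33.18 | 16.61 | 8.33 | 4.16 | 1.79 | 0.63 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 132.29 | 66.28 | 33.18 | 16.61 | 8.33 | 4.16 | 1.79 | 0.63 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 132.29 | 66.28 | 33.18 | 16.61 | 8.33 | 4.16 | 1.79 | 0.63 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 131.29 | 65.77 | 32.90 | 16.49 | 8.27 | 4.13 | 1.77 | 0.63 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.87 | 1.09 | 0.82 | 0.53 | 0.53 | 0.41 | 0.67 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.17 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Exclusive Time w.r.t. Wall Time (s)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 1.00 | 0.51 | 0.28 | 0.13 | 0.06 | 0.03 | 0.01 | 0.01 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 131.29 | 65.77 | 32.90 | 16.49 | 8.27 | 4.13 | 1.77 | 0.63 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.87 | 1.09 | 0.82 | 0.53 | 0.53 | 0.41 | 0.67 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.17 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Nb Threads

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 |
| do_lookup_x | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gomp_team_barrier_wait_end | 0 | 1 | 4 | 8 | 16 | 32 | 64 | 96 |
| __aarch64_ldadd4_acq_rel | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| gomp_barrier_wait_end | 0 | 0 | 1 | 1 | 0 | 6 | 52 | 95 |
| gomp_team_start | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |

Deviation (coverage)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0.00 | 1.82 | 2.30 | 3.36 | 3.83 | 5.65 | 6.72 | 10.73 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 0.00 | 0.08 | 0.07 | 0.17 | 0.16 | 0.26 | 0.35 | 0.41 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 0.00 | 1.74 | 2.34 | 3.33 | 3.77 | 5.65 | 6.71 | 10.54 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 2.30 | 3.34 | 3.78 | 5.26 | 6.72 | 6.60 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.19 | 6.54 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Deviation (walltime)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0.00 | 1.24 | 0.79 | 0.59 | 0.34 | 0.27 | 0.15 | 0.17 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 0.00 | 0.06 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 0.00 | 1.18 | 0.81 | 0.58 | 0.34 | 0.27 | 0.15 | 0.17 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 0.79 | 0.58 | 0.34 | 0.25 | 0.15 | 0.10 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.09 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Categories

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | | | | | | | | |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | | | | | | | | |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | | | | | | | | |
| do_lookup_x | NA | NA | NA | NA | NA | NA | NA | System (%): 100.00 |
| gomp_team_barrier_wait_end | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |
| __aarch64_ldadd4_acq_rel | NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA |
| gomp_barrier_wait_end | NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |
| gomp_team_start | NA | NA | NA | NA | NA | NA | OMP (%): 100.00 | NA |

GFLOPS

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 4.01 | 8.02 | 16.05 | 32.22 | 64.72 | 130.47 | 273.83 | 410.89 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 3.74 | 7.23 | 13.25 | 28.75 | 64.73 | 120.47 | 243.31 | 350.85 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 4.01 | 8.02 | 16.08 | 32.25 | 64.72 | 130.55 | 274.06 | 411.38 |
| do_lookup_x | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| __aarch64_ldadd4_acq_rel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_barrier_wait_end | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| gomp_team_start | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Efficiency

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 1 | 1 | 1 | 1 | 0.99 | 0.99 | 1.16 | 2.18 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | | | | | | | | |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 1 | 0.99 | 0.91 | 1 | 1.09 | 1 | 1.14 | 2.04 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 1 | 1 | 1 | 1 | 0.99 | 0.99 | 1.16 | 2.18 |
| do_lookup_x | | | | | | | | |
| gomp_team_barrier_wait_end | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| __aarch64_ldadd4_acq_rel | | | | | | | | |
| gomp_barrier_wait_end | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| gomp_team_start | | | | | | | | |

Potential Speed-Up (%)

| Name | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_64_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|
| k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | 0 | 0.2 | 0.31 | 0.43 | 0.7 | 0.64 | 0 | 0 |
| Loop 1 - main.cpp:113-122 - kmeans-gcc-Ofast | | | | | | | | |
| Loop 3 - main.cpp:115-122 - kmeans-gcc-Ofast | 0 | 0.01 | 0.07 | 0 | 0 | 0 | 0 | 0 |
| Loop 2 - main.cpp:116-122 - kmeans-gcc-Ofast | 0 | 0.19 | 0.24 | 0.43 | 0.76 | 0.65 | 0 | 0 |
| do_lookup_x | | | | | | | | |
| gomp_team_barrier_wait_end | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| __aarch64_ldadd4_acq_rel | | | | | | | | |
| gomp_barrier_wait_end | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| gomp_team_start | | | | | | | | |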
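
The report does not state how its Efficiency column is derived. As a rough cross-check, the sketch below applies the textbook strong-scaling definition, T(1) / (n * T(n)), to the Inclusive Time w.r.t. Wall Time values reported for k_means. The formula and the helper names (`parallel_efficiency`, `efficiencies`) are assumptions made for illustration and are not taken from the profiler. Because the displayed times are rounded to two decimals, the results should only be expected to land close to the reported figures, not match them digit for digit.

```python
# Thread counts and "Inclusive Time w.r.t. Wall Time (s)" for k_means,
# copied from the report. Everything else here is illustrative.
THREADS = [1, 2, 4, 8, 16, 32, 64, 96]
INCLUSIVE_WALL_S = [132.29, 66.28, 33.18, 16.61, 8.33, 4.16, 1.79, 0.63]


def parallel_efficiency(t_ref, t_n, n_threads):
    """Textbook strong-scaling efficiency: T(1) / (n * T(n)).

    Assumption: this may differ from the formula the profiler uses.
    """
    return t_ref / (n_threads * t_n)


def efficiencies(threads, times):
    """Efficiency of every run, using the first (1-thread) run as reference."""
    t_ref = times[0]
    return [parallel_efficiency(t_ref, t, n) for n, t in zip(threads, times)]


if __name__ == "__main__":
    for n, e in zip(THREADS, efficiencies(THREADS, INCLUSIVE_WALL_S)):
        print(f"{n:>3} threads: efficiency ~ {e:.2f}")
```

Values above 1 at 64 and 96 threads follow directly from the reported times under this definition. Whether they reflect genuine super-linear scaling or a side effect of how time is attributed between k_means and the libgomp barrier routines cannot be decided from this table alone.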