
Functions and Loops

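Each table below covers one entry of the report, with one column per run. Metrics that have no value in the report for a given entry are omitted from its table.

Name: k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0]
Module: kmeans-gcc-O3-vectorize
Compilation Options: GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs -g -O3 -std=c++14 -fno-omit-frame-pointer -fopenmp -ftree-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 91.74 | 91.93 | 91.79 | 91.70 | 92.09 | 91.92 | 92.08 | 92.25 | 92.63 | 92.99 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 82.03 | 43.14 | 21.44 | 10.77 | 6.21 | 3.00 | 2.03 | 1.59 | 1.38 | 1.18 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 82.03 | 46.45 | 26.46 | 16.64 | 11.52 | 9.28 | 8.53 | 8.20 | 8.02 | 7.92 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 64 | 80 | 96 |
| Deviation (coverage) | 0.00 | 10.34 | 12.79 | 14.35 | 14.72 | 12.91 | 11.54 | 10.41 | 9.55 | 8.86 |
| Deviation (walltime) | 0.00 | 1.45 | 0.59 | 0.22 | 0.40 | 0.09 | 0.05 | 0.04 | 0.03 | 0.03 |
| Categories | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 |
| GFLOPS | 4.02 | 7.28 | 12.60 | 19.95 | 30.82 | 38.97 | 43.33 | 48.49 | 53.74 | 57.69 |
| Efficiency | 1 | 0.88 | 0.78 | 0.62 | 0.45 | 0.28 | 0.2 | 0.16 | 0.13 | 0.11 |
| Potential Speed-Up (%) | 0 | 10.76 | 20.63 | 35.2 | 51.08 | 66.54 | 73.62 | 77.83 | 80.78 | 82.96 |

Name: Loop 5 - main.cpp:58-67 - kmeans-gcc-O3-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 91.74 | 91.93 | 91.79 | 91.70 | 92.09 | 91.92 | 92.08 | 92.25 | 92.63 | 92.99 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 82.03 | 43.14 | 21.44 | 10.93 | 6.28 | 3.02 | 2.04 | 1.62 | 1.42 | 1.23 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 82.03 | 46.45 | 26.46 | 16.64 | 11.52 | 9.28 | 8.53 | 8.20 | 8.02 | 7.92 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: Loop 3 - main.cpp:60-67 - kmeans-gcc-O3-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 91.74 | 91.93 | 91.79 | 91.70 | 92.09 | 91.92 | 92.08 | 92.25 | 92.63 | 92.99 |
| Coverage Excluding Loops (%) | 2.85 | 2.89 | 2.62 | 2.47 | 2.77 | 2.72 | 2.63 | 2.64 | 2.93 | 2.76 |
| Max Inclusive Time Over Threads (s) | 82.03 | 43.14 | 21.44 | 10.93 | 6.28 | 3.02 | 2.04 | 1.62 | 1.42 | 1.23 |
| Max Exclusive Time Over Threads (s) | 2.55 | 1.54 | 0.65 | 0.40 | 0.22 | 0.11 | 0.10 | 0.08 | 0.08 | 0.07 |
| Inclusive Time w.r.t. Wall Time (s) | 82.03 | 46.45 | 26.46 | 16.64 | 11.52 | 9.28 | 8.53 | 8.20 | 8.02 | 7.92 |
| Exclusive Time w.r.t. Wall Time (s) | 2.55 | 1.46 | 0.76 | 0.45 | 0.35 | 0.27 | 0.24 | 0.23 | 0.25 | 0.24 |
| Nb Threads | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 64 | 80 | 96 |
| Deviation (coverage) | 0.00 | 0.26 | 0.28 | 0.75 | 0.59 | 0.80 | 0.92 | 1.16 | 1.08 | 1.27 |
| Deviation (walltime) | 0.00 | 0.31 | 0.04 | 0.06 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 |
| GFLOPS | 4.42 | 8.06 | 16.09 | 26.88 | 34.24 | 46.22 | 52.59 | 58.47 | 58.31 | 66.78 |
| Efficiency | 1 | 0.87 | 0.84 | 0.71 | 0.46 | 0.29 | 0.22 | 0.17 | 0.13 | 0.11 |
| Potential Speed-Up (%) | 0 | 0.37 | 0.41 | 0.72 | 1.49 | 1.93 | 2.06 | 2.19 | 2.56 | 2.45 |

Name: Loop 4 - main.cpp:60-65 - kmeans-gcc-O3-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 88.88 | 89.04 | 89.17 | 89.23 | 89.32 | 89.20 | 89.45 | 89.62 | 89.69 | 90.23 |
| Coverage Excluding Loops (%) | 88.88 | 89.04 | 89.17 | 89.23 | 89.32 | 89.20 | 89.45 | 89.62 | 89.69 | 90.23 |
| Max Inclusive Time Over Threads (s) | 79.48 | 41.59 | 20.79 | 10.53 | 6.05 | 2.91 | 1.94 | 1.54 | 1.34 | 1.16 |
| Max Exclusive Time Over Threads (s) | 79.48 | 41.59 | 20.79 | 10.53 | 6.05 | 2.91 | 1.94 | 1.54 | 1.34 | 1.16 |
| Inclusive Time w.r.t. Wall Time (s) | 79.48 | 44.99 | 25.70 | 16.19 | 11.17 | 9.01 | 8.28 | 7.96 | 7.77 | 7.69 |
| Exclusive Time w.r.t. Wall Time (s) | 79.48 | 44.99 | 25.70 | 16.19 | 11.17 | 9.01 | 8.28 | 7.96 | 7.77 | 7.69 |
| Nb Threads | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 64 | 80 | 96 |
| Deviation (coverage) | 0.00 | 10.60 | 12.52 | 13.86 | 14.28 | 12.51 | 11.24 | 10.18 | 9.31 | 8.71 |
| Deviation (walltime) | 0.00 | 1.13 | 0.56 | 0.24 | 0.40 | 0.09 | 0.04 | 0.04 | 0.04 | 0.03 |
| GFLOPS | 4.00 | 7.26 | 12.50 | 19.76 | 30.71 | 38.75 | 43.06 | 48.19 | 53.59 | 57.41 |
| Efficiency | 1 | 0.88 | 0.77 | 0.61 | 0.44 | 0.28 | 0.2 | 0.16 | 0.13 | 0.11 |
| Potential Speed-Up (%) | 0 | 10.39 | 20.22 | 34.49 | 49.59 | 64.61 | 71.56 | 75.64 | 78.22 | 80.51 |

Name: k_means(int, point_t*, point_t*, int*, point_t*, int, int)
Module: kmeans-gcc-O3-vectorize
Compilation Options: GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs -g -O3 -std=c++14 -fno-omit-frame-pointer -fopenmp -ftree-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Categories | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 |
| GFLOPS | 0.67 | 1.22 | 2.11 | 3.31 | 5.12 | 6.32 | 7.06 | 7.57 | 8.32 | 8.98 |
| Efficiency | 1 | 0.91 | 0.78 | 0.62 | 0.47 | 0.29 | 0.22 | 0.17 | 0.15 | 0.14 |
| Potential Speed-Up (%) | 0 | 0.75 | 1.77 | 3.17 | 4.12 | 5.54 | 5.99 | 6.13 | 5.86 | 5.63 |

Name: Loop 13 - main.cpp:56-96 - kmeans-gcc-O3-vectorize [...]

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: Loop 9 - main.cpp:56-95 - kmeans-gcc-O3-vectorize [...]

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: Loop 10 - main.cpp:56-95 - kmeans-gcc-O3-vectorize [...]

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: Loop 12 - main.cpp:56-95 - kmeans-gcc-O3-vectorize [...]

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Nb Threads | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: Loop 11 - main.cpp:81-84 - kmeans-gcc-O3-vectorize

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Coverage Excluding Loops (%) | 8.26 | 8.07 | 8.18 | 8.26 | 7.81 | 7.83 | 7.65 | 7.43 | 6.93 | 6.54 |
| Max Inclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Max Exclusive Time Over Threads (s) | 7.39 | 7.39 | 7.38 | 7.37 | 7.37 | 7.40 | 7.42 | 7.42 | 7.42 | 7.42 |
| Inclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Exclusive Time w.r.t. Wall Time (s) | 7.39 | 4.08 | 2.36 | 1.50 | 0.98 | 0.79 | 0.71 | 0.66 | 0.60 | 0.56 |
| Nb Threads | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| GFLOPS | 0.67 | 1.22 | 2.11 | 3.31 | 5.12 | 6.32 | 7.06 | 7.57 | 8.32 | 8.98 |
| Efficiency | 1 | 0.91 | 0.78 | 0.62 | 0.47 | 0.29 | 0.22 | 0.17 | 0.15 | 0.14 |
| Potential Speed-Up (%) | 0 | 0.75 | 1.77 | 3.17 | 4.12 | 5.54 | 5.99 | 6.13 | 5.86 | 5.63 |

Name: gomp_barrier_wait_end
Module: libgomp.so.1.0.0

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 0.00 | 0.00 | 0.01 | 0.03 | 0.06 | 0.14 | 0.12 | 0.14 | 0.23 | 0.26 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.01 | 0.03 | 0.06 | 0.14 | 0.12 | 0.14 | 0.23 | 0.26 |
| Max Inclusive Time Over Threads (s) | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.02 | 0.01 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.02 | 0.01 |
| Inclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 |
| Nb Threads | 0 | 0 | 1 | 4 | 9 | 20 | 14 | 28 | 33 | 44 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.02 | 0.05 | 0.09 | 0.24 | 0.01 | 0.30 | 0.24 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Categories | NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |

Name: gomp_team_barrier_wait_end
Module: libgomp.so.1.0.0

| Metric | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_16_threads | run_32_threads | run_48_threads | run_64_threads | run_80_threads | run_96_threads |
|---|---|---|---|---|---|---|---|---|---|---|
| Coverage (%) | 0.00 | 0.00 | 0.01 | 0.01 | 0.04 | 0.11 | 0.15 | 0.18 | 0.21 | 0.21 |
| Coverage Excluding Loops (%) | 0.00 | 0.00 | 0.01 | 0.01 | 0.04 | 0.11 | 0.15 | 0.18 | 0.21 | 0.21 |
| Max Inclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 |
| Max Exclusive Time Over Threads (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 |
| Inclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 |
| Exclusive Time w.r.t. Wall Time (s) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 |
| Nb Threads | 0 | 0 | 2 | 2 | 6 | 14 | 23 | 28 | 33 | 36 |
| Deviation (coverage) | 0.00 | 0.00 | 0.00 | 0.00 | 0.04 | 0.12 | 0.12 | 0.17 | 0.28 | 0.23 |
| Deviation (walltime) | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Categories | NA | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 |
| GFLOPS | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |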
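
The report does not say how the Efficiency and Potential Speed-Up (%) columns are derived. As a rough cross-check, the sketch below assumes that efficiency is the 1-thread Inclusive Time w.r.t. Wall Time divided by (number of threads × the run's Inclusive Time w.r.t. Wall Time), and that potential speed-up is Coverage (%) × (1 − efficiency). Both formulas are assumptions inferred from the numbers, not taken from the tool's documentation; all function and variable names are hypothetical. For the k_means [clone ._omp_fn.0] entry, these assumptions reproduce the listed values to within a few hundredths, which is what the two-decimal rounding of the inputs would allow.

```python
# Values copied from the k_means [clone ._omp_fn.0] entry of the report.
THREADS = [1, 2, 4, 8, 16, 32, 48, 64, 80, 96]
INCLUSIVE_WALL_S = [82.03, 46.45, 26.46, 16.64, 11.52, 9.28, 8.53, 8.20, 8.02, 7.92]
COVERAGE_PCT = [91.74, 91.93, 91.79, 91.70, 92.09, 91.92, 92.08, 92.25, 92.63, 92.99]


def efficiency(t_ref, t_run, n_threads):
    """Assumed definition: reference time / (threads * run time)."""
    return t_ref / (n_threads * t_run)


def potential_speedup_pct(coverage_pct, eff):
    """Assumed definition: coverage share that ideal scaling would recover."""
    return coverage_pct * (1.0 - eff)


def scaling_table(threads, times, coverages):
    """Return (threads, efficiency, potential speed-up %) per run.

    The first run is used as the reference, as in the report's run_1_thread.
    """
    t_ref = times[0]
    rows = []
    for n, t, cov in zip(threads, times, coverages):
        eff = efficiency(t_ref, t, n)
        rows.append((n, round(eff, 2), round(potential_speedup_pct(cov, eff), 2)))
    return rows


if __name__ == "__main__":
    for n, eff, gain in scaling_table(THREADS, INCLUSIVE_WALL_S, COVERAGE_PCT):
        print(f"{n:>3} threads: efficiency {eff:.2f}, potential speed-up {gain:.2f}%")
```

For the loop entries, the same arithmetic appears to fit when the exclusive time and Coverage Excluding Loops (%) are used instead, but that is likewise an inference from the listed numbers.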