Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution
1.19
1.20
1.21
No Scalar Integer
Potential Speedup
1.00
1.00
1.05
Nb Loops to get 80%
1
1
1
FP Vectorised
Potential Speedup
2.10
2.09
2.41
Nb Loops to get 80%
6
6
6
Fully Vectorised
Potential Speedup
2.15
2.14
3.04
Nb Loops to get 80%
6
6
7
Only FP Arithmetic
Potential Speedup
1.01
1.01
1.25
Nb Loops to get 80%
4
3
1
Cumulated Speedup If No Scalar Integer
Cumulated Speedup If FP Vectorized
Cumulated Speedup If Fully Vectorized
Cumulated Speedup If Only FP Arithmetic
Loop Based Profiles
Innermost / Single Loops
Inbetween Loops
Outermost Loops
Cumulated Coverage With All Loops
Innermost Loop Based Profiles
Coverage
Count
Application Categorization
Time
Coverage
Compilation Options
Source Object
Issue
▼md-clang-O2–
▼simulation.cpp–
○
-g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. Try to complement -g with -grecord-gcc-switches or -frecord-command-line.
○
-O2, -O3 or -Ofast is missing.
○
-march=(target) is missing.
Source Object
Issue
▼md-clang-O3–
▼simulation.cpp–
○
-g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. Try to complement -g with -grecord-gcc-switches or -frecord-command-line.
○
-O2, -O3 or -Ofast is missing.
○
-march=(target) is missing.
Source Object
Issue
▼md-clang-O3-ffast-math–
▼simulation.cpp–
○
-g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. Try to complement -g with -grecord-gcc-switches or -frecord-command-line.