Loop id | Source Location | Source Function | Level | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing run_0 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Speedup If Data in L1 run_0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
672 | exec - MultiBsplineRef.hpp:70-73 | miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&) | Innermost | 33.09 | 0.22 | 0.22 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 5 | 0 | 0 | 0 | 7.45 |
679 | exec - MultiBsplineRef.hpp:249-270 [...] | miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<qmcplusplus::TinyVector<double, 3u>, std::allocator<... | Innermost | 23.53 | 0.16 | 0.16 | 1 | 100 | 100 | 1.1 | 1 | 1 | 1 | 1 | 10 | 1 | 0 | 0 | 5.1 |
971 | exec - ParticleBConds.h:185-217 | void qmcplusplus::DTD_BConds<double, 3u, 39>::computeDistances<qmcplusplus::TinyVector<double, 3u>, qmcplusplus::VectorSoAContainer<double, 3u, 64ul, qmcplusplus::Mallocator<double, 64ul> >, qmcplusplus::VectorSoAContainer<dou... | Single | 11.03 | 0.07 | 0.07 | 1 | 90.91 | 89.22 | 1.03 | 1 | 1.01 | 1 | 9 | 0 | 0 | 0 | 1 | 1.1 |
654 | exec - BsplineAllocator.hpp:179-180 | qmcplusplus::BsplineAllocator<double, 64ul, qmcplusplus::Mallocator<double, 64ul> >::setCoefficientsForOrbitals(int, int, Array<double, 3u>&, multi_UBspline_3d_d*) [clone .extracted] | Innermost | 2.94 | 0.02 | 0.02 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 11337.47 |
230 | exec - BsplineFunctor.h:236-241 | qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const | Single | 2.21 | 0.01 | 0.01 | 1 | 92.68 | 71.67 | 1.15 | 1 | 1.53 | 1 | 0 | 1 | 0.5 | 0.5 | 0 | NA |
228 | exec - BsplineFunctor.h:246-260 | qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const | Single | 1.47 | 0.01 | 0.01 | 1 | 100 | 89.39 | 1.03 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 2 | NA |
780 | exec - inner_product.hpp:81-82 [...] | miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<... | Innermost | 1.47 | 0.01 | 0.01 | 1 | 25 | 15.63 | 1 | 2 | 6.86 | 1 | 0 | 2 | 1 | 0 | 0 | NA |
1264 | exec - | __intel_avx_rep_memset | Single | 0.74 | 0 | 0 | 1 | 100 | 50 | 1 | 1 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | NA |
1263 | exec - | __intel_avx_rep_memcpy | Single | 0.74 | 0 | 0 | 1 | 100 | 50 | 1 | 1 | 2 | 0 | 0 | 2 | 0 | 0 | 0 | NA |
778 | exec - inner_product.hpp:81-82 | miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::ratio(qmcplusplus::ParticleSet&, int) | Single | 0.74 | 0 | 0 | 1 | 100 | 100 | 1 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | NA |
991 | exec - ParticleIOUtility.h:70-91 [...] | void qmcplusplus::expandSuperCell<qmcplusplus::ParticleSet>(qmcplusplus::ParticleSet&, qmcplusplus::Tensor<int, 3u> const&) | Innermost | 0.74 | 0 | 0 | 1 | 42.62 | 17.32 | 1.58 | 1.69 | 7.72 | 0 | NA | NA | NA | NA | NA | NA |
241 | exec - TwoBodyJastrowRef.h:153-154 | miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | Innermost | 0.74 | 0 | 0 | 1 | 100 | 100 | 1 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | NA |