options

Expert Summary

Colums Filter

Source Function Level Max Time Over Threads (s) Time w.r.t. Wall Time (s) Coverage (% app. time) Speedup if no scalar integer Speedup if FP arith vectorized Speedup if fully vectorized Speedup if FP only Number of paths Vectorization Ratio (%) Vector Length Use (%) CQA cycles CQA cycles if no scalar integer CQA cycles if FP arith vectorized CQA cycles if fully vectorized CQA cycles if FP only Instance Count min (Iteration count) avg (Iteration count) max (Iteration count) min (Cycles per Iteration) avg (Cycles per Iteration) max (Cycles per Iteration) CAP(FP) BW(FP) SAT(FP) CAP(L1R) BW(L1R) SAT(L1R) CAP(L1W) BW(L1W) SAT(L1W) CAP(L2) BW(L2) SAT(L2) CAP(L3) BW(L3) SAT(L3) CAP(RAM_R) CAP(RAM_W)
IDModuleSource LocationSource FunctionLevelMax Time Over Threads (s)Time w.r.t. Wall Time (s)Coverage (% app. time)Speedup if no scalar integerSpeedup if FP arith vectorizedSpeedup if fully vectorizedSpeedup if FP onlyNumber of pathsVectorization Ratio (%)Vector Length Use (%)CQA cyclesCQA cycles if no scalar integerCQA cycles if FP arith vectorizedCQA cycles if fully vectorizedCQA cycles if FP onlyInstance Countmin (Iteration count)avg (Iteration count)max (Iteration count)min (Cycles per Iteration)avg (Cycles per Iteration)max (Cycles per Iteration)CAP(FP)BW(FP)SAT(FP)CAP(L1R)BW(L1R)SAT(L1R)CAP(L1W)BW(L1W)SAT(L1W)CAP(L2)BW(L2)SAT(L2)CAP(L3)BW(L3)SAT(L3)CAP(RAM_R)CAP(RAM_W)
Loop 831execMultiBsplineRef.hpp:70-73miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&)Innermost0.290.2930.211.001.202.001.201100.0050.006.006.005.003.005.002526724848489.8896.11582759.33NA16NANA64NANA32NANA32NANA15NANANA
Loop 1167execParticleBConds.h:185-217void qmcplusplus::DTD_BConds::computeDistances, qmcplusplus::VectorSoAContainer >, qmcplusplus::VectorSoAContainer > >(qmcplusplus::TinyVector const&, qmcplusplus::VectorSoAContainer > const&, double*, qmcplusplus::VectorSoAContainer >&, int, int, int) constSingle0.220.2223.441.041.702.071.28192.5446.2248.5046.5028.5023.4438.004771216105.4219278170.551747606.12NA16NANA64NANA32NANA32NANA15NANANA
Loop 840execTinyVectorOps.h:59-59,MultiBsplineData.hpp:71-71,MultiBsplineRef.hpp:249-270miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)Innermost0.190.1920.311.051.292.001.121100.0050.0025.7524.5020.0012.8823.007372848484836.12178.73583159.04NA16NANA64NANA32NANA32NANA15NANANA
Loop 812execBsplineAllocator.hpp:179-180qmcplusplus::BsplineAllocator >::setCoefficientsForOrbitals(int, int, Array&, multi_UBspline_3d_d*) [clone .extracted]Innermost0.010.011.561.001.252.001.251100.0050.001.251.251.000.631.00640009696965.1740.83306568.6NA16NANA64NANA32NANA32NANA15NANANA
Loop 1469exec__intel_avx_rep_memsetSingle0.010.011.561.001.002.008.001100.0050.008.008.008.004.001.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1246execNewTimer.h:119-121,stl_tree.h:782-1952std::map, double, std::less >, std::allocator const, double> > >::operator[](qmcplusplus::StackKeyParam<2> const&)Single0.010.011.561.001.008.002.0050.0012.505.005.005.000.632.50NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 302execBsplineFunctor.h:236-241qmcplusplus::BsplineFunctor::evaluateV(int, int, int, double const*, double*) constSingle0.010.011.041.221.003.082.00187.8838.9222.0018.0022.007.1411.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 930execinner_product.hpp:210-211qmcplusplus::DiracMatrix::invert_transpose(qmcplusplus::Matrix > const&, qmcplusplus::Matrix >&, double&, double&)Innermost000.521.071.002.903.75185.7141.073.753.503.751.291.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 240execOneBodyJastrowRef.h:196-197miniqmcreference::OneBodyJastrowRef >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector&)Single000.521.001.542.001.001100.0050.004.004.002.602.004.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1187execostream:667-667,Tensor.h:213-213,OperatorTags.h:43-183,char_traits.h:409-409,ParticleIOUtility.h:70-91,OhmmsVector.h:223-223,TinyVectorTensorOps.h:150-152,InfoStream.h:37-37void qmcplusplus::expandSuperCell(qmcplusplus::ParticleSet&, qmcplusplus::Tensor const&)Innermost000.521.581.697.722.74842.6217.4226.0016.5015.383.379.50NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 836execMultiBsplineRef.hpp:284-295miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)Innermost000.521.001.002.002.001100.0050.0018.0018.0018.009.009.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1173execDistanceTableBA.h:99-99,ParticleBConds.h:249-278qmcplusplus::DistanceTableBA::evaluate(qmcplusplus::ParticleSet&)Innermost000.521.021.445.191.33136.8418.8830.5030.0021.135.8823.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 834execMultiBsplineEvalHelper.hpp:47-49,TinyVectorOps.h:59-59,VectorSoAContainer.h:237-237,MultiBsplineData.hpp:68-79,MultiBsplineRef.hpp:193-295,einspline_spo_ref.hpp:206-208,stl_algobase.h:238-931,stl_vector.h:1126-1258miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)Outermost000.521.521.406.852.843235.9117.88102.2567.2573.2514.9436.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 325execTwoBodyJastrowRef.h:340-345miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int)Single000.521.001.332.001.001100.0050.004.004.003.002.004.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1468exec__intel_avx_rep_memcpySingle000.521.001.002.008.001100.0050.008.008.008.004.001.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 957execOperatorTags.h:63-63,inner_product.hpp:81-82,DiracDeterminantRef.cpp:157-157miniqmcreference::DiracDeterminantRef >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib, std::allocator > >&, qmcplusplus::ParticleAttrib >&, bool)Innermost000.521.002.006.861.00125.0015.638.008.004.001.178.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 955execinner_product.hpp:81-82miniqmcreference::DiracDeterminantRef >::ratio(qmcplusplus::ParticleSet&, int)Single000.521.001.542.001.001100.0050.004.004.002.602.004.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 833execTinyVector.h:146-146,OperatorTags.h:183-183,einspline_spo_ref.hpp:223-227miniqmcreference::einspline_spo_ref::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector >&, qmcplusplus::Vector, std::allocator > >&, qmcplusplus::Vector >&)Innermost000.521.001.008.005.0010.0012.505.005.005.000.631.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 1244execNewTimer.cpp:99-100qmcplusplus::TimerType::stop()Single000.521.001.0016.001.0010.006.252.002.002.000.132.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
Loop 323execTwoBodyJastrowRef.h:340-345miniqmcreference::TwoBodyJastrowRef >::acceptMove(qmcplusplus::ParticleSet&, int)Single000.521.001.332.001.001100.0050.004.004.003.002.004.00NANANANANANANANA16NANA64NANA32NANA32NANA15NANANA
×