Extracting optimal performance from your application requires efficient use of your system's microarchitecture and of parallelism at the core, socket, and node levels. This webinar shows how to do that on Cray supercomputers such as ALCF's Theta using two profiling tools from Intel: Intel VTune Amplifier and Intel Advisor.
After participating in this webinar, you will be able to answer questions such as:
- Is my code getting vectorized? What is my vector efficiency?
- Am I using memory wisely? Is this kernel/function DRAM or cache bound? What is my cache hit ratio?
- Am I using the full system? Are all cores doing work? What is preventing further scaling?
- Which routines do I need to optimize and in what way?
About the Speaker
Paulius Velesko is an on-site Intel Application Engineer at the Argonne Leadership Computing Facility. His focus is on node- and core-level application performance on many-core x86 architectures such as Knights Landing. Paulius has worked with a variety of applications, including XGC, a particle-in-cell plasma code; PhoSim, a Monte Carlo photon simulation; QMCPACK, an electronic structure code; and Devito, a domain-specific language for stencil compilation.
About ALCF Developer Sessions
This webinar series is aimed at increasing the dialogue between ALCF users and the developers of leadership-class systems and software. Attendees are encouraged to bring any questions related to ALCF systems.