ALCF, NERSC, OLCF co-host HPC software webinar series

announcements
Best Practices for HPC Software Developers webinar

The Argonne and Oak Ridge Leadership Computing Facilities (ALCF and OLCF), the National Energy Research Scientific Computing Center (NERSC), and the Interoperable Design of Extreme-scale Application Software (IDEAS) project are presenting a series of webinars—“Best Practices for HPC Software Developers”—to help users of high-performance computing (HPC) systems carry out their software development more productively.

The IDEAS project focuses on increasing software sustainability and developer productivity for high-performance applications in computational science. IDEAS is a collaboration among researchers at seven US Department of Energy (DOE) laboratories—Argonne National Laboratory, Lawrence Berkeley National Laboratory, Lawrence Livermore National Laboratory, Los Alamos National Laboratory, Oak Ridge National Laboratory (ORNL), Pacific Northwest National Laboratory, and Sandia National Laboratories—and the Colorado School of Mines.

One challenge IDEAS researchers are addressing is the development of an extreme-scale scientific software development kit (xSDK). The xSDK brings together a number of widely used US Department of Energy (DOE) numerical libraries into a common configuration and build environment to simplify usage and interoperability for software developers who rely on these libraries.

Another element is methodologies for software productivity, where researchers adapt, develop, and deploy best practices—drawing on approaches in the software engineering community and translating them into resources that are useful for the HPC computational science community. All work in the project is motivated and validated by applications in extreme-scale science, with primary emphasis on use cases in terrestrial modeling.  Another key aspect of work is outreach, which involves opening a dialogue with the community in the form of training and other events. Our outreach efforts draw upon the experience and content developed in the IDEAS project methodologies, xSDK, and use-case collaborations. 

The webinar series is the IDEAS project’s first broad outreach and training program. “This activity will allow people with substantial expertise in HPC to share their experiences with the community and engage in a conversation,” said David Bernholdt, ORNL Computer Science Research Group leader and IDEAS project outreach lead. “We’re trying to reach out, provide specific suggestions to help developers, and raise awareness that the topic of software developer productivity is an essential aspect of overall scientific productivity.”

Judy Hill, the OLCF liaison for the IDEAS project, added, “Much of the software that runs on Titan and at other facilities is for scientific computing, and we haven’t really focused our training efforts on the sustainability of that software or the engineering process. Historically, we haven’t done a lot to offer basic tools and processes that researchers could use to make their software development easier, which in turn would make it easier for facilities to work with them. That’s a niche that this project is filling by bringing attention to this cause. It’s beneficial not only to researchers and developers but also to the computing facilities.”

For the first session on May 4, Argonne computer scientist Anshu Dubey provided an overview of the project’s main tenets.

“We had a great response to the first webinar, with more than 150 people across the world participating,” said Katherine Riley, ALCF Director of Science and host of the May 4 session. “The strong interest in this series illustrates that there is a growing awareness of the importance of software design and engineering methodologies in the computational science community.”

In the second of the seven planned sessions on May 18, Argonne Distinguished Fellow Barry Smith covered developing, configuring, building, and deploying HPC software. Four of the presenters in the webinar series come from the IDEAS project. The other three come from the three DOE Office of Science User Facilities focused on supercomputing.

“NERSC is delighted to collaborate with the IDEAS project, ALCF, and the OLCF to present such a high-quality series of talks,” said Rebecca Hartman-Baker, the NERSC liaison for the IDEAS project. “The speakers lined up for this series are all first-rate and have very interesting ideas as well as practical advice to share with the HPC community.”

The series is being presented through a teleconferencing service to encourage participation from the audience. Attendees have opportunities to ask questions, and several sessions will include interactive demonstrations. The sessions are being recorded and posted online. So far, the webinar series has had a tremendous response with more than 300 registrations. For more information and to register, visit https://www.olcf.ornl.gov/training-event/webinar-series-best-practices-for-hpc-software-developers/.

“DOE has accomplished a tremendous amount through HPC, and the role of computational science is increasing rapidly,” Bernholdt said. “But as the expectations for computational science get higher, and the computer systems become more complex, the task of developing robust, high-quality software becomes more difficult. It becomes even more challenging as we look to future machines with greater architectural diversity, including the OLCF’s Summit, ALCF’s Aurora, and NERSC’s Cori, and further to exascale systems. It’s important to focus on the software development process and exchange information so that we can develop better software, since making software more reliable and scientifically robust will help us produce better science more quickly.”

Argonne National Laboratory seeks solutions to pressing national problems in science and technology. The nation's first national laboratory, Argonne conducts leading-edge basic and applied scientific research in virtually every scientific discipline. Argonne researchers work closely with researchers from hundreds of companies, universities, and federal, state and municipal agencies to help them solve their specific problems, advance America's scientific leadership and prepare the nation for a better future. With employees from more than 60 nations, Argonne is managed by UChicago Argonne, LLC for the US. Department of Energy's Office of Science.

Oak Ridge National Laboratory is supported by the US Department of Energy’s Office of Science. The single largest supporter of basic research in the physical sciences in the United States, the Office of Science is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.

The National Energy Research Scientific Computing Center (NERSC) is the primary high-performance computing facility for scientific research sponsored by the U.S. Department of Energy's Office of Science. Located at Lawrence Berkeley National Laboratory, the NERSC Center serves more than 6,000 scientists at national laboratories and universities researching a wide range of problems in combustion, climate modeling, fusion energy, materials science, physics, chemistry, computational biology, and other disciplines. Berkeley Lab is a US Department of Energy national laboratory located in Berkeley, California. It conducts unclassified scientific research and is managed by the University of California for the US DOE Office of Science. Learn more about computing sciences at Berkeley Lab.