Argonne Leadership Computing Facility

Message Passing on Data-Parallel Architectures

Jeff Stuart

Seminar

05/15/2008, 7pm CT

Argonne National Laboratory

Building 221, Conference Room A216

Host: Rob Ross

Event Sponsor: Mathematics and Computer Science Division Seminar

Implementing a message passing interface usable by data-parallel processors poses many challenges. To explore them, we design and implement the "DCGN" (pronounced "decagon") API on NVIDIA GPUs; it is nearly identical to MPI and allows full access to the underlying architecture. We introduce the notion of data-parallel thread-groups as a way to map resources to MPI ranks. Our method also allows the data-parallel processors to run autonomously from user-written CPU code. To facilitate communication, we use a sleep-based polling system to store and retrieve messages. Unlike previous systems, our method provides both performance and flexibility. By running a test suite of applications with different communication requirements, we find that a tolerable amount of overhead is incurred, between one and five percent depending on the application, and we indicate where this overhead accumulates. We conclude that with innovations in chipsets and drivers, this overhead will be mitigated, yielding performance similar to, if not better than, that of typical CPU-based MPI implementations.
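
The sleep-based polling idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the DCGN API: all names here (`Mailbox`, `comm_thread`, `send`, `recv`, `run_demo`) are hypothetical, ordinary CPU threads stand in for GPU thread-groups, and a Python list stands in for memory visible to both sides. The assumption is that each rank owns one request slot, and a communication thread wakes periodically, services any pending requests, and sleeps again.

```python
import threading
import time


class Mailbox:
    """One request slot and one inbox per rank (hypothetical layout)."""

    def __init__(self, num_ranks):
        self.requests = [None] * num_ranks          # pending (dest, payload) per rank
        self.inboxes = [[] for _ in range(num_ranks)]
        self.lock = threading.Lock()


def comm_thread(mailbox, stop, poll_interval=0.001):
    """Poll every slot, deliver pending messages, then sleep until the next poll."""
    while not stop.is_set():
        with mailbox.lock:
            for src, req in enumerate(mailbox.requests):
                if req is not None:
                    dest, payload = req
                    mailbox.inboxes[dest].append((src, payload))
                    mailbox.requests[src] = None    # mark the slot as serviced
        time.sleep(poll_interval)


def send(mailbox, src, dest, payload, poll_interval=0.001):
    """Post a message in the sender's slot and wait until the poller clears it."""
    with mailbox.lock:
        mailbox.requests[src] = (dest, payload)
    while True:
        with mailbox.lock:
            if mailbox.requests[src] is None:
                return
        time.sleep(poll_interval)


def recv(mailbox, rank, poll_interval=0.001):
    """Sleep-poll the rank's inbox until a message arrives; return (source, payload)."""
    while True:
        with mailbox.lock:
            if mailbox.inboxes[rank]:
                return mailbox.inboxes[rank].pop(0)
        time.sleep(poll_interval)


def run_demo():
    """Rank 0 sends one message to rank 1 through the polling thread."""
    mailbox = Mailbox(num_ranks=2)
    stop = threading.Event()
    poller = threading.Thread(target=comm_thread, args=(mailbox, stop))
    poller.start()
    sender = threading.Thread(target=send, args=(mailbox, 0, 1, "hello"))
    sender.start()
    received = recv(mailbox, 1)
    sender.join()
    stop.set()
    poller.join()
    return received


if __name__ == "__main__":
    print(run_demo())
```

The poll interval sets the trade-off in this sketch: a shorter sleep lowers message latency but spends more CPU time checking empty slots, which is one plausible place for the kind of overhead the abstract describes to accumulate.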