DataStates-LLM: Lazy Asynchronous Checkpointing for Large Language Models Publications HPDC '24: Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing
EvoStore: Towards Scalable Storage of Evolving Learning Models Publications HPDC '24: Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing
Initial Experiences with DAOS Object Storage on Aurora Publications SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
Copper: Cooperative Caching Layer for Scalable Data Loading in Exascale Supercomputers Publications SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
DFTracer: An Analysis-Friendly Data Flow Tracer for AI-Driven Workflows Publications SC24: International Conference for High Performance Computing, Networking, Storage and Analysis
MalleTrain: Deep Neural Networks Training on Unfillable Supercomputer Nodes Publications ICPE '24: Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering
Advanced Dual-Atom Catalysts on Graphitic Carbon Nitride for Enhanced Hydrogen Evolution via Water Splitting Publications Nanoscale
Orbital-Engineered Anomalous Hall Conductivity in Stable Full Heusler Compounds: A Pathway to Optimized Spintronics Publications Journal of Materials Chemistry C
Topological Fermi-Arc Surface State Covered by Floating Electrons on a Two-Dimensional Electride Publications Nature Communications
Performance-Portable Binary Neutron Star Mergers with AthenaK Publications The Astrophysical Journal Supplement Series