Cobalt Scheduler

Orginally COBALT (Component-Based Lightweight Toolkit) was a set of component-based system software for system and resource management developed within Argonne’s Mathematics and Computer Science Division, the ALCF has adopted the resource scheduling component and continued to enhance it for use within the facility. ALCF sees resource scheduling a major component of future facilities and its research/development efforts are focused on future needs.

Primary Contact: 

Bill Allcock, allcock@anl.gov

Other Collaborators: 

Narayan Desai, Zhiling Lan

Publications: 

Zhou Zhou, Xu Yang, Zhiling Lan, Paul Rich, Wei Tang, Vitali Morozov, Narayan Desai, "Bandwidth-Aware Resource Management for Extreme Scale Systems”, IEEE/ACM International Conference for High Performance Computing, Networking, Storage, and Analysis, November 16 - 21, 2014. [Poster]

Wei Tang, Narayan Desai, Daniel Buettner, Zhiling Lan, "Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P”,
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), April 19-23 2010, pp. 1 - 11.

Wei Tang, Zhiling Lan, Narayan Desai, Daniel Buettner, "Fault-aware, utility-based job scheduling on Blue, Gene/P systems”,
IEEE International Conference on Cluster Computing and Workshops, 2009 (CLUSTER ’09), 08/31 - 09/04 2009, pp. 1 - 10.