Data Science Modules

Help Desk

Hours: 9:00am-5:00pm CT M-F
Telephone: 866-508-9181 (Toll-Free, US Only) or 630-252-3111
Email: support@alcf.anl.gov

Theta

Data Science Modules

ALCF Data Science group provides modules to simplify the usage of common data science tools, such as TensorFlow, PyTorch, Horovod, and mpi4py. Users can see a list of available datascience modules with `module avail datascience`. More information about each module can be found by executing `module show <MODULENAME>`.

datascience/mpi4py

This module loads the environment required to run MPI for Python (mpi4py) package. The version of mpi4py is 3.0.1a0.

Note: This module loads intelpython35 and gcc/7.3.0 modules. 

datascience/tensorflow-X

This module loads the environment required to run TensorFlow on Theta. Available versions are 1.4, 1.6, 1.8, 1.10, and 1.12. We also provide 1.13.0rc0, but note that this version is a release candidate, so we recommend to use 1.12, which is the current stable release version.

Note: This module loads intelpython35 and gcc/7.3.0 modules. You will get a core dump if you try to use TensorFlow on the login node, since TensorFlow library was compiled to use AVX512F instructions, which are available on compute nodes.

datascience/horovod-X

This module loads the environment required to run Horovod on Theta. Horovod is a distributed deep learning framework for TensorFlow, Keras, PyTorch, and MXNet. Available versions are 0.13.11, 0.14.1, 0.15.0, and 0.15.2. 

Note: This module loads intelpython35 and gcc/7.3.0 modules. However, it doesn’t load TensorFlow, Keras, or PyTorch. You have to load one of these modules in order to use it together with horovod.

datascience/keras-X

This module loads the environment required to run Keras, which is a high-level Python API to run Tensorflow, CNTK, or Theano. Currently, only version 2.2.2 is available on Theta and it automatically loads TensorFlow 1.10.

Note: This module loads intelpython35, gcc/7.3.0, and datascience/tensorflow-1.10 modules. 

datascience/pytorch-X

This module loads the environment required to run PyTorch, a deep learning platform with Python and C++ API. Available versions are 0.5 and 1.0. 

Note: This module loads intelpython35, gcc/7.3.0, and datascience/tensorflow-1.10 modules. 

Systems