Intro to AI Series: LLM: Embeddings and Tokenization

Archit Vasan, ALCF
Student Training/Education
STS Session 5 Graphic Featuring the Presenter

From February 6 through March 26, 2024, the ALCF will host an 8-part weekly virtual training series to teach undergraduates and graduates the fundamentals of using world-class supercomputers to advance the use of AI for research.

Intro to AI Series: Session 5

Trainees will learn about essential concepts of sequential data modeling, and modeling approaches such as transformers. They will also participate in a virtual tour of ALCF machines, including Aurora, Polaris, and the AI Testbeds.

Lecturer

Archit Vasan is a postdoctoral appointee in the Argonne Leadership Computing Facility with a background in computational biophysics. His research interests at ALCF involve the discovery of cancer drugs using machine Learning coupled to exascale computing. Archit received a BA in Physics and Mathematics from Austin College in 2016. He then received his PhD in Biophysics from the University of Illinois at Urbana-Champaign in 2023 under the guidance of Dr. Emad Tajkhorshid.

ALCF Machine Room Tour Guide

As a lead of the ALCF Catalyst team, Chris Knight works closely with researchers to help them accomplish their scientific goals using leadership computational resources. To address the unique challenges of efficiently using leadership-scale resources, Chris assists researchers with profiling and debugging their codes, discusses strategies and provides general guidance on code parallelization, I/O, load-balancing, workflow design, and data management. Important components of this work are training users on key high-performance computing topics and collaborating with researchers to advance their scientific mission.