LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Authors
Chitty-Venkata, K. T., S. Raskar, B. Kale, F. Ferdaus, A. Tanikanti, K. Raffenetti, V. Taylor, M. Emani, and V. Vishwanath
Publication Date
Name of Publication Source
SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher
IEEE
Conference Location
Atlanta, GA
DOI
10.1109/SCW63240.2024.00178