A Systems View of LLMs on TPUs
This book aims to demystify the art of scaling LLMs on TPUs. We try to explain how TPUs work, how LLMs actually run at scale, and how to pick parallelism schemes for training and inference that avoid communication bottlenecks.