Torch Titan¶
Forgather integration with Torch Titan
Projects¶
- llama3/ - Reproduces Torch Titan's Llama3 base configurations via Forgather, with a few derived configs demonstrating template-based config management.
- tiny_titan/ - First Forgather-native Torch Titan trainer using dependency injection for training assets; includes an FSDP config for a 117M Llama3 model.
- test_parallelisms/ - Compares various parallelism strategies (DDP, tensor parallel, pipeline parallel, etc.) against a single-GPU control with matched effective batch size.
For an overview of the Native Titan vs Forgather Titan approaches and general usage, see docs/trainers/torchtitan.md.