Supercharge Your Model Training
-
Updated
May 16, 2025 - Python
Supercharge Your Model Training
MoDM is a cache-aware, hybrid serving system that accelerates image generation by dynamically combining small and large diffusion models for efficient, high-quality output.
(Unofficial) building Hugging Face SmolLM-blazingly fast and small language model with PyTorch implementation of grouped query attention (GQA)
Add a description, image, and links to the ml-efficiency topic page so that developers can more easily learn about it.
To associate your repository with the ml-efficiency topic, visit your repo's landing page and select "manage topics."