Hugging Face downloads
The Hugging Face tab can browse, filter, and download local models for each runtime.
- llama.cpp uses the Hub
apps=llama.cppfilter and downloads individual GGUF files, including matching projectors when needed. - MLX downloads the full repo instead of one file, because tokenizer, config, safetensors shards, and processor files must stay together.
- Transformers downloads the full repo for PyTorch/Transformers loading.
- Download cards show repo/file progress, bytes, speed, ETA, and cancellation state across refreshes and tab changes.
- A Hugging Face token can be saved through the Hugging Face integration for gated/private models and higher Hub rate limits.