TIL - How to run Hugging Face models with Ollama

ai Dec 27, 2024

You can now deploy any GGUF model with Ollama, in just a few clicks!

You can use any GGUF quants created by the community on Hugging Face directly with Ollama, without the need to create a new Modelfile . It works like a charm with all llama.cpp compatible models, with all sizes, from 0.1B up to 405B parameters.

Simply filter GGUF models, select the quant type as per your requirement, and it is done!

ollama run hf.co/bartowski/Qwen2.5.1-Coder-7B-Instruct-GGUF:Q5_K_L

Recommended Collection of Models

More Information

Recommended for you

project

AI-Powered Cloud Configuration Review

a year ago • 4 min read

Hot Take - Learn Async Programming before LangChain and LangGraph

a year ago • 3 min read

openai

Tagging and Summarizing Articles with OpenAI and LangChain

a year ago • 2 min read

Testing Anti-Patterns You Need to Stop Using

TIL - Python's asyncio.Event Lets Coroutines Signal Each Other Without Polling

TIL - Observer Pattern Decouples Event Producers From Consumers

TIL - collections.deque with maxlen is a Zero-Effort Rolling Window