How to run AI models locally as a beginner?
Most people think running AI models locally is complicated. It's not. Anyone can run powerful AI models like DeepSeek, Llama, and Mistral on their own computer. This guide will show you how, even if you've never written a line of code.
Quick steps:
1. Download Jan (opens in a new tab)
Download Jan from jan.ai (opens in a new tab) - it's free and open source.
2. Choose a model that fits your hardware
Jan helps you pick the right AI model for your computer.
3. Start using AI locally
That's all to run your first AI model locally!
Jan's easy-to-use chat interface after installation.
Keep reading to learn key terms of local AI and the things you should know before running AI models locally.
How Local AI Works
Before diving into the details, let's understand how AI runs on your computer:
Why do we need special tools for local AI? Think of AI models like compressed files - they need to be "unpacked" to work on your computer. Tools like llama.cpp do this job:
- They make AI models run efficiently on regular computers
- Convert complex AI math into something your computer understands
- Help run large AI models even with limited resources
llama.cpp helps millions of people run AI locally on their computers.
What is GGUF and why do we need it?
Original AI models are huge and complex - like trying to read a book in a language your computer doesn't understand. Here's where GGUF comes in:
-
Problem it solves:
- Original AI models are too big (100s of GB)
- They're designed for specialized AI computers
- They use too much memory
-
How GGUF helps:
- Converts models to a smaller size
- Makes them work on regular computers
- Keeps the AI smart while using less memory
When browsing models, you'll see "GGUF" in the name (like "DeepSeek-R1-GGUF"). Don't worry about finding them - Jan automatically shows you the right GGUF versions for your computer.
Understanding AI Models
Think of AI models like apps on your computer - some are light and quick to use, while others are bigger but can do more things. When you're choosing an AI model to run on your computer, you'll see names like "Llama-3-8B" or "Mistral-7B". Let's break down what this means in simple terms.
The "B" in model names (like 7B) stands for "billion" - it's just telling you the size of the AI model. Just like how some apps take up more space on your computer, bigger AI models need more space on your computer.
- Smaller models (1-7B): Work great on most computers
- Bigger models (13B+): Need more powerful computers but can do more complex tasks
Jan Hub makes it easy to understand different model sizes and versions
Good news: Jan helps you pick the right model size for your computer automatically! You don't need to worry about the technical details - just choose a model that matches what Jan recommends for your computer.
What You Can Do with Local AI
Running AI locally gives you:
- Complete privacy - your data stays on your computer
- No internet needed - works offline
- Full control - you decide what models to use
- Free to use - no subscription fees
Hardware Requirements
Before downloading an AI model, consider checking if your computer can run it. Here's a basic guide:
The basics your computer needs:
- A decent processor (CPU) - most computers from the last 5 years will work fine
- At least 8GB of RAM - 16GB or more is better
- Some free storage space - at least 5GB recommended
What Models Can Your Computer Run?
Regular Laptop | 3B-7B models | Good for chatting and writing. Like having a helpful assistant |
Gaming Laptop | 7B-13B models | More capable. Better at complex tasks like coding and analysis |
Powerful Desktop | 13B+ models | Better performance. Great for professional work and advanced tasks |
Not Sure About Your Computer? Start with a smaller model (3B-7B) - Jan will help you choose one that works well on your system.
Getting Started with Models
Model Versions
When browsing models in Jan, you'll see terms like "Q4", "Q6", or "Q8". Here's what that means in simple terms:
These are different versions of the same AI model, just packaged differently to work better on different computers:
- Q4 versions: Like a "lite" version of an app - runs fast and works on most computers
- Q6 versions: The "standard" version - good balance of speed and quality
- Q8 versions: The "premium" version - highest quality but needs a more powerful computer
Pro tip: Start with Q4 versions - they work great for most people and run smoothly on regular computers!
Getting Models from Hugging Face
You'll often see links to "Hugging Face" when downloading AI models. Think of Hugging Face as the "GitHub for AI" - it's where the AI community shares their models. Jan makes it super easy to use:
- Jan has a built-in connection to Hugging Face
- You can download models right from Jan's interface
- No need to visit the Hugging Face website unless you want to explore more options
Setting up your local AI
Getting Models from Hugging Face
You'll often see links to "Hugging Face" when downloading AI models. Think of Hugging Face as the "GitHub for AI" - it's where the AI community shares their models. This sounds technical, but Jan makes it super easy to use:
- Jan has a built-in connection to Hugging Face
- You can download models right from Jan's interface
- No need to visit the Hugging Face website unless you want to explore more options
What powers local AI? Jan uses llama.cpp (opens in a new tab), an inference that makes AI models run efficiently on regular computers. It's like a translator that helps AI models speak your computer's language, making them run faster and use less memory.
1. Get Started
Download Jan from jan.ai (opens in a new tab) - it sets everything up for you.
2. Get an AI Model
You can get models two ways:
1. Use Jan Hub (Recommended):
- Click "Download Model" in Jan
- Pick a recommended model
- Choose one that fits your computer
Use Jan Hub to download AI models
2. Use Hugging Face:
Important: Only GGUF models will work with Jan. Make sure to use models that have "GGUF" in their name.
Step 1: Get the model link
Find and copy a GGUF model link from Hugging Face (opens in a new tab)
Look for models with "GGUF" in their name
Step 2: Open Jan
Launch Jan and go to the Models tab
Navigate to the Models section in Jan
Step 3: Add the model
Paste your Hugging Face link into Jan
Paste your GGUF model link here
Step 4: Download
Select your quantization and start the download
Choose your preferred model size and download
Common Questions
"My computer doesn't have a graphics card - can I still use AI?"
Yes! It will run slower but still work. Start with 7B models.
"Which model should I start with?"
Try a 7B model first - it's the best balance of smart and fast.
"Will it slow down my computer?"
Only while you're using the AI. Close other big programs for better speed.
Need help?
Join our Discord community (opens in a new tab) for support.