Model Library

Every AI model. One place.

Run open-source models locally or connect cloud APIs with your own keys. All models work through the CLI and desktop app.

16 local·13 cloud·Updated regularly

Llama 3.2

Popular
Local

Meta's latest compact model. Great for general chat and coding.

3B2.0 GB
localgpt pull llama3.2

Llama 3.1

Local

Meta's flagship open-source model. Excellent reasoning.

8B4.7 GB
localgpt pull llama3.1

Llama 3.3

Local

Meta's largest open model. State-of-the-art performance.

70B40 GB
localgpt pull llama3.3

Mistral 7B

Popular
Local

Mistral AI's efficient 7B model. Fast and capable.

7B4.1 GB
localgpt pull mistral

Mixtral 8x7B

Local

Mixture-of-experts model. High quality at efficient inference.

47B MoE26 GB
localgpt pull mixtral

Phi-3 Mini

Local

Microsoft's compact model. Punches above its weight.

3.8B2.3 GB
localgpt pull phi3

Gemma 2

Local

Google's open model. Strong multilingual and reasoning.

9B5.4 GB
localgpt pull gemma2

DeepSeek R1

New
Local

DeepSeek's reasoning model. Excellent for math and logic.

7B4.7 GB
localgpt pull deepseek-r1

Qwen 2.5

Local

Alibaba's multilingual model. Strong on Chinese and English.

7B4.4 GB
localgpt pull qwen2.5

DeepSeek Coder V2

Popular
Local

Specialized coding model. Supports 300+ languages.

16B8.9 GB
localgpt pull deepseek-coder-v2

Code Llama

Local

Meta's code-specialized Llama. Great for code completion.

7B3.8 GB
localgpt pull codellama

Qwen 2.5 Coder

Local

Qwen's code-tuned variant. Excellent code generation.

7B4.4 GB
localgpt pull qwen2.5-coder

Stable Diffusion XL

Popular
Local

Generate images locally. No API key needed.

3.5B6.5 GB
localgpt pull stable-diffusion

LLaVA

Local

Vision-language model. Understands images and text.

7B4.5 GB
localgpt pull llava

Whisper Large V3

Local

OpenAI's speech recognition. Transcribe audio locally.

1.5B3.1 GB
localgpt pull whisper

Nomic Embed

Local

High-quality text embeddings for RAG and search.

137M274 MB
localgpt pull nomic-embed-text

GPT-4o

Popular
openai

OpenAI's flagship multimodal model.

$2.50 / $10 per 1M tokens
localgpt run gpt-4o

GPT-4o Mini

openai

Fast and affordable. Great for most tasks.

$0.15 / $0.60 per 1M tokens
localgpt run gpt-4o-mini

o1

openai

OpenAI's reasoning model for complex problems.

$15 / $60 per 1M tokens
localgpt run o1

DALL-E 3

openai

OpenAI's image generation model.

$0.04–$0.12 per image
localgpt run dall-e-3

Claude Sonnet 4

New
anthropic

Anthropic's balanced model. Great reasoning.

$3 / $15 per 1M tokens
localgpt run claude-sonnet-4

Claude Opus 4

anthropic

Anthropic's most capable model.

$15 / $75 per 1M tokens
localgpt run claude-opus-4

Claude 3.5 Haiku

anthropic

Fast and affordable. Excellent for quick tasks.

$0.80 / $4 per 1M tokens
localgpt run claude-3.5-haiku

Gemini 2.0 Flash

google

Google's fast multimodal model.

$0.10 / $0.40 per 1M tokens
localgpt run gemini-2.0-flash

Gemini 1.5 Pro

google

Google's most capable. 1M token context.

$1.25 / $5 per 1M tokens
localgpt run gemini-1.5-pro

Imagen 3

google

Google's image generation model.

$0.03 per image
localgpt run imagen-3

Mistral Large

mistral

Mistral's most capable cloud model.

$2 / $6 per 1M tokens
localgpt run mistral-large

DeepSeek V3

deepseek

DeepSeek's flagship cloud model.

$0.27 / $1.10 per 1M tokens
localgpt run deepseek-chat

DeepSeek R1 (Cloud)

deepseek

Cloud-hosted reasoning model.

$0.55 / $2.19 per 1M tokens
localgpt run deepseek-reasoner

Showing 29 of 29 models

Ready to try?

Install the CLI and start running models in seconds.