Every AI model. One place.
Run open-source models locally or connect cloud APIs with your own keys. All models work through the CLI and desktop app.
Llama 3.2
PopularMeta's latest compact model. Great for general chat and coding.
localgpt pull llama3.2Llama 3.1
Meta's flagship open-source model. Excellent reasoning.
localgpt pull llama3.1Llama 3.3
Meta's largest open model. State-of-the-art performance.
localgpt pull llama3.3Mistral 7B
PopularMistral AI's efficient 7B model. Fast and capable.
localgpt pull mistralMixtral 8x7B
Mixture-of-experts model. High quality at efficient inference.
localgpt pull mixtralPhi-3 Mini
Microsoft's compact model. Punches above its weight.
localgpt pull phi3Gemma 2
Google's open model. Strong multilingual and reasoning.
localgpt pull gemma2DeepSeek R1
NewDeepSeek's reasoning model. Excellent for math and logic.
localgpt pull deepseek-r1Qwen 2.5
Alibaba's multilingual model. Strong on Chinese and English.
localgpt pull qwen2.5DeepSeek Coder V2
PopularSpecialized coding model. Supports 300+ languages.
localgpt pull deepseek-coder-v2Code Llama
Meta's code-specialized Llama. Great for code completion.
localgpt pull codellamaQwen 2.5 Coder
Qwen's code-tuned variant. Excellent code generation.
localgpt pull qwen2.5-coderStable Diffusion XL
PopularGenerate images locally. No API key needed.
localgpt pull stable-diffusionLLaVA
Vision-language model. Understands images and text.
localgpt pull llavaWhisper Large V3
OpenAI's speech recognition. Transcribe audio locally.
localgpt pull whisperNomic Embed
High-quality text embeddings for RAG and search.
localgpt pull nomic-embed-textGPT-4o
PopularOpenAI's flagship multimodal model.
localgpt run gpt-4oGPT-4o Mini
Fast and affordable. Great for most tasks.
localgpt run gpt-4o-minio1
OpenAI's reasoning model for complex problems.
localgpt run o1DALL-E 3
OpenAI's image generation model.
localgpt run dall-e-3Claude Sonnet 4
NewAnthropic's balanced model. Great reasoning.
localgpt run claude-sonnet-4Claude Opus 4
Anthropic's most capable model.
localgpt run claude-opus-4Claude 3.5 Haiku
Fast and affordable. Excellent for quick tasks.
localgpt run claude-3.5-haikuGemini 2.0 Flash
Google's fast multimodal model.
localgpt run gemini-2.0-flashGemini 1.5 Pro
Google's most capable. 1M token context.
localgpt run gemini-1.5-proImagen 3
Google's image generation model.
localgpt run imagen-3Mistral Large
Mistral's most capable cloud model.
localgpt run mistral-largeDeepSeek V3
DeepSeek's flagship cloud model.
localgpt run deepseek-chatDeepSeek R1 (Cloud)
Cloud-hosted reasoning model.
localgpt run deepseek-reasonerShowing 29 of 29 models