Best Local LLM

Updated Daily

inventory_2 23 items

•

trending_up Scored across 12 criteria

•

Top Ranked

Best 1

Ollama (General Platform)

Easiest way to run various local LLMs. Works with JetBrains via Continue, Tabby, or custom scripts for code completion and chat.

Self Hosted Easy To Use AI Tool Code Completion Local Developer Chatbot Platform Ollama Local LLM

8.98 Excellent

Visit

Continue AI

Free Plan Available From Free (with paid tiers available for increased usage)

Continue AI is a highly flexible, open-source extension designed to act as a universal AI coding copilot. Its standout feature is its ability to connect to virtually any LLMlocal, cloud, or privatemak...

Continue AI Extension AI Open Source Flexible Reasoning Context Aware Developer Multi Model Agentic Workflow Coding Copilot Local LLM

8.69 Excellent

Visit

vLLM (Local Deployment)

vLLM is primarily a high-throughput serving engine, but its ability to run models locally makes it invaluable for developers building local AI services. It implements advanced techniques like PagedAtt...

Lm Studio Local Runner Research Experimental High Throughput Local Dev Serving Engine Paged Attention API Optimized GPU Accelerated

8.67 Excellent

Visit

vLLM (API Serving)

vLLM is primarily known for its high-throughput serving capabilities, utilizing advanced techniques like PagedAttention. While it's often used for cloud deployment, running it locally allows developer...

Jetbrains AI Local Performance Backend Service Throughput Batching API Serving

8.49 Excellent

Visit

Ollama Web UI (Open WebUI)

A feature-rich web interface for Ollama, providing a ChatGPT-like experience. Can be paired with LM Studio for model management.

AI Chatbot Web Interface Developer Web UI Interactive Experimental Ollama Integration LLM Runner Local LLM Opensource Chatgpt Style

8.48 Excellent

Visit

Llama 3 8B (via Ollama)

Llama 3 8B represents a massive leap in general reasoning and instruction following for local models. While not exclusively a coding model, its superior coherence and ability to follow complex, multi-...

Jetbrains Local LLM Performance Reasoning General Purpose Llama 3

8.38 Excellent

Visit

Ollama with CodeLlama-7B

Free Plan Available

This combination represents the gold standard for accessible local coding assistance. Ollama provides a simple, robust API layer, while CodeLlama offers specialized performance on code tasks. It is hi...

Jetbrains Local LLM Privacy Easy Setup Code Generation Code Completion Stable AI Assisted Ollama Codellama Local Development

8.22 Excellent

Visit

Mixtral 8x7B (via Ollama)

Mixtral provides massive effective parameter count and superior context handling due to its Mixture-of-Experts (MoE) architecture. This makes it phenomenal for understanding very large codebases or co...

Jetbrains Local LLM Advanced High Capacity Sparse Expert Context Heavy

8.18 Excellent

Visit

Mistral-Instruct-7B (via LM Studio)

Mistral-Instruct 7B delivers impressive code generation and conversational abilities within JetBrains IDEs. Its instruction tuning makes it highly responsive to developer prompts, providing accurate s...

Jetbrains Local LLM User Friendly General Purpose Code Generation Chat Lm Studio Instruct Local LLM Instruction Tuned

8.15 Excellent

Visit

DeepSeek Coder (via Ollama)

DeepSeek Coder is highly regarded in academic circles for its strong performance across a wide array of programming languages. It often provides superior accuracy in understanding niche or complex lan...

Jetbrains Local LLM Multi Language Academic Code Specialized

8.13 Excellent

Visit

Phi-3 Mini

Phi-3 Mini is a remarkably efficient and powerful local LLM, designed for developers seeking a lightweight solution for code completion and natural language processing. Its 8 billion parameters deliv...

Jetbrains Local LLM NLP Open Source Local Deployment Code Completion Code Analysis Developer Tool Debugging Small Model

8.05 Excellent

Jan AI

Jan AI aims to provide a polished, standalone desktop application experience for running local LLMs. It balances the ease of use of LM Studio with a more polished, integrated feel, making it accessibl...

Lm Studio Local Runner Desktop App User Friendly Privacy First Intuitive Interface Model Agnostic Local LLM Private AI

8.01 Excellent

Visit

StarCoder2 (via Local Inference)

StarCoder2, available through local inference frameworks, is a powerful open-source code generation model specifically trained on a massive dataset of code. Its architecture is designed for efficient...

Jetbrains Local LLM Multi Language Academic Code Generation Code Specialized Local Inference

7.94 Very Good

Visit

Microsoft Phi-3 Mini (via Ollama)

Microsoft's Phi-3 Mini is renowned for achieving surprisingly high performance given its small parameter count. When run via Ollama, it offers excellent reasoning capabilities in a very lightweight pa...

Jetbrains Local LLM Efficiency Microsoft Reasoning Small Model

7.94 Very Good

Visit

OpenHermes 2.5 Mistral

OpenHermes 2.5 Mistral is a refined version of the Mistral 7B model, specifically optimized for conversational AI. It boasts enhanced dialogue capabilities and improved code generation performance com...

Jetbrains Local LLM Offline Conversational Code Generation Chat Code Large Model Quantization Mistral Mistral 7B

7.89 Very Good

Visit

PrivateGPT

PrivateGPT is a powerful tool for building private AI assistants that leverage local LLMs and vector databases. It allows you to index your own documents, enabling the model to answer questions based...

Jetbrains Self Hosted AI Knowledge Management Document Search Rag Local LLM Vector Database Private AI Offline AI Developer Toolset

7.85 Very Good

Visit

CodeLlama-13B (via Ollama)

This model remains a benchmark for code generation specifically. The 13B variant offers a significant step up in code quality and complexity handling compared to the 7B version. It excels at generatin...

Jetbrains Local LLM Code Generation Robust Large Model Code Specialized Completion

7.85 Very Good

Visit

MLC-LLM

MLC-LLM is a powerful, hardware-agnostic framework designed to run machine learning models efficiently across various platforms, including mobile and edge devices. For local AI, it offers a unique adv...

Jetbrains AI Local Cross Platform Framework Hardware Agnostic Inference Engine Quantization Model Compilation Hardware Optimization

7.53 Very Good

Cursor IDE (Local LLM Mode)

This specific mode of Cursor allows advanced users to bypass cloud APIs entirely by connecting it to a locally running LLM via Ollama. This provides the highest level of data privacy and control, ensu...

Cursor Local LLM Privacy Self Hosted Advanced User Ollama

7.36 Very Good

Visit

MLC-LLM (Model Compilation)

MLC-LLM focuses on compiling and optimizing models specifically for the target hardware (CPU, GPU, Metal). This deep-level optimization can sometimes yield performance gains that general runners miss,...

Jetbrains AI Local Performance Optimization Framework Hardware Aware

7.33 Very Good

Google Gemma 2B (via Ollama)

Google's Gemma models provide a strong, open-weights alternative backed by Google's research. The 2B variant is extremely efficient, making it highly portable. While its coding specialization might tr...

Jetbrains Local LLM Google Efficiency Multimodality

7.26 Very Good

Mistral Large (via LM Studio)

Mistral Large, accessible through LM Studio, represents a significant leap in local LLM performance. Its 7B parameter Mixture of Experts architecture delivers exceptional code generation capabilities...

Jetbrains Local LLM NLP Open Source Code Generation Large Language Model Lm Studio Mixture Of Experts 7B Parameters Real Time Code Completion

7.14 Very Good

Visit

TinyLlama-1.1B (via Ollama)

For the absolute minimum resource requirement, TinyLlama is unmatched. It runs incredibly fast, even on low-power CPUs, making it perfect for simple, real-time autocomplete suggestions where latency i...

Jetbrains Local LLM Fast Beginner Friendly Autocomplete Minimal Code Assistance

6.87 Good

Visit

You've reached the end — 23 items