Best Local Runner

Updated Daily
inventory_2 17 items
trending_up Scored across 12 criteria

Rankings use category fit, feature coverage, pricing signals, public reception, and recency. Affiliate relationships do not affect scores.

Filter by Tags
0.0 - 10.0
Best 1 LM Studio
LM Studio

LM Studio is a revolutionary desktop application that simplifies running large language models locally. It provides a user-friendly interface for downloading, configuring, and deploying various open-s...

8.94 Excellent
Visit
2 LM Studio (itself as an alternative runner variant)
LM Studio (itself as an alternative runner variant)

The premier all-in-one local LLM runner with built-in model download, management, and inference. The benchmark for local AI.

8.78 Excellent
Visit
3 vLLM (Local Deployment)
vLLM (Local Deployment)

vLLM is primarily a high-throughput serving engine, but its ability to run models locally makes it invaluable for developers building local AI services. It implements advanced techniques like PagedAtt...

8.67 Excellent
Visit
4 Hugging Face Transformers (Local Inference)
Hugging Face Transformers (Local Inference)

While not a dedicated IDE plugin, utilizing the Hugging Face Transformers library directly within a Python script allows developers to load and run the absolute latest, state-of-the-art models locally...

8.41 Excellent
Visit
5 Text Generation WebUI
Text Generation WebUI

Text Generation WebUI is a highly popular open-source LLM inference web interface built around the llama.cpp library. Its renowned for its extensive feature set, including support for various quantiza...

8.26 Excellent
Visit
6 Mixtral 8x7B (via local runner)
Mixtral 8x7B (via local runner)

Mixtral is famous for its Mixture-of-Experts (MoE) architecture, allowing it to achieve performance rivaling much larger models while maintaining reasonable inference speeds when self-hosted. Running...

8.22 Excellent
7 Continue (Local Backend)
Continue (Local Backend)

Continue is a powerful VS Code/JetBrains extension that excels at providing a chat-like interface directly within the IDE, allowing you to interact with various local backends (like Ollama or llama.cp...

8.16 Excellent
Visit
8 llama.cpp-mac
llama.cpp-mac

llama.cpp-mac is a highly optimized port of the llama.cpp library specifically tailored for Apple Silicon Macs. Its designed to deliver exceptional inference performance, particularly with GGUF quanti...

8.10 Excellent
Visit
9 Jan AI
Jan AI

Jan AI aims to provide a polished, standalone desktop application experience for running local LLMs. It balances the ease of use of LM Studio with a more polished, integrated feel, making it accessibl...

8.01 Excellent
Visit
10 llama.cpp-python Bindings
llama.cpp-python Bindings

This package provides Python bindings directly to the highly optimized llama.cpp core. It is the preferred method for developers who want the raw speed and efficiency of llama.cpp but need to interact...

8.00 Excellent
11 DeepSeek Coder
DeepSeek Coder

DeepSeek Coder models are specifically trained on massive, high-quality code datasets, giving them a distinct edge in code generation accuracy across multiple languages. When run locally, they provide...

7.93 Very Good
Visit
12 GPT4All
GPT4All

GPT4All provides a streamlined way to run LLMs on CPUs. It's designed for users who dont have access to powerful GPUs, offering a surprisingly capable experience with optimized models. It focuses on...

7.81 Very Good
Visit
13 StarCoder2
StarCoder2

StarCoder2, trained by DeepMind and Hugging Face, is a highly respected, academically validated model for code generation. It excels at understanding the context provided by surrounding code blocks an...

7.69 Very Good
14 Phi-3 Mini (Local)
Phi-3 Mini (Local)

Microsoft's Phi-3 Mini is celebrated for achieving surprisingly high performance on complex tasks despite its relatively small parameter count. When run locally, it offers incredibly fast inference sp...

7.54 Very Good
15 KoboldAI
KoboldAI

While often marketed for creative writing and roleplaying, KoboldAI provides a robust local inference engine that can be adapted for coding tasks. Its strength lies in its highly configurable text gen...

7.33 Very Good
Visit
16 Code Llama (Local)
Code Llama (Local)

Code Llama, Meta's dedicated coding model, remains a foundational and highly stable choice for local development. It benefits from Meta's massive resources and is specifically tuned for coding tasks....

7.33 Very Good
17 GPT-Engineer (Local Adaptation)
GPT-Engineer (Local Adaptation)

GPT-Engineer is an agentic framework designed to take a high-level prompt and generate a complete, multi-file project structure. When adapted to use local models via Ollama or llama.cpp, it becomes a...

7.01 Very Good
You've reached the end — 17 items

Save to your list

Create your first list and start tracking the tools that matter to you.

Track favorites
Get updates
Compare scores

Already have an account? Sign in

Compare Items

See how they stack up against each other

Comparing
VS
Select 1 more item to compare