search
Get Started
search

Best LLM Runner

Updated Daily
inventory_2 10 items
Filter by Tags

Rankings use category fit, feature coverage, pricing signals, public reception, and recency. Affiliate relationships do not affect scores.

0.0 - 10.0
Best 1 LM Studio (itself as an alternative runner variant)

LM Studio is a desktop application designed to run large language models locally on your computer. It’s notable for its streamlined workflow allowing users to easily download, manage, and execute vari...

2 LocalAI

LocalAI is a powerful and versatile local LLM runner built around the idea of seamless model management. It excels in its intuitive interface, offering granular control over model parameters like temp...

3 Ollama Web UI (Open WebUI)

The Ollama Web UI offers an interactive web interface to run and experiment with local large language models. It’s notable for its ease of use and direct integration with the Ollama LLM runner. Develo...

4 llamafile

LlamaFile is a compact software package designed to run large language models locally. It presents a simple, interactive interface within a single executable file across various platforms. This makes...

5 ExLlamaV2

ExLlamaV2 is a specialized machine learning engine designed to accelerate the processing of Large Language Models like LLaMA. It’s notable for its speed and efficiency, particularly when utilizing GPU...

6 Candle (by Hugging Face)

The Candle project offers a lightweight software solution built in Rust designed to execute Large Language Models (LLMs). It’s notable for its minimalist design and suitability for resource-constraine...

7 koboldcpp

Koboldcpp is a minimalist C++ application designed for interactive fiction and roleplaying experiences. It provides an offline LLM runner based on llama.cpp, offering a streamlined interface suitable...

8 TabbyAPI

TabbyAPI is an open-source Python application that provides a locally hosted API server compatible with OpenAI’s interface. It facilitates development using LLMs from LM Studio, enabling offline exper...

9 Aphrodite Engine

The Aphrodite Engine is a machine-learning tool designed for local, offline deep learning experimentation. It’s notable for its support of tensor parallelism and PagedAttention, enabling the execution...

10 RWKV Runner

The RWKV Runner is a software tool designed to efficiently run RWKV large language models offline. It facilitates inference using both CPU and GPU hardware, providing a cross-platform solution for use...

You've reached the end — 10 items

Save to your list

Save your favorites and follow how their scores change over time.

Save favorites
Get updates
Compare scores

Already have an account? Sign in

Compare Items

See how they stack up against each other

Comparing
VS
Select 1 more item to compare