experiment · 13 · model landscape

AI Model Landscape

Two views of the field. Below the fold: 25 open-weights models small enough to host yourself (full-precision footprint under 500 GB), each with one-click links to source and weights, plus a hosted-inference button on Featherless where available. Then: a regularly updated multi-axis comparison of 11 cloud models scored across eight capability axes.

local · updated 2026-05-24 · cloud · updated 2026-05-24

Subjective curated scores. Click model chips to overlay up to four at once — the shape tells the story.

Claude Sonnet 4.7

Anthropic

The current default. Top-tier coding + tool use, 1M context.


Claude Opus 4.7

Anthropic

Anthropic's heavyweight. Deepest reasoning, premium tier.


Claude Haiku 4

Anthropic

Cheap + fast Anthropic. Surprisingly capable for the price.


GPT-4o

OpenAI

Multimodal workhorse. Solid all-rounder, native audio + vision.


Gemini 2.5 Pro

Google

Google's flagship. 2M context, video native, integrated thinking.


Gemini 2.5 Flash

Google

Cheap Gemini. Astonishingly long context for the price.


Grok 4

xAI

xAI's frontier model. Real-time X data, large context, growing tool stack.


DeepSeek V3.1

DeepSeek

Open-weights frontier. 671B MoE, hosted API at 1/10th the price.


Mistral Large 2

Mistral AI

European frontier. EU-data-resident inference, strong multilingual.


Command R+

Cohere

RAG-first cloud model. Citation-aware, tool-use solid.


Filter by vendor or kind. Click any link to jump to source, weights, or a hosted runner on Featherless.
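As a rough illustration of what filtering by vendor or kind amounts to, here is a minimal Python sketch. The `LocalModel` record and the `filter_models` helper are hypothetical names introduced for this example and are not the page's actual implementation; the sample rows are copied from a few of the local cards in this section.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class LocalModel:
    """Hypothetical record shape for one local-model card."""
    name: str
    vendor: str
    kind: str
    size_gb: float


# Sample rows taken from the cards in this section.
MODELS = [
    LocalModel("DeepSeek V2.5", "DeepSeek", "general", 472),
    LocalModel("DBRX Instruct", "Databricks", "general", 264),
    LocalModel("Mistral Large 2 (123B)", "Mistral AI", "general", 246),
    LocalModel("Command R+ 104B", "Cohere", "general", 208),
    LocalModel("Llama 3.2 90B Vision", "Meta", "vision", 180),
    LocalModel("SmolLM2 1.7B", "Hugging Face", "small", 3.4),
]


def filter_models(
    models: Iterable[LocalModel],
    vendor: Optional[str] = None,
    kind: Optional[str] = None,
) -> List[LocalModel]:
    """Keep models matching every filter that is set; None means 'any'."""
    return [
        m
        for m in models
        if (vendor is None or m.vendor == vendor)
        and (kind is None or m.kind == kind)
    ]


if __name__ == "__main__":
    for model in filter_models(MODELS, kind="general"):
        print(f"{model.name} ({model.vendor}, {model.size_gb} GB)")
```

Treating an unset filter as "match anything" lets the vendor and kind filters combine without special cases.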


DeepSeek V2.5

DeepSeek · 236B MoE (21B active) · 472 GB
general

Generalist and coder models merged into one MoE. Activates 21B parameters per token.

DeepSeek License

DBRX Instruct

Databricks · 132B MoE (36B active) · 264 GB
general

MoE built for the data-platform crowd. Long context, good code.

Databricks Open Model

Mistral Large 2 (123B)

Mistral AI · 123B · 246 GB
general

Mistral's flagship dense weights. Strong multilingual + function calling.

Mistral Research

Command R+ 104B

Cohere · 104B · 208 GB
general

RAG + tool-use specialist. Non-commercial weights.

CC-BY-NC 4.0

Llama 3.2 90B Vision

Meta · 90B · 180 GB
vision

Multimodal Llama. Image + text reasoning at frontier scale.

Llama 3.2 Community

SmolLM2 1.7B

Hugging Face · 1.7B · 3.4 GB
small

On-device class. Runs on a Raspberry Pi 5 with room to spare.

Apache 2.0
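The footprints on these cards are consistent with a simple rule of thumb: total parameter count times 2 bytes per parameter (16-bit weights). That rule is inferred from the listed numbers, not stated by the page, so treat the sketch below as an assumption. Note that for MoE models the footprint follows the total parameter count, not the active count, because every expert has to be resident in memory. `footprint_gb` is a hypothetical helper name.

```python
def footprint_gb(params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    params_billion * 1e9 parameters * bytes_per_param / 1e9 bytes per GB
    simplifies to params_billion * bytes_per_param.
    Assumes 2 bytes per parameter (16-bit weights) unless told otherwise.
    Ignores KV cache, activations, and runtime overhead.
    """
    return params_billion * bytes_per_param


# Total parameter counts (in billions) as listed on the cards above.
LISTED_PARAMS = {
    "DeepSeek V2.5": 236,        # MoE: 21B active, but all 236B must be loaded
    "DBRX Instruct": 132,        # MoE: 36B active
    "Mistral Large 2 (123B)": 123,
    "Command R+ 104B": 104,
    "Llama 3.2 90B Vision": 90,
    "SmolLM2 1.7B": 1.7,
}

if __name__ == "__main__":
    for name, params in LISTED_PARAMS.items():
        print(f"{name}: {footprint_gb(params):.1f} GB at 16-bit")
```

Passing a smaller `bytes_per_param` (for example 0.5 for 4-bit quantization) gives a rough idea of how far a quantized build shrinks the same model, again ignoring runtime overhead.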

Numbers are curated estimates — not a leaderboard. The shape of the radar matters more than any single number. Refreshed monthly from public/data/ai-models-cloud.json and public/data/ai-models-local.json.
