Installation
Download, run, useful in 3 minutes. Install now →
ModelReins is multi-provider AI routing you own. You download the Companion, it installs a local brain called Bob on your machine, and you dispatch AI jobs across your fleet of workers. Bob learns how you work, and his brain stays on your hardware — we don’t sync it back to our servers.
The Matriarch routes every job to the right model at the right effort level. Claude for architecture, Sonnet for code, Ollama for quick tasks — all from one interface. One subscription per provider, never double-booked. When a provider rate-limits, the router fails over to the next worker on your fleet, with your keys, against your subscription.
ModelReins is a dispatch relay, not a content host. The SaaS routes metadata (who, when, which worker, what capability, what status) — it doesn’t query the content of your prompts or your AI’s responses across tenant boundaries. Your prompts don’t transit a ModelReins-owned gateway on their way to the model; workers call providers directly from your infrastructure. Your AI conversations still go to your AI provider; ModelReins routes the work, not the words.
That’s it. Bob is on your machine. Your workers are online. The Matriarch is routing.
Installation
Download, run, useful in 3 minutes. Install now →
Providers
Compare all 7 providers — cost, speed, local vs cloud. See providers →
The Lexicon
Bob, the Director, the Rail, the Wall — learn the vocabulary. Read the lexicon →
Cost Optimization
Route expensive jobs to cheap models. Run the rest locally for free. Optimize costs →