An intelligent orchestration layer for LLMs.

Devtor optimizes cost, performance, and reliability by dynamically managing prompts, models, context, and execution strategies — continuously improving from past interactions.

Get started How it works

16orchestration capabilities

68%reduction in cost

CLIterminal & web workflows

16orchestration capabilities

68%reduction in cost

CLIterminal & web workflows

The platform

A control plane between you and your LLMs

Devtor orchestrates prompts, models, context, and execution — optimizing cost, performance, and reliability. It picks how, when, and which model to run, learns from every session, and works via CLI, Locally Hosted Webportal, or IDE extensions.

Core promise

The right model, context, and execution strategy for every task.

✓Hybrid super-agent + cheap sub-agents
✓Granular plans · pre-run cost estimates
✓Smart context · latest prompt only
✓Cross-user learning · shared fixes
✓CLI + Locally Hosted Webportal + IDE extensions

Prompt optimization

Tune prompts per model tier. Cheaper models, production output.

Smart context

Relevant messages and files only. No full-history replay.

Cost control

Estimates, budgets, smart routing. Power models when needed.

Organizational learning

Resolved failures indexed org-wide. Same problem, instant fix.

Bring Your Model (BYOM)

Your Infrastructure. Your Models.
Our Agent Orchestration.

Stop being locked into single AI vendors. Devtor is fully model-agnostic, giving you the power to mix local offline intelligence, specialized high-speed endpoints, and private enterprise deployments inside a unified routing layer.

Zero Provider Lock-In

Swap providers with a single config change. Connect commercial APIs, local nodes, or private instances.

Secure Local Workloads

Run offline tasks using Ollama, Llama.cpp, or vLLM. No external network request is ever sent for local workflows.

Unified Configuration Spec

A single standardized spec handles diverse model prompt templates, token limits, and fallback strategies behind the scenes.

Failover Resilience

Configure fallback lists. If your main model hits rate limits or experiences downtime, traffic transparently switches to your standby model.

Hybrid Local & Cloud

Most Popular

Run local open-source models for draft generation and security-critical files, and scale to premium cloud LLMs only when deep reasoning is needed.

Active Router Status

Claude 3.5 Sonnet

Anthropic API

Llama 3.1 8B

Ollama (Local)

DeepSeek-V3

DeepSeek API

Flow: Drafting, tests & simple refactors run 100% locally on your machine at zero cost. Multi-file plans automatically upgrade to Claude 3.5 Sonnet.

models.yaml

YAML Configuration

# ~/.config/devtor/models.yaml
providers:
  anthropic:
    api_key: env.ANTHROPIC_API_KEY
    models:
      - claude-3-5-sonnet-latest:
          tier: advanced
          routing: "reasoning, planning"

  ollama:
    host: "http://localhost:11434"
    models:
      - llama3.1:8b:
          tier: local
          routing: "drafting, refactoring, tests"

Compatible with standard OpenAI API specs and LiteLLM Proxy

How it works

End-to-end orchestration pipeline

See the three phases

InputCLI, Locally Hosted Webportal, or IDE extensions

PlanGranular plans · token estimates · budgets

ContextRelevant messages only · memory control

RouteSmart selection · prompt optimization · hybrid agents

ExecuteThink mode · model-aware · track spend

LearnCross-user fixes · repo rules · pattern cache

How it works