@austintgriffith
Created March 12, 2026 14:46

🧠 openrouter-jobrunner

Give it a job. It finds the best LLM. It runs it.

A single-file CLI tool that takes any task — generate an image, analyze a video, write code, summarize a PDF — queries the full OpenRouter model catalog in real time, picks the optimal model based on what the task actually needs, and executes it. One command, zero model selection.

Why

There are 400+ models on OpenRouter. You shouldn't have to know which one accepts video, which one is cheapest for code, or which one can do tool calling. Describe what you want done, and the jobrunner figures out the rest.

This is different from OpenRouter's built-in openrouter/auto — that routes across ~6 curated models. The jobrunner searches the entire catalog and picks based on:

  • Required modalities (text, image, video, audio, file input → text, image, audio output)
  • Budget (set a max $/million tokens, or let it optimize)
  • Capabilities (tool calling, structured output, reasoning)
  • Context window (need 100K+? 1M? filtered automatically)
  • Speed vs quality (prefer free models? cheapest? biggest context?)
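
Under those constraints, selection reduces to a filter-then-rank pass over the catalog. The sketch below is illustrative, not the tool's actual code: `pick_model` and its scoring are assumptions about how such a ranker might work. The field names (`architecture.input_modalities`, `pricing.prompt` as a $/token string, `context_length`) follow the public OpenRouter models API.

```python
def pick_model(models, need_input=None, max_input_cost=None, min_context=0):
    """Illustrative filter/rank pass over an OpenRouter /api/v1/models list."""
    candidates = []
    for m in models:
        mods = m.get("architecture", {}).get("input_modalities", [])
        if need_input and need_input not in mods:
            continue  # can't handle the required input type
        price = float(m.get("pricing", {}).get("prompt", "0")) * 1_000_000  # $/M tokens
        if max_input_cost is not None and price > max_input_cost:
            continue  # over budget
        if m.get("context_length", 0) < min_context:
            continue  # context window too small
        candidates.append((price, -m.get("context_length", 0), m))
    # Cheapest first; biggest context wins ties.
    candidates.sort(key=lambda t: (t[0], t[1]))
    return candidates[0][2] if candidates else None
```

Any real ranker would also weigh the preference flags, but the hard constraints (modality, budget, context) are simple eliminations like these.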

Quick Start

git clone https://gist.github.com/<GIST_ID>.git openrouter-jobrunner
cd openrouter-jobrunner
export OPENROUTER_API_KEY="sk-or-..."

# Text task — finds best text model
./jobrunner.sh "Write a bash script that monitors disk usage and sends alerts"

# Image analysis — finds a vision model
./jobrunner.sh "Describe what's in this image" --image https://example.com/photo.jpg

# Video analysis — finds a video-capable model
./jobrunner.sh "Summarize this video" --video https://example.com/clip.mp4

# Code generation — prefers coding models, structured output
./jobrunner.sh "Write a Solidity ERC-20 token with 6 decimals" --prefer coding

# Cheapest possible — free models first
./jobrunner.sh "Translate this to French: Hello world" --budget free

# Force a specific modality filter
./jobrunner.sh "Transcribe this audio" --input-modality audio

# Max budget: $1/M input tokens
./jobrunner.sh "Analyze this codebase for security issues" --max-input-cost 1.0

# See what model it would pick without running
./jobrunner.sh "Explain quantum computing" --dry-run

How It Works

┌──────────────────┐
│   Your Task      │  "Analyze this video for safety violations"
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Task Analyzer   │  Detects: needs video input, text output
│                  │  Infers: reasoning helpful, long output
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Model Catalog   │  GET /api/v1/models → 400+ models
│  (live query)    │  Filter: input_modalities includes "video"
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Ranker          │  Score by: modality fit → pricing → context
│                  │  Apply budget/preference constraints
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Execute         │  POST /api/v1/chat/completions
│                  │  model: "google/gemini-3.1-flash-lite-preview"
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Result          │  Output + metadata:
│                  │  "Used google/gemini-3.1-flash-lite-preview"
│                  │  "Cost: $0.0003 | Tokens: 847"
└──────────────────┘
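
The "Model Catalog" step is just a live GET against the models endpoint. A minimal sketch, using only the stdlib for self-containment (the real tool depends on requests); `video_capable` is an illustrative helper, not part of the tool:

```python
import json
import os
import urllib.request

OPENROUTER_MODELS_URL = "https://openrouter.ai/api/v1/models"

def load_catalog():
    """Fetch the live model catalog (the 'Model Catalog' step above)."""
    req = urllib.request.Request(
        OPENROUTER_MODELS_URL,
        headers={"Authorization": f"Bearer {os.environ.get('OPENROUTER_API_KEY', '')}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)["data"]  # list of model dicts

def video_capable(models):
    """Keep only models whose input modalities include 'video'."""
    return [
        m for m in models
        if "video" in m.get("architecture", {}).get("input_modalities", [])
    ]
```

Every run re-fetches the catalog, which is what keeps the modality and pricing data current.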

Files

File              What it does
jobrunner.sh      Entry point — parses args, orchestrates everything
jobrunner.py      Core logic — model discovery, ranking, execution
requirements.txt  Just requests (stdlib otherwise)

API Key

Get one at openrouter.ai/keys. Set it:

export OPENROUTER_API_KEY="sk-or-v1-..."

Or create a .env file:

OPENROUTER_API_KEY=sk-or-v1-...
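
A sketch of how the key might be resolved, environment variable first, then a simple KEY=value .env file; `resolve_api_key` is a hypothetical helper, not the tool's actual loader:

```python
import os

def resolve_api_key(env_file=".env"):
    """Prefer the environment variable; fall back to a KEY=value .env file."""
    key = os.environ.get("OPENROUTER_API_KEY")
    if key:
        return key
    if os.path.exists(env_file):
        with open(env_file) as f:
            for line in f:
                line = line.strip()
                if line.startswith("OPENROUTER_API_KEY="):
                    return line.split("=", 1)[1].strip().strip('"')
    raise RuntimeError("OPENROUTER_API_KEY not set")
```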

Advanced Usage

As a Python library

from jobrunner import JobRunner

runner = JobRunner(api_key="sk-or-...")

# Find best model without executing
match = runner.find_model(
    task="Generate a pixel art sprite sheet",
    input_modalities=["text"],
    output_modalities=["image"],
    max_input_cost=5.0,  # $/M tokens
)
print(f"Would use: {match['id']} at ${match['pricing']['prompt']}/token")

# Find and execute
result = runner.run("Write a haiku about Ethereum gas fees")
print(result.content)
print(f"Model: {result.model} | Cost: ${result.cost:.6f}")

As an OpenClaw skill

Drop SKILL.md + jobrunner.py into ~/.openclaw/skills/openrouter-jobrunner/ and any agent can use it:

"Use the openrouter-jobrunner skill to find the best model for analyzing this video and run it"

Preference flags

Flag                 Effect
--budget free        Only free models (pricing = "0")
--budget cheap       Sort by lowest cost first
--budget best        Sort by capability (largest context, most features)
--prefer coding      Boost models with "code" in name/description
--prefer reasoning   Boost models that support reasoning parameter
--prefer fast        Boost models with high max_completion_tokens
--input-modality X   Filter to models accepting X (text/image/video/audio/file)
--output-modality X  Filter to models outputting X (text/image/audio)
--min-context N      Minimum context window (e.g., 100000)
--max-input-cost N   Max $/M input tokens (e.g., 1.0)
--dry-run            Show selected model + reasoning, don't execute
--verbose            Show full model selection reasoning
--json               Output result as JSON
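
To make the --budget semantics concrete, here is one way the three modes could map onto filtering and sorting; `apply_budget` is an illustrative sketch, not the tool's implementation:

```python
def apply_budget(models, budget):
    """Sketch of what the --budget flags might do (field names per the models API)."""
    priced = [(float(m.get("pricing", {}).get("prompt", "0")), m) for m in models]
    if budget == "free":
        return [m for p, m in priced if p == 0]  # hard filter: only free models
    if budget == "cheap":
        return [m for p, m in sorted(priced, key=lambda t: t[0])]  # lowest cost first
    if budget == "best":
        # Largest context window as a rough capability proxy.
        return sorted(models, key=lambda m: -m.get("context_length", 0))
    return models
```

Note that "free" is a filter while "cheap" and "best" are orderings, which matches the table above: a free run can fail if no free model fits the modality constraints, whereas the other two always pick something.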

What Makes This Different

  1. Full catalog — searches all 400+ OpenRouter models, not a curated subset
  2. Modality-aware — actually checks if the model can handle your input type
  3. Live pricing — always uses current pricing from the API
  4. Transparent — tells you exactly which model it picked and why
  5. Budget control — from free to frontier, you set the ceiling
  6. Zero config — one API key, one command, done
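
Because pricing comes live from the catalog and the tool reports cost per run, the cost line in the output can be derived directly from usage token counts and the catalog's per-token pricing strings. A sketch, with `estimate_cost` as a hypothetical helper:

```python
def estimate_cost(model, prompt_tokens, completion_tokens):
    """Compute a run's cost from the catalog's per-token pricing strings."""
    pricing = model.get("pricing", {})
    return (prompt_tokens * float(pricing.get("prompt", "0"))
            + completion_tokens * float(pricing.get("completion", "0")))
```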

Examples

# "I need to understand what's happening in this security camera footage"
$ ./jobrunner.sh "Analyze this security footage for unusual activity" \
    --video ./footage.mp4 --prefer reasoning --verbose

πŸ” Task analysis:
   Input: text + video
   Output: text (analysis)
   Preference: reasoning models

πŸ“‹ Catalog: 412 models loaded
   After modality filter (video input): 23 models
   After preference boost (reasoning): 23 models (8 boosted)

πŸ† Selected: google/gemini-3.1-flash-lite-preview
   Reason: video input βœ“ | reasoning βœ“ | $0.25/M input | 1M context
   Runner-up: bytedance-seed/seed-2.0-lite ($0.25/M, 262K context)

⚑ Executing...

πŸ“ Result:
   [... analysis output ...]

πŸ’° Cost: $0.000847 | Tokens: 2,391 in / 1,203 out
# "Generate me some code"
$ ./jobrunner.sh "Write a Python FastAPI server with JWT auth, rate limiting, and PostgreSQL" \
    --prefer coding --budget cheap

πŸ† Selected: qwen/qwen3.5-9b
   Reason: textβ†’text βœ“ | coding boost βœ“ | $0.10/M input (cheapest coding match)

⚑ Executing...
[... full FastAPI code ...]
πŸ’° Cost: $0.000312

License

MIT — do whatever you want with it.
