FinOps · Open Source · Multi-Provider
Track every dollar by feature, team, and customer across OpenAI, Anthropic, and Google today — with Azure, AWS Bedrock, Groq, Mistral, and Together on the roadmap. Hard-cap budgets before the API call, not after the bill arrives.
Your invoice says gpt-4o: $4,287. It doesn't say which feature, team, or customer burned it. By the time you trace the spike, it's already on next month's card.
A bad deploy, a runaway agent, or one abusive customer can trigger thousands of API calls before any dashboard turns red. You find out when you open the bill.
OpenAI's usage page. Anthropic's console. Azure Cost Management. Bedrock CloudWatch. No unified view, no way to ask which feature is your biggest AI spend across all providers.
BurnLens runs a local proxy on :8420. Set OPENAI_BASE_URL or ANTHROPIC_BASE_URL and your existing SDK code routes through automatically. Less than 20ms overhead. Full streaming passthrough.
Three request headers — X-BurnLens-Tag-Feature, -Team, -Customer — attribute cost to any dimension you care about. Tags are stripped before reaching the AI provider. They never leave your machine.
Register an API key with a daily dollar limit. At 100%, BurnLens returns 429 before the call is forwarded upstream. 50% and 80% thresholds fire Slack or email alerts.
OpenAI, Anthropic, and Google spend in one view today. Azure, AWS Bedrock, Groq, Mistral, and Together are on the v0.2 / v0.3 roadmap. Model breakdowns, waste detection, and budget tracking — all reconciled to the provider bill.
Cursor, Claude Code, Cline, Windsurf — see cost per repo, developer, or PR. Hard daily caps per API key stop one runaway agent from burning the team's monthly budget overnight.
Tag each request with a customer ID. See which customers drive the most cost. Enforce per-customer monthly spend limits — BurnLens returns 429 before the call is forwarded.
Tag retrieval calls, tool calls, and generation separately. See whether your vector search or synthesis step is the cost driver — and whether it justifies the output quality.
Set per-team monthly budgets, get Slack alerts at 80% and 100%, and export monthly reports that reconcile line-by-line to the actual provider invoice.
| BurnLens | Helicone / Langfuse | Vantage / CloudZero | |
|---|---|---|---|
| Open source | ✓ | Partial | ✗ |
| Local-first (prompts stay local) | ✓ | ✗ | ✗ |
| Hard caps before API call | ✓ | ✗ | ✗ |
| Per-customer attribution | ✓ | ✓ | ✗ |
| Multi-cloud (Azure / AWS / GCP) | Roadmap (v0.2 / v0.3) | Partial | ✓ |
Free forever for individual developers. Pay only when your team needs cloud sync.
burnlens scan — reads Claude Code, Cursor, Codex, and Gemini CLI local logs; imports cost history without a proxy~/.claude/projects/ JSONL, deduplicates turns, attributes cost per session~/.gemini/tmp/burnlens run -- <cmd> attributes every call to repo / dev / PR / branch