Writing

Blog

Long-form on AI cost tracking, the math of LLM bills, and how to keep an AI SaaS profitable. New posts ship as they get written.

Cost tracking

How to Track AI Costs Per User (And Why the OpenAI Total Lies to You)

Your OpenAI bill came in at $4,200 this month. That single number tells you almost nothing. Here is how to track per-user cost, why the manual approach breaks at scale, and the two-line wrapper that automates it.

Read post →

Cost reduction

How to Reduce OpenAI Costs Without Breaking Your Product

Microsoft cut Claude Code for 100k engineers. Uber burned their 2026 AI budget in 4 months. Three real causes of a runaway OpenAI bill, and three fixes that do not degrade output quality.

Read post →

Unit economics

AI SaaS Margins: Why Your Best Customer Might Be Your Worst

Traditional SaaS runs at 80-90% gross margin. AI SaaS is structurally closer to 50-60%. Why power users hurt you, how to calculate margin per user, and what healthy AI SaaS margins actually look like.

Read post →

Agent loops

How to Detect and Stop AI Agent Reasoning Loops

A runaway agent can burn $100 in minutes. Here is what a reasoning loop is, why it is financially dangerous, how to detect it with token velocity, and how to get a Slack alert before it costs you thousands.

Read post →

Spending caps

How to Set Spending Caps on LLM API Calls Per User

Flat subscription, variable per-token cost. Set a monthly cap per user so heavy users do not blow your budget: block the call or silently downgrade to a cheaper model. The naive approach, its failure modes, and the fix.

Read post →

Unit economics

How to Calculate Gross Margin Per User for AI SaaS Products

Margin per user equals plan revenue minus AI cost. Here is the formula, how to calculate it manually from usage logs, why it breaks at scale, and how to see it live per user without the spreadsheet.

Read post →

Cost attribution

How to Get OpenAI API Cost Per User (Not Just Total Bill)

OpenAI only shows account-level totals. It has no idea who your users are. Here is how to attribute cost to each user, tag calls with a userId, and turn the total bill into a per-user breakdown.

Read post →

Model selection

GPT-5 vs GPT-4o-mini: Which Model Should Your SaaS Use in 2026?

GPT-4o-mini is often 10x cheaper than GPT-5. Here are the real 2026 prices, what mini is genuinely good enough for, when GPT-5 earns its cost, and how to route each feature to the right model automatically.

Read post →

Pricing

AI SaaS Pricing in 2026: Why Flat Subscriptions Are Structurally Broken

Traditional SaaS runs 80-90% gross margin. AI-first products sit near 52%. Fixed revenue against variable per-token cost is why flat plans break, and the three fixes that actually hold your margin.

Read post →

Provider costs

DeepSeek vs OpenAI vs Gemini: Real Cost Comparison for Production SaaS (2026)

The spread from cheapest to frontier models is more than 200x on input tokens. The real 2026 price table, why output tokens dominate, and how to compare providers on your actual usage instead of a spec sheet.

Read post →

AI cost strategy

What Uber and Microsoft Learned About AI Costs the Hard Way

Uber burned its entire 2026 AI budget in four months. Microsoft cancelled Claude Code for thousands of engineers. Same mistake, and the lesson for any SaaS reselling AI on a flat plan.

Read post →

Agentic AI

AI Agent Cost Management: How to Build Profitable Agentic Applications

One agent task fans out into dozens of model calls, so cost per run is variable and spikes fast. How to track cost per agent run, budget each task, route sub-steps to cheaper models, and cap the outliers.

Read post →

See the dashboard with real data on the live demo (no signup needed).