Products

Tools that make infra tradeoffs harder to hand-wave.

Built for operators, founders, and engineering teams who need defensible numbers on cost, throughput, memory, retrieval architecture, and deployment shape.

01 · Available now

AI Calculator

Estimate VRAM, GPU count, throughput, TTFT, TPOT, and monthly cost for real inference and training scenarios.

  • Supports cloud and on-prem economics
  • Useful for sizing and stakeholder conversations
  • Updated for newer GPU classes and concurrency assumptions
Focus
Inference
Output
TCO
02 · Available now

RAG Planner

Size the full retrieval pipeline together: chunking, embeddings, vector store, reranking, generation, and deployment model.

  • Choose managed, self-hosted, or hybrid retrieval
  • Compare chunk counts, RAM, storage, and monthly spend
  • Get a clearer view of recall versus latency tradeoffs
Focus
RAG
Output
Architecture
Next up

A small product line, not a junk drawer.

03 · Soon

Eval Harness

Reproducible benchmark scaffolding for your actual data, not toy tasks.

04 · Soon

Prompt Studio

Version, test, and compare prompts without losing experimental discipline.

05 · Soon

Cost Sentinel

Watch spend across providers and spot unit-economics drift before it hurts.

Stay in the loop

Get new tools when they’re ready.

Subscribe to the newsletter and I’ll use it as the launch channel for new calculators, planners, and infrastructure notes.