Products

Tools that make infra tradeoffs harder to hand-wave.

Built for operators, founders, and engineering teams who need defensible numbers on cost, throughput, memory, retrieval architecture, and deployment shape.

01 · Available now

AI Calculator

Estimate VRAM, GPU count, throughput, TTFT, TPOT, and monthly cost for real inference and training scenarios.

Supports cloud and on-prem economics
Useful for sizing and stakeholder conversations
Updated for newer GPU classes and concurrency assumptions

Focus: Inference

Output: TCO

Open calculator
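For a feel of the arithmetic behind a sizing estimate like this, here is a minimal back-of-envelope sketch. It is not the calculator's actual method. The function name, the 1.2x overhead factor, the 730 hours per month, and the example model shape and GPU price are all illustrative assumptions.

```python
import math

def estimate_inference(params_b, bytes_per_param, n_layers, n_kv_heads, head_dim,
                       context_len, concurrency, gpu_vram_gb, gpu_hourly_usd,
                       kv_bytes=2, overhead=1.2, hours_per_month=730):
    """Rough VRAM, GPU count, and monthly cost for serving one model.

    Assumptions (hypothetical, not taken from the calculator):
    - weights take params * bytes_per_param
    - KV cache stores one key and one value vector per layer per token
    - `overhead` covers activations, fragmentation, and runtime buffers
    """
    # params_b is in billions, so billions of params * bytes per param = GB
    weights_gb = params_b * bytes_per_param
    # 2 = one key tensor + one value tensor per layer
    kv_bytes_per_token = 2 * n_layers * n_kv_heads * head_dim * kv_bytes
    kv_cache_gb = kv_bytes_per_token * context_len * concurrency / 1e9
    total_gb = (weights_gb + kv_cache_gb) * overhead
    gpus = math.ceil(total_gb / gpu_vram_gb)
    return {
        "weights_gb": weights_gb,
        "kv_cache_gb": kv_cache_gb,
        "total_gb": total_gb,
        "gpus": gpus,
        "monthly_usd": gpus * gpu_hourly_usd * hours_per_month,
    }

# Illustrative scenario: an 8B-parameter model in 16-bit precision,
# 16 concurrent requests at 8k context, on a hypothetical 80 GB GPU at $2/hour.
result = estimate_inference(
    params_b=8, bytes_per_param=2, n_layers=32, n_kv_heads=8, head_dim=128,
    context_len=8192, concurrency=16, gpu_vram_gb=80, gpu_hourly_usd=2.0,
)
print(result)
```

Even this simplified version shows why concurrency assumptions matter: in the example scenario the KV cache needs about as much memory as the model weights.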

02 · Available now

RAG Planner

Size the full retrieval pipeline together: chunking, embeddings, vector store, reranking, generation, and deployment model.

Choose managed, self-hosted, or hybrid retrieval
Compare chunk counts, RAM, storage, and monthly spend
Get a clearer view of recall-versus-latency tradeoffs

Focus: RAG

Output: Architecture

Open planner
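The chunk count, storage, and RAM figures mentioned above can be approximated with a few lines of arithmetic. The sketch below is a simplification, not the planner's implementation. It treats the corpus as one continuous token stream, assumes float32 embeddings, and uses a hypothetical 1.5x index overhead factor. All names and example values are assumptions.

```python
import math

def size_rag_index(total_tokens, chunk_tokens, overlap_tokens, embedding_dim,
                   bytes_per_float=4, index_overhead=1.5):
    """Rough chunk count and vector memory for a dense retrieval index.

    Assumptions (hypothetical): sliding-window chunking over one token
    stream, one embedding per chunk, and an index that costs
    `index_overhead` times the raw vector bytes in RAM.
    """
    stride = chunk_tokens - overlap_tokens
    if stride <= 0:
        raise ValueError("overlap_tokens must be smaller than chunk_tokens")
    # Each new chunk advances by `stride` tokens; the first chunk covers the overlap.
    chunks = max(1, math.ceil((total_tokens - overlap_tokens) / stride))
    raw_vector_gb = chunks * embedding_dim * bytes_per_float / 1e9
    return {
        "chunks": chunks,
        "raw_vector_gb": raw_vector_gb,
        "index_ram_gb": raw_vector_gb * index_overhead,
    }

# Illustrative corpus: 100M tokens, 512-token chunks with 64 tokens of overlap,
# embedded into 1024-dimensional vectors.
sizing = size_rag_index(
    total_tokens=100_000_000, chunk_tokens=512, overlap_tokens=64, embedding_dim=1024,
)
print(sizing)
```

Halving the chunk size roughly doubles the chunk count and the vector memory, which is one reason chunking and vector store choices are worth sizing together.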

Next up

A small product line, not a junk drawer.

03 · Soon

Eval Harness

Reproducible benchmark scaffolding for your actual data, not toy tasks.

04 · Soon

Prompt Studio

Version, test, and compare prompts without losing experimental discipline.

05 · Soon

Cost Sentinel

Watch spend across providers and spot unit-economics drift before it hurts.

Stay in the loop

Get new tools when they’re ready.

Subscribe to the newsletter. It’s where I’ll announce new calculators, planners, and infrastructure notes.