Calculator

AI infrastructure calculator for cost, memory, and throughput.

Use this to size inference and training workloads with more realistic assumptions around precision, concurrency, KV cache, throughput, and deployment model.

inference + training

cloud vs on-prem

TTFT + TPOT aware

Calculator Mode

Configuration

Model Size (billion parameters)

Weight Precision

KV Cache Precision

Target Throughput (cluster req/sec)

Concurrent Requests

Avg Input Tokens / Request

Avg Output Tokens / Request

Context Length

Real-World Efficiency

Deployment Target