Planner

Full RAG infrastructure planner.

Plan the full cost and footprint of a production retrieval stack: embeddings, vector database, reranking, generation, storage, and monthly infrastructure spend — all in one place.

latency + recall aware

managed vs self-hosted

monthly TCO projection

Configuration

Quick Presets

Workload Preset

Vector DB Vendor Preset

Corpus Size (GB of source docs)

Chunk Size (tokens)

Chunk Overlap (tokens)

Bytes per Token

Embedding Model

Vector Precision

Metadata per Chunk (KB)

Index Type

Vector DB Operating Model

Replication Factor

Target Query Load (QPS)

Top-K Retrieved

Reranking

Generator Model Size

Generator Precision

Avg Prompt Tokens

Avg Output Tokens

Deployment Preference

Retrieval & Embeddings

️ Vector Database Footprint

Generation Layer

Monthly Cost Summary

️ Recommended Architecture