Newsletter

Weekly AI infra notes worth actually reading.

GPU economics, inference tradeoffs, retrieval architecture, and opinionated analysis for teams shipping AI systems without infinite budget or patience.

Subscribe

Short, sharp, technical.

No vendor fluff. No content treadmill filler. Just useful observations and practical breakdowns from the infra side of AI.

GPU + serving economics
RAG architecture
evals + deployment
What subscribers get

Useful signal, not just volume.

  • GPU benchmark and platform implications
  • Cost math for inference and training workloads
  • Architecture notes on retrieval, reranking, and eval design
  • Opinionated takes on what matters and what’s noise
Latest issue

Why ClusterMind exists.

Most AI infrastructure commentary is too close to the vendors selling the infrastructure. ClusterMind is built to be more candid, more technical, and more useful.

Recent dispatches

A few examples of the lane.

#024 Apr 24 The B200 ramp is real — and it changes the cost curve. hardware
#023 Apr 17 Reranker beats fine-tune: a cheaper win in production RAG. rag
#022 Apr 10 Why your eval harness is probably lying to you. evals