Life notes

A collection of thoughts, experiences, and life updates.

Latest

Designing a Multi-Tenant LLM Inference Platform, Part 2

Scaling a serving cell when cold starts take minutes: sizing warm spares from forecast error, model-local standby, draining, and failing honestly when the KV cache is gone.

Jun 11, 2026

2026

Jun 9 Designing a Multi-Tenant LLM Inference Platform
Jun 6 Designing a Distributed Task Scheduler
Jun 1 A layoff and a newborn
May 9 Welcome, Voi 🐘