LLM INFERENCE SYSTEMS / AI INFRASTRUCTURE

Python Engineer → LLM Inference Systems

I'm a Python engineer with 6+ years of backend experience, building a learning trail around LLM inference systems — KV cache, serving optimization, cache-aware routing, and benchmark-driven engineering.

Current work

Recent notes

All notes

Featured projects

All projects

Technical tags

View all tags