Category P: Inference Patterns

Recipes for production inference patterns, including speculative decoding, KV-cache management, streaming, batching, and ensemble methods.