LLM Inference

The mechanics of LLM token generation, including sampling pipelines, logit processing, temperature scaling, and decoding strategies.
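As a rough orientation to the topics named above, the following is a minimal sketch of one possible sampling pipeline: raw logits are divided by a temperature, optionally filtered to the top k candidates, normalized with softmax, and then sampled. The function names (`apply_temperature`, `top_k_filter`, `sample_token`, `greedy_token`) and the toy logits are illustrative assumptions, not taken from any particular library or from the readings listed here.

```python
import math
import random


def softmax(logits):
    """Convert logits to probabilities, subtracting the max for numerical stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def apply_temperature(logits, temperature):
    """Divide logits by the temperature: <1 sharpens the distribution, >1 flattens it."""
    if temperature <= 0:
        raise ValueError("temperature must be positive; use greedy_token for argmax decoding")
    return [x / temperature for x in logits]


def top_k_filter(logits, k):
    """Mask everything below the k-th largest logit with -inf (ties may keep more than k)."""
    if k >= len(logits):
        return list(logits)
    threshold = sorted(logits, reverse=True)[k - 1]
    return [x if x >= threshold else float("-inf") for x in logits]


def sample_token(logits, temperature=1.0, k=None, rng=random):
    """Hypothetical pipeline: temperature scaling, optional top-k, softmax, then sampling."""
    scaled = apply_temperature(logits, temperature)
    if k is not None:
        scaled = top_k_filter(scaled, k)
    probs = softmax(scaled)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def greedy_token(logits):
    """Greedy decoding: always pick the index of the highest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.1]  # made-up scores for a three-token vocabulary
    print("greedy:", greedy_token(toy_logits))
    print("sampled:", sample_token(toy_logits, temperature=0.7, k=2))
```

Greedy decoding and sampling differ only in the last step here; production inference stacks typically chain more logit processors (for example nucleus filtering or repetition penalties) in the same place where `top_k_filter` sits in this sketch.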

Reading List