Session 5: Efficient Memory Management for Large Language Model Serving with PagedAttention

Session 5 of the EleutherAI ML Scalability & Performance Reading Group, in which we covered PagedAttention.

Presenter: Kunjan Patel

Links: