Understanding GPU Architecture Implications on LLM Serving Workloads (Master Thesis, ETH Zürich, 2024)