GPU

Graphics Processing Unit – a processor optimized for parallel computations, originally for rendering graphics but now also widely used for AI and data processing.

13 articles

Semiconductor Engineering

Systematic Analysis of CPU-Induced Slowdowns in Multi-GPU LLM Inference (Georgia Tech)

Mar 28, 2026

A new technical paper, “Characterizing CPU-Induced Slowdowns in Multi-GPU LLM Inference,” was published by the Georgia Institute of Technology. Abstract: “Large-scale machine learning workloads increasingly rely on multi-GPU systems, yet their performance is often limited by an overlooked component: the CPU. Through a detailed study of modern large language model...