Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting
The article examines the data movement overhead that arises when serving large-scale Mixture of Experts (MoE) Large Language Models (LLMs), and presents insights from profiling four state-of-the-art models. It highlights how understanding these data movement patterns can inform architectural improvements to wafer-scale GPUs for better serving performance.
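To make "data movement patterns" more concrete, below is a minimal Python sketch, not taken from the article and not its methodology: it simulates top-k expert routing for a batch of tokens and tallies per-expert token counts as a rough proxy for the activation traffic that crosses the interconnect when experts are sharded across devices. All layer counts, expert counts, and sizes are assumptions chosen for illustration.

```python
# Illustrative sketch only (assumed parameters, random router logits):
# estimate per-layer token dispatch volume under top-k MoE routing.
import numpy as np

NUM_LAYERS = 4        # assumed number of MoE layers
NUM_EXPERTS = 8       # assumed experts per MoE layer
TOP_K = 2             # assumed experts activated per token
NUM_TOKENS = 1024     # assumed tokens in one serving batch
HIDDEN_DIM = 4096     # assumed hidden size
BYTES_PER_ELEM = 2    # assumed fp16 activations

rng = np.random.default_rng(0)

def route_tokens(logits: np.ndarray, top_k: int) -> np.ndarray:
    """Return indices of the top_k highest-scoring experts per token."""
    return np.argsort(-logits, axis=-1)[:, :top_k]

total_bytes = 0
for layer in range(NUM_LAYERS):
    # Stand-in for the gating network's output: random router logits.
    logits = rng.standard_normal((NUM_TOKENS, NUM_EXPERTS))
    chosen = route_tokens(logits, TOP_K)

    # Number of tokens dispatched to each expert in this layer.
    counts = np.bincount(chosen.ravel(), minlength=NUM_EXPERTS)

    # Bytes of activations that would cross the interconnect if every
    # expert lived on a remote device (an upper bound, for illustration).
    layer_bytes = counts.sum() * HIDDEN_DIM * BYTES_PER_ELEM
    total_bytes += layer_bytes
    print(f"layer {layer}: tokens per expert = {counts.tolist()}, "
          f"~{layer_bytes / 2**20:.1f} MiB dispatched")

print(f"total dispatched across layers: ~{total_bytes / 2**30:.2f} GiB")
```

Profiling real models replaces the random logits with the routers' actual decisions; the resulting per-expert histograms are the kind of pattern the article argues can be forecast and exploited by hardware design.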