How to Calculate if Your Network is Bottlenecking Distributed Training
8 min read
A practical guide to understanding why your multi-node GPU training might be slower than expected.
If you're building a multi-node GPU cluster for distributed training, you've probably run into a confusing mess of terminology — NVLink, NVSwitch, InfiniBand, RoCE, GPUDirect. Half the blog posts out there mix these up, and vendor documentation assumes you already know what you're doing.
So let's sort this out.
When monitoring GPU infrastructure, you might occasionally notice something peculiar: your high-end GPUs connected to PCIe 5.0 slots are operating at PCIe Gen 1 speeds. Before raising any alarms, let's walk through a systematic debugging approach that reveals this is often expected behavior rather than a configuration issue.
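Here is a minimal sketch of that first check, using the pynvml bindings from the nvidia-ml-py package (my choice of tooling here; `nvidia-smi -q` reports the same fields). It compares each GPU's current PCIe link generation and width against the maximum the slot supports:

```python
# Minimal sketch: compare current vs. maximum PCIe link generation/width per GPU.
# Assumes the nvidia-ml-py package (pynvml) is installed: pip install nvidia-ml-py
import pynvml

pynvml.nvmlInit()
try:
    for i in range(pynvml.nvmlDeviceGetCount()):
        handle = pynvml.nvmlDeviceGetHandleByIndex(i)
        name = pynvml.nvmlDeviceGetName(handle)
        # Older pynvml versions return bytes for the device name.
        if isinstance(name, bytes):
            name = name.decode()

        cur_gen = pynvml.nvmlDeviceGetCurrPcieLinkGeneration(handle)
        max_gen = pynvml.nvmlDeviceGetMaxPcieLinkGeneration(handle)
        cur_width = pynvml.nvmlDeviceGetCurrPcieLinkWidth(handle)
        max_width = pynvml.nvmlDeviceGetMaxPcieLinkWidth(handle)

        note = "" if (cur_gen, cur_width) == (max_gen, max_width) else "  <-- downtrained link?"
        print(f"GPU {i} ({name}): Gen{cur_gen} x{cur_width} "
              f"(max Gen{max_gen} x{max_width}){note}")
finally:
    pynvml.nvmlShutdown()
```

One caveat before reading too much into the output: an idle GPU will typically downtrain its PCIe link to save power, so rerun the check while the GPU is under load before treating a Gen 1 reading as a problem.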