Tesla details how it finds punishing defective cores on its million-core Dojo supercomputers — a single error can ruin a weeks-long AI training run Tech News Eulis June 7, 2025 Tesla’s Stress tool detects and disables faulty cores in Dojo wafer-scale processors, which power Dojo clusters with millions of cores, without interrupting AI training. Go to Source Author:
Kioxia works with Nvidia to prep XL-Flash SSD that’s 3x faster than any SSD available — 10 million IOPS drive has peer-to-peer GPU connectivity for AI servers