
Cores

Tesla details how it finds punishing defective cores on its million-core Dojo supercomputers — a single error can ruin a weeks-long AI training run
Detecting malfunctioning cores and disabling them on a massive processor is challenging, but Tesla has developed its Stress tool, which can detect cores prone to silent data corruption across not only Dojo processors but also across Dojo clusters with millions of cores, all without taking them offline. This is an incredibly important capability, as Tesla says a…

Arm Unveils Powerful New Cores And Compute Subsystems For Next-Gen AI Workloads
Arm Holdings plc, or “arm”, was once considered a vendor of processors designed for embedded and low-power systems, but those days are well past at this point. Having conquered mobile some time ago, processors based on Arm’s Instruction Set Architecture (ISA) are now challenging the supremacy of the long-dominant x86-64 architecture on every…