AI 精选动态
智能评分 60
Azure 创 MLPerf 训练纪录
AI 推荐理由
新基准显示在大规模 GPU 集群上可显著缩短训练时长,值得关注后续 Azure 与 NVIDIA 的协同进展。核心解读
Azure 与 NVIDIA 合作,在 NVIDIA Blackwell 平台上使用 8,192 块 GPU(GB200 NVL72 系统)完成 Llama 3.1 405B 模型的训练,仅用时 7.07 分钟,创下迄今为止规模最大、最快的 MLPerf Training 成绩。此成绩展示了全栈创新在硅片、系统、网络和软件层面的协同效应。Satya Nadella 在推文中称其为 Azure 的新里程碑。
全文
Thanks @satyanadella 👏
Great work with @Azure on one of the largest MLPerf Training submission to-date on NVIDIA Blackwell: 8,192 GPUs on NVIDIA GB200 NVL72 systems, Llama 3.1 405B training target met in only 7.07 minutes.
More to come!
> **引用原帖 Satya Nadella (@satyanadella):**
> New Azure milestone. The fastest time to train yet at the largest reported scale for this leading AI training benchmark.
> A great example of what is possible when we bring together full-stack innovation across silicon, systems, networking, and software, along with our deep partnership with @nvidia, to advance the frontier of AI infra. https://t.co/5ADybErndz
> https://x.com/satyanadella/status/2067020073408368664