NVIDIA and Microsoft have teamed up to build one of the most powerful AI supercomputers in the world, powered by Microsoft Azure’s advanced supercomputing infrastructure combined with NVIDIA GPUs, networking and a full stack of AI software.
The massive cloud AI computer will help enterprises train, deploy and scale AI, including large, state-of-the-art models.
It will include powerful and scalable ND- and NC-series virtual machines optimised for distributed AI training and inference. As the first public cloud to incorporate NVIDIA’s advanced AI stack, Azure will feature tens of thousands of NVIDIA A100 and H100 GPUs, NVIDIA Quantum-2 400Gb/s InfiniBand networking and the NVIDIA AI Enterprise software suite.
As part of the collaboration, NVIDIA will use Azure’s scalable virtual machine instances to research and further accelerate advances in generative AI, a rapidly emerging area of AI in which foundation models such as Megatron-Turing NLG 530B serve as the basis for unsupervised, self-learning algorithms that create new text, code, digital images, video or audio.
“AI technology advances as well as industry adoption are accelerating. The breakthrough of foundation models has triggered a tidal wave of research, fostered new startups and enabled new enterprise applications. Our collaboration with Microsoft will provide researchers and companies with state-of-the-art AI infrastructure and software to capitalise on the transformative power of AI,” said Manuvir Das, Vice President of Enterprise Computing at NVIDIA.
“AI is fueling the next wave of automation across enterprises and industrial computing, enabling organisations to do more with less as they navigate economic uncertainties. Our collaboration with NVIDIA unlocks the world’s most scalable supercomputer platform, which delivers state-of-the-art AI capabilities for every enterprise on Microsoft Azure,” said Scott Guthrie, Executive Vice President of the Cloud + AI Group at Microsoft.