NVIDIA intros generative AI foundry service on Microsoft Azure

NVIDIA has introduced an AI foundry service aimed at accelerating the development and fine-tuning of generative AI applications utilising Microsoft Azure.

Aimed at both startups and enterprises, the service consolidates three key components: a collection of NVIDIA AI Foundation Models, the NVIDIA NeMo framework and tools, and the NVIDIA DGX Cloud AI supercomputing services. Together, these give businesses an end-to-end solution for building custom generative AI models, which can then be deployed with NVIDIA AI Enterprise software to power applications such as intelligent search, summarisation and content generation.

SAP SE, Amdocs and Getty Images are already using the service to build custom models.

“Enterprises need custom models to perform specialised skills trained on the proprietary DNA of their company: their data. NVIDIA’s AI foundry service combines our generative AI model technologies, LLM training expertise and giant-scale AI factory. We built this in Microsoft Azure so enterprises worldwide can connect their custom model with Microsoft’s world-leading cloud services,” said Jensen Huang, Founder and CEO of NVIDIA.

Building the service on Microsoft Azure makes these customised models available at a global scale and lets enterprises connect them to Microsoft’s cloud services.

“Our partnership with NVIDIA spans every layer of the Copilot stack — from silicon to software — as we innovate together for this new age of AI. With NVIDIA’s generative AI foundry service on Microsoft Azure, we’re providing new capabilities for enterprises and startups to build and deploy AI applications on our cloud,” said Satya Nadella, Chairman and CEO of Microsoft.

The NVIDIA foundry service offers a range of NVIDIA AI Foundation models, including the Nemotron-3 8B family optimised for various use cases and multilingual capabilities. These models, hosted in the Azure AI model catalogue, provide developers with curated options for building custom enterprise generative AI applications.

The availability of NVIDIA DGX Cloud AI supercomputing on Azure Marketplace offers customers scalable instances with thousands of NVIDIA Tensor Core GPUs. This integration with Azure’s services allows customers to utilise existing credits for NVIDIA AI supercomputing and software, expediting model development.

The integration of NVIDIA AI Enterprise software into Azure Machine Learning extends the platform’s capabilities, providing stable and supported AI and data science software. This addition enhances Azure’s enterprise-grade AI service with NeMo and NVIDIA Triton Inference Server.