
TAIPAI, Taiwan (COMPUTEX), May 21, 2025 – NVIDIA has introduced DGX Cloud Lepton, an AI platform designed to support the development of agentic and physical AI systems. It includes a compute marketplace that links developers to a pool of GPUs distributed across global cloud infrastructure. The goal is to provide scalable compute access without the need for direct hardware management, streamlining the development and deployment of advanced AI applications.
To meet the demand for AI, NVIDIA Cloud Partners (NCPs) including CoreWeave, Crusoe, Firmus, Foxconn, GMI Cloud, Lambda, Nebius, Nscale, Softbank Corp. and Yotta Data Services will offer NVIDIA Blackwell and other NVIDIA architecture GPUs on the DGX Cloud Lepton marketplace.
Developers can access GPU resources in specific regions for both short-term and sustained workloads, enabling support for strategic and sovereign AI use cases. Frontline cloud providers and GPU marketplaces are expected to integrate with the DGX Cloud Lepton platform.
“NVIDIA DGX Cloud Lepton connects our network of global GPU cloud providers with AI developers,” said Jensen Huang, founder and CEO of NVIDIA. “Together with our NCPs, we’re building a planetary-scale AI factory.”
DGX Cloud Lepton helps address the challenges of securing reliable GPU resources by unifying access to cloud AI services and GPU capacity across the NVIDIA compute ecosystem. The platform integrates with the NVIDIA software stack, including NVIDIA NIM and NeMo microservices, NVIDIA Blueprints and NVIDIA Cloud Functions, to accelerate and simplify the development and deployment of AI applications.
For cloud providers, DGX Cloud Lepton provides management software that delivers real-time GPU health diagnostics and automates root-cause analysis, eliminating manual operations and reducing downtime.
Key benefits of the platform include:
- Improved productivity and flexibility: Supports development, training, and inference in one place for better workflow. Buy GPU time from cloud providers or use own clusters for better flexibility and control.
- Frictionless deployment: Enables deployment of AI applications across multi-cloud and hybrid environments with minimal operational burden, using integrated services for inference, testing and training workloads.
- Agility and sovereignty: Gives developers access to GPU resources in specific regions, enabling compliance with data sovereignty regulations and meeting low-latency requirements for sensitive workloads.
- Predictable performance: Provides participating cloud providers enterprise-grade performance, reliability and security, ensuring a consistent user experience.
A New Bar for AI Cloud Performance
NVIDIA today also announced NVIDIA Exemplar Clouds to help NCPs enhance security, usability, performance and resiliency, using NVIDIA’s expertise, reference hardware and software and operational tools.
NVIDIA Exemplar Clouds tap into NVIDIA DGX Cloud Benchmarking, a suite of tools and recipes to boost AI workload speed and track cost versus performance.
Yotta Data Services is the first Asia-Pacific NCP to join the NVIDIA Exemplar Cloud program.
Availability
Developers can sign up for early access to NVIDIA DGX Cloud Lepton.
Source: NVIDIA
About NVIDIA
NVIDIA Corporation, based in Santa Clara, CA, is a U.S. technology company specializing in the design and production of graphics processing units (GPUs). Its hardware and software solutions support a range of applications and simulation. Operating for over 30 years, NVIDIA has seen strong financial growth, reporting $39.3 billion in revenue and $22.1 billion in net income for the fiscal quarter ending January 2025. Its headquarters are designed to promote a flat organizational structure that encourages open communication and collaboration between leadership and staff across industries. In gaming, its GPUs power high-performance visual rendering. In artificial intelligence and high-performance computing, NVIDIA provides the infrastructure needed for training and deploying large-scale models. The company also contributes to the automotive sector with systems for autonomous driving and supports robotics with tools for AI-based perception.