Introducing NVIDIA DGX Cloud Lepton: A Unified AI Platform Built for Developers

Table of Contents
- Introduction
- Key Benefits for Developers
- Monitoring and Observability
- Getting Started with DGX Cloud Lepton
- Join the DGX Cloud Lepton Early Access Program
Introduction
NVIDIA DGX Cloud Lepton is a unified AI platform and compute marketplace that connects developers to GPU capacity and AI services from a global network of cloud providers. It integrates seamlessly with NVIDIA software stack and provides infrastructure for scaling training, fine-tuning, and inference across geographies and providers.
Key Benefits for Developers
- Simplified GPU discovery across cloud providers.
- Built-in reliability and resilience for stable performance.
- Support for batch jobs and flexible deployment options for inference endpoints.
Monitoring and Observability
- Continuous monitoring of GPU and system health in real time.
- Observability tools for managing job lifecycles and maintaining visibility across the platform.
Getting Started with DGX Cloud Lepton
- Consistent experience across web interfaces, CLIs, and SDKs.
- Workspace for managing GPU resources and running workloads.
- Configuration settings for user access controls and usage quotas.
- Launch dev pods, submit batch jobs, and deploy inference endpoints for AI workloads.
Join the DGX Cloud Lepton Early Access Program
- Explore the DGX Cloud Lepton in Early Access to improve your generative AI development process.