Introducing NVIDIA DGX Cloud Lepton: A Unified AI Platform Built for Developers

Table of Contents

  1. Introduction
  2. Key Benefits for Developers
  3. Monitoring and Observability
  4. Getting Started with DGX Cloud Lepton
  5. Join the DGX Cloud Lepton Early Access Program

Introduction

NVIDIA DGX Cloud Lepton is a unified AI platform and compute marketplace that connects developers to GPU capacity and AI services from a global network of cloud providers. It integrates with the NVIDIA software stack and provides infrastructure for scaling training, fine-tuning, and inference across geographies and providers.

Key Benefits for Developers

  • Simplified GPU discovery across cloud providers.
  • Built-in reliability and resilience for stable performance.
  • Support for batch jobs and flexible deployment options for inference endpoints.
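
To make the idea of GPU discovery across providers concrete, here is a minimal sketch of filtering and ranking capacity offers. It is not the DGX Cloud Lepton API: the `GpuOffer` type, the `find_offers` function, the provider names, and the prices are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class GpuOffer:
    """One block of GPU capacity from one provider (hypothetical schema)."""
    provider: str
    region: str
    gpu_type: str
    gpus_available: int
    usd_per_gpu_hour: float


def find_offers(
    offers: Iterable[GpuOffer],
    gpu_type: str,
    min_gpus: int,
    regions: Optional[Set[str]] = None,
) -> List[GpuOffer]:
    """Return offers matching the request, cheapest first."""
    matches = [
        o for o in offers
        if o.gpu_type == gpu_type
        and o.gpus_available >= min_gpus
        and (regions is None or o.region in regions)
    ]
    return sorted(matches, key=lambda o: o.usd_per_gpu_hour)


# Made-up sample data; providers and prices are placeholders.
SAMPLE_OFFERS = [
    GpuOffer("provider-a", "us-east", "H100", 16, 3.10),
    GpuOffer("provider-b", "eu-west", "H100", 8, 2.80),
    GpuOffer("provider-c", "us-east", "H100", 4, 2.50),
    GpuOffer("provider-a", "us-east", "A100", 32, 1.90),
]

best = find_offers(SAMPLE_OFFERS, gpu_type="H100", min_gpus=8)
```

With the sample data, `best` lists the `provider-b` offer before the `provider-a` offer, because both have at least eight H100 GPUs and the former is cheaper. A unified marketplace would do this matching for you; the sketch only shows the shape of the problem it removes.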

Monitoring and Observability

  • Continuous monitoring of GPU and system health in real time.
  • Observability tools for managing job lifecycles and maintaining visibility across the platform.
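
As a rough illustration of what GPU health monitoring involves at the node level, the sketch below parses the CSV output of the standard `nvidia-smi` query mode and flags GPUs that look unhealthy. This is an assumption about the general technique, not a description of how DGX Cloud Lepton implements monitoring; the function names and the thresholds (85 °C, 95% memory use) are hypothetical.

```python
import subprocess
from typing import Dict, List

# Fields requested from nvidia-smi; the order must match the parsing below.
QUERY_FIELDS = [
    "index",
    "temperature.gpu",
    "utilization.gpu",
    "memory.used",
    "memory.total",
]


def parse_gpu_csv(text: str) -> List[Dict[str, int]]:
    """Parse `--format=csv,noheader,nounits` output into one dict per GPU."""
    rows = []
    for line in text.strip().splitlines():
        values = [v.strip() for v in line.split(",")]
        if len(values) != len(QUERY_FIELDS):
            continue  # skip malformed lines
        try:
            index, temp, util, used, total = (int(v) for v in values)
        except ValueError:
            continue  # skip GPUs reporting non-numeric values such as [N/A]
        rows.append({
            "index": index,
            "temp_c": temp,
            "util_pct": util,
            "mem_used_mib": used,
            "mem_total_mib": total,
        })
    return rows


def unhealthy(rows, max_temp_c: int = 85, max_mem_fraction: float = 0.95):
    """Return GPUs that exceed the (illustrative) temperature or memory limits."""
    return [
        r for r in rows
        if r["temp_c"] > max_temp_c
        or r["mem_used_mib"] > max_mem_fraction * r["mem_total_mib"]
    ]


def read_local_gpus() -> List[Dict[str, int]]:
    """Query the local driver; requires nvidia-smi on PATH."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=" + ",".join(QUERY_FIELDS),
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_csv(result.stdout)
```

On a machine with NVIDIA drivers installed, `unhealthy(read_local_gpus())` returns the GPUs that need attention. A managed platform runs checks of this kind continuously, so that individual developers do not have to.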

Getting Started with DGX Cloud Lepton

  • Consistent experience across web interfaces, CLIs, and SDKs.
  • Workspace for managing GPU resources and running workloads.
  • Configuration settings for user access controls and usage quotas.
  • Launch dev pods, submit batch jobs, and deploy inference endpoints for AI workloads.

Join the DGX Cloud Lepton Early Access Program

  • Explore DGX Cloud Lepton in Early Access to improve your generative AI development process.