Rent NVIDIA L40 GPUs

Enterprise data center GPU for AI inference, virtual workstations, and professional graphics

Technical Specifications

ArchitectureNVIDIA Ada Lovelace
Memory Size48 GB GDDR6 ECC
Memory Bandwidth864 GB/s
Ray Tracing Cores142 (3rd gen)
Tensor Cores568 (4th gen)
NVIDIA L40

L40 Rental Options

Rent NVIDIA L40 GPUs for production AI workloads, virtualized graphics, and enterprise applications. Get the reliability and features of data center GPUs with flexible on-demand pricing.

L40 vs RTX 4090

L40RTX 4090% Diff
ArchitectureAda LovelaceAda LovelaceN/A
CUDA Cores18 17616 384+10.9%
Tensor Cores568 (4th gen)512 (4th gen)+10.9%
RT Cores142 (3rd gen)128 (3rd gen)+10.9%
Memory TypeGDDR6 ECCGDDR6XN/A
VRAM48 GB24 GB+100%
Bus Width384-bit384-bit0%
Bandwidth864 GB/s1 010 GB/s−14.5%
FP32 Performance~90.5 TFLOPS~82.6 TFLOPS+9.6%
TDP300 W450 W−33.3%
PCIePCIe 4.0 ×16PCIe 4.0 ×16N/A
Form FactorDual-slotTriple-slotN/A
CoolingPassiveActiveN/A
Display Outputs4× DP 1.4a3× DP 1.4aN/A
vGPU SupportYesNoN/A

Key performance metrics

Enterprise AI Workloads

Optimized for AI inference and training with 18,176 CUDA cores delivering ~90.5 TFLOPS FP32 performance. Perfect for production-scale AI deployments.

Virtualization Ready

Built for multi-tenant cloud environments with NVIDIA vGPU support, enabling secure workstation virtualization and remote graphics workloads.

Professional Graphics

48GB ECC memory and advanced encoding (3x NVENC/NVDEC with AV1) enable real-time ray tracing, 8K video workflows, and complex 3D scene rendering.

NVIDIA L40 FAQ

Common questions about renting NVIDIA L40 GPUs

The NVIDIA L40 is a data center GPU designed for AI inference, virtual workstations, 3D rendering, and multi-tenant cloud deployments. It features 48GB ECC memory, vGPU support, and enterprise-grade reliability.
The NVIDIA L40 has 48 GB of GDDR6 memory with ECC (Error-Correcting Code) for enhanced reliability in production environments.
Yes, the L40 supports NVIDIA vGPU technology for virtual workstations (vWS) and virtual PCs/apps, making it ideal for multi-tenant cloud environments and remote graphics workloads.
The L40 has double the memory (48GB vs 24GB), ECC support, vGPU capabilities, and lower power consumption (300W vs 450W). It is designed for data centers, while the RTX 4090 is a consumer GPU.
Key features include 18,176 CUDA cores, 568 4th-gen Tensor cores, 142 3rd-gen RT cores, 48GB GDDR6 ECC memory, 864 GB/s bandwidth, PCIe 4.0, and 3x NVENC/NVDEC with AV1 support.
Yes, you can rent L40 GPUs on CloudRift with flexible pricing options and instant deployment. Launch the Console to get started.

Get in Touch

We're here to support your compute and AI needs. Let us know if you're looking to:

  • Find an affordable GPU provider
  • Sell your compute online
  • Manage on-prem infrastructure
  • Build a hybrid cloud solution
  • Optimize your AI deployment
hello@cloudrift.ai
CloudRift Inc., a Delaware corporation
PO Box 1224, Santa Clara, CA 95052, USA
+1 (831) 534-3437

I'm interested in: