NVIDIA L40S GPU-accelerated OCI Instances

Expanding NVIDIA GPU-Accelerated Instances for  AI, Digital Twins, and Other Uses is Oracle  Cloud Infrastructure

Businesses are quickly using generative AI, large language models (LLMs), sophisticated visuals, and digital twins

Virtual machine powered by a single NVIDIA H100 Tensor Core GPU and the availability of NVIDIA L40S GPU bare-metal instances

The universal data centre GPU NVIDIA L40S has revolutionary multi-workload acceleration for generative AI, graphics, and video applications

For Llama 3 8B with NVIDIA TensorRT-LLM at an input and output sequence length of 128, for instance, a single L40S GPU

The GPU is perfect for creating apps on the NVIDIA Omniverse platform, which enables AI-enabled digital twins and real-time, lifelike 3D simulations

It supports AI tasks with 13 4th Gen Intel Xeon processor cores, 246GB system memory, and 2x 3.4TB NVMe SSDs

Oracle Cloud’s bare-metal compute with NVIDIA H100 and A100 GPUs, low-latency Supercluster

It has the NVIDIA Grace Hopper Superchip and NVLink-C2C, which connects the NVIDIA Grace CPU and NVIDIA Hopper GPU at 900GB/s