News

Exxact Features New NVIDIA Ada Generation GPUs - RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S

August 8, 2023

9 min read

EXX-Blog-New-NVIDIA-GPUs-RTX5000-RTX4500-RTX4000ada-L40S.jpg

Newest NVIDIA Ada Generation GPUs Available in Exxact System Configurators Today

Exxact will feature four new NVIDIA Ada Generation GPUs – the NVIDIA RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S – announced by NVIDIA at SIGGRAPH 2023, in our workstations, servers, and clusters. The new NVIDIA GPUs deliver compelling gen-over-gen performance for a competitive edge in real-time rendering environments, 3D graphics acceleration and development, high-performance scientific computing, AI and deep learning capabilities, and more.

Previous generation NVIDIA RTX A5500, A5000, A4500, and A4000 GPUs are still crowd favorites for high-performance computing and the NVIDIA A100 Tensor Core GPU is in high demand to power the artificial intelligence boom. The new Ada Lovelace GPU lineup delivers the performance professionals need to innovate.

The NVIDIA RTX 5000 and NVIDIA L40S are available to order now in Exxact configurable systems, while the RTX 4500 and RTX 4000 will follow in the fall. Let's delve in to see what to expect from these new GPUs.

The New RTX 5000 Ada Generation vs. RTX A5500

The NVIDIA RTX 5000 improves upon the last generation RTX A5500 with more GPU memory, CUDA cores, RT Cores, and Tensor Cores. And with the newer generation Ada Lovelace architecture, the RTX 5000 is 1.5x faster in AI training, has 2x more throughput in AI Inference, and is 2x more performant in graphics and rendering over the RTX A5500, potentially outperforming the last generation RTX A6000.

The gen-over-gen performance increases enable every industry to achieve more with RTX 5000. For architectural engineering, product design, and game development workloads, users will be able to build highly complex 3D models, render large environments faster, power immersive high-fidelity VR, and create more with fewer limitations.

RTX 5000 powers research and data science with faster time to completion of complex simulations, quick prototyping and training of dense AI models, deployment of LLMs and generative AI, and execution of scientific discovery through molecular dynamics and computer-aided drug design.

Model/Specs	RTX 5000	RTX A5500
Architecture	Ada Lovelace	Ampere
GPU Memory	32GB GDDR6 ECC	24GB GDDR6 ECC
Bus Interface	256-bit	384-bit
Memory Bandwidth	576 GB/s	768 GB/s
CUDA Cores	12,800	10,240
Ray Tracing Cores	100 (3rd Gen)	80 (2nd Gen)
Tensor Cores	400 (4th Gen)	320 (3rd Gen)
Single Precision Perf.	65.3 TFLOPS	34.1 TFLOPS
RT Core Performance	151.0 TFLOPS	66.6 TFLOPS
Tensor Core Performance	1044.4 TFLOPS (FP8) 522.2 TFLOPS (FP16)	272.8 TFLOPS (FP16)
Encode Decode	2x NVENC, 2x NVDEC (+AV1 encode & decode)	1x NVENC, 2x NVDEC (+AV1 decode)
System Interconnect	PCIe 4.0 x16	PCIe 4.0 x16
Form Factor	Dual Slot, Full Length	Dual Slot, Full Length
Power Consumption	250W	230W

NVIDIA L40S vs. NVIDIA A100

With increased demand in training the next impactful AI model, many startups and corporations gravitate towards NVIDIA DGX and HGX systems as AI complexity increases by the second. The NVIDIA L40S delivers comparable AI performance versus the last generation flagship NVIDIA A100 Tensor Core GPU. The L40S slots in as a mainstream, low barrier of entry, easy-to-implement GPU accelerator for any user to get started and boost their computing power to handle any workload.

NVIDIA L40S enables 1.2x more generative AI performance and 1.7x more inference performance over the NVIDIA A100. For LLM models, bring 1.2X the performance of an HGX A100 system to the table using an 8-GPU L40S configuration. With performance numbers close to the NVIDIA A100, the L40S’s availability, fast deployment time, and versatility become the deciding factor.

While scalability is a highlight for DGX and HGX systems, the L40S is a competitive option for those on a tighter budget and even tighter timeframe looking to train and deploy AI. And with 48GB of GPU memory, 142 RT Cores, and exceptional single precision performance, the L40S provides versatility as the most powerful universal GPU for the data center, accelerating performance for graphics rendering, video applications, and other various HPC workloads aside from AI. NVIDIA L40S scales best for small and mid-scale operations for training AI, LLM inferencing, and multi-workload environments.

Model/Specs	NVIDIA L40S	NVIDIA A100 Tensor Core GPU
Architecture	Ada Lovelace	Ampere
GPU Memory	48GB GDDR6	80GB HBM2e
Bus Interface	384-bit	5,120-bit
Memory Bandwidth	864 GB/s	2,039 GB/s
L2 Cache	96MB	40MB
CUDA Cores	18,176	6,912
Ray Tracing Cores	142 (3rd Gen)	N/A
Tensor Cores	568 (4th Gen)	432 (3rd Gen)
Single Precision Perf.	91.6 TFLOPS	19.5 TFLOPS
RT Core Performance	212.0 TFLOPS	N/A
Tensor Core Performance	1466 TFLOPS (FP8) 733 TFLOPS (FP16)	624 TFLOPS (FP16)
Encode Decode	3x NVENC, 3x NVDEC (+AV1 E&D)	0x NVENC, 5x NVDEC
System Interconnect	PCIe 4.0 x16	SXM4 over PCIe 4.0 x16
Form Factor	Dual Slot, Full Length	SXM4 8-Way HGX
Power Consumption	350W	400W

RTX 4500 Ada Generation vs. RTX A4500

The NVIDIA RTX A4500 GPUs are Exxact customer favorites to slot into workstations and servers because of their top-tier performance for the cost. The RTX 4500 Ada Generation, set to release later this year, extends this performance prowess to increase mainstream performance applications like rendering, generative AI, engineering simulations, molecular dynamics, and workloads using the NVIDIA Omniverse platform for 3D creation.

With the increase in memory, cores, and accelerators, RTX 4500 showcases a very healthy performance leap: 1.6x increase in graphics performance, 2.7x increased Omniverse rendering workload, 1.5x in classic rendering workloads, and 1.5x in AI training and inferencing over the last generation RTX A4500.

Model/Specs	RTX 4500	RTX A4500
Architecture	Ada Lovelace	Ampere
GPU Memory	24GB GDDR6 ECC	20GB GDDR6 ECC
Bus Interface	192-bit	320-bit
Memory Bandwidth	432 GB/s	640 GB/s
CUDA Cores	7,680	7168
Ray Tracing Cores	60 (3rd Gen)	56 (2nd Gen)
Tensor Cores	240 (4th Gen)	224 (3rd Gen)
Single Precision Perf.	39.6 TFLOPS	23.7 TFLOPS
RT Core Performance	91.6 TFLOPS	46.2 TFLOPS
Tensor Core Performance	634.0 TFLOPS (FP8) 317.0 TFLOPS (FP16)	189.2 TFLOPS (FP16)
Encode Decode	2x NVENC, 2x NVDEC (+AV1 encode & decode)	1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect	PCIe 4.0 x16	PCIe 4.0 x16
Form Factor	Dual Slot, Full Length	Dual Slot, Full Length
Power Consumption	210W	200W

RTX 4000 Ada Generation vs. RTX A4000

The RTX 4000 will be the single-slot GPU of choice for accelerating graphics, rendering, and high-performance compute workloads. Both RTX 4000 and RTX A4000 showcase the same amount of CUDA cores, RT Cores, and Tensor Cores yet they’re based on district architectures while maintaining similar thermal ratings. Let’s see the gen-over-gen improvements.

The RTX 4000 has a 1.5x performance uplift in graphics and 1.7x rendering performance, and 1.2x generative AI performance, with up to double the ray-tracing throughput and up to triple the Tensor Core throughput. RTX 4000 tackles demanding creative workloads, design, and engineering workflows in a compact single-slot design, perfect for multi-GPU workstation and server configurations.

Model/Specs	RTX 4000 Ada	RTX A4000
Architecture	Ada Lovelace	Ampere
GPU Memory	20GB GDDR6 ECC	16GB GDDR6 ECC
Bus Interface	160-bit	256-bit
Memory Bandwidth	360 GB/s	448 GB/s
CUDA Cores	6,144	6,144
Ray Tracing Cores	48 (3rd Gen)	48 (2nd Gen)
Tensor Cores	192 (4th Gen)	192 (3rd Gen)
Single Precision Perf.	26.7 TFLOPS	19.2 TFLOPS
RT Core Performance	61.8 TFLOPS	37.4 TFLOPS
Tensor Core Performance	427.6 TFLOPS (FP8) 213.8 TFLOPS (FP16)	153.4 TFLOPS (FP16)
Encode Decode	2x NVENC, 2x NVDEC (+AV1 encode & decode)	1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect	PCIe 4.0 x16	PCIe 4.0 x16
Form Factor	Single Slot, Full Length	Single Slot, Full Length
Power Consumption	130W	140W

Where to Get the New NVIDIA Ada Generation GPUs

At Exxact Corporation, we deliver high-performance computing platforms and solutions featuring NVIDIA GPUs; you can expect the RTX 5000 and NVIDIA L40S to show up in the Exxact System Configurator starting today! The RTX 4500 and RTX 4000 will follow shortly after their release.

We strive to provide the computing resources to determine the optimal system for your workload so you can solve complex computing challenges, accelerate groundbreaking discoveries, maintain a competitive edge, and fuel innovation. From a fleet of small form factor workstations all the way to full-scale multi-node clusters, build or upgrade your computing infrastructure with the new NVIDIA Ada Generation GPUs.

Have any questions about the new NVIDIA GPUs? Want to stay up to date on when they become available?
Contact us today for more information!

Topics

Have any questions?

News