News

Exxact Features New NVIDIA Ada Generation GPUs - RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S

August 8, 2023
9 min read
EXX-Blog-New-NVIDIA-GPUs-RTX5000-RTX4500-RTX4000ada-L40S.jpg

Newest NVIDIA Ada Generation GPUs Available in Exxact System Configurators Today

Exxact will feature four new NVIDIA Ada Generation GPUs – the NVIDIA RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S – announced by NVIDIA at SIGGRAPH 2023, in our workstations, servers, and clusters. The new NVIDIA GPUs deliver compelling gen-over-gen performance for a competitive edge in real-time rendering environments, 3D graphics acceleration and development, high-performance scientific computing, AI and deep learning capabilities, and more. 

Previous generation NVIDIA RTX A5500, A5000, A4500, and A4000 GPUs are still crowd favorites for high-performance computing and the NVIDIA A100 Tensor Core GPU is in high demand to power the artificial intelligence boom. The new Ada Lovelace GPU lineup delivers the performance professionals need to innovate. 

The NVIDIA RTX 5000 and NVIDIA L40S are available to order now in Exxact configurable systems, while the RTX 4500 and RTX 4000 will follow in the fall. Let's delve in to see what to expect from these new GPUs.

The New RTX 5000 Ada Generation vs. RTX A5500

The NVIDIA RTX 5000 improves upon the last generation RTX A5500 with more GPU memory, CUDA cores, RT Cores, and Tensor Cores. And with the newer generation Ada Lovelace architecture, the RTX 5000 is 1.5x faster in AI training, has 2x more throughput in AI Inference, and is 2x more performant in graphics and rendering over the RTX A5500, potentially outperforming the last generation RTX A6000. 

The gen-over-gen performance increases enable every industry to achieve more with RTX 5000. For architectural engineering, product design, and game development workloads, users will be able to build highly complex 3D models, render large environments faster, power immersive high-fidelity VR, and create more with fewer limitations.

RTX 5000 powers research and data science with faster time to completion of complex simulations, quick prototyping and training of dense AI models, deployment of LLMs and generative AI, and execution of scientific discovery through molecular dynamics and computer-aided drug design.

Model/Specs RTX 5000 RTX A5500
Architecture Ada Lovelace Ampere
GPU Memory 32GB GDDR6 ECC 24GB GDDR6 ECC
Bus Interface 256-bit 384-bit
Memory Bandwidth 576 GB/s 768 GB/s
CUDA Cores 12,800 10,240
Ray Tracing Cores 100 (3rd Gen) 80 (2nd Gen)
Tensor Cores 400 (4th Gen) 320 (3rd Gen)
Single Precision Perf. 65.3 TFLOPS 34.1 TFLOPS
RT Core Performance 151.0 TFLOPS 66.6 TFLOPS
Tensor Core Performance 1044.4 TFLOPS (FP8) 522.2 TFLOPS (FP16) 272.8 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 2x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Dual Slot, Full Length Dual Slot, Full Length
Power Consumption 250W 230W

NVIDIA L40S vs. NVIDIA A100

With increased demand in training the next impactful AI model, many startups and corporations gravitate towards NVIDIA DGX and HGX systems as AI complexity increases by the second. The NVIDIA L40S delivers comparable AI performance versus the last generation flagship NVIDIA A100 Tensor Core GPU. The L40S slots in as a mainstream, low barrier of entry, easy-to-implement GPU accelerator for any user to get started and boost their computing power to handle any workload. 

NVIDIA L40S enables 1.2x more generative AI performance and 1.7x more inference performance over the NVIDIA A100. For LLM models, bring 1.2X the performance of an HGX A100 system to the table using an 8-GPU L40S configuration. With performance numbers close to the NVIDIA A100, the L40S’s availability, fast deployment time, and versatility become the deciding factor.

While scalability is a highlight for DGX and HGX systems, the L40S is a competitive option for those on a tighter budget and even tighter timeframe looking to train and deploy AI. And with 48GB of GPU memory, 142 RT Cores, and exceptional single precision performance, the L40S provides versatility as the most powerful universal GPU for the data center, accelerating performance for graphics rendering, video applications, and other various HPC workloads aside from AI. NVIDIA L40S scales best for small and mid-scale operations for training AI, LLM inferencing, and multi-workload environments.

Model/Specs NVIDIA L40S NVIDIA A100 Tensor Core GPU
Architecture Ada Lovelace Ampere
GPU Memory 48GB GDDR6 80GB HBM2e
Bus Interface 384-bit 5,120-bit
Memory Bandwidth 864 GB/s 2,039 GB/s
L2 Cache 96MB 40MB
CUDA Cores 18,176 6,912
Ray Tracing Cores 142 (3rd Gen) N/A
Tensor Cores 568 (4th Gen) 432 (3rd Gen)
Single Precision Perf. 91.6 TFLOPS 19.5 TFLOPS
RT Core Performance 212.0 TFLOPS N/A
Tensor Core Performance 1466 TFLOPS (FP8) 733 TFLOPS (FP16) 624 TFLOPS (FP16)
Encode Decode 3x NVENC, 3x NVDEC (+AV1 E&D) 0x NVENC, 5x NVDEC
System Interconnect PCIe 4.0 x16 SXM4 over PCIe 4.0 x16
Form Factor Dual Slot, Full Length SXM4 8-Way HGX
Power Consumption 350W 400W

RTX 4500 Ada Generation vs. RTX A4500

The NVIDIA RTX A4500 GPUs are Exxact customer favorites to slot into workstations and servers because of their top-tier performance for the cost. The RTX 4500 Ada Generation, set to release later this year, extends this performance prowess to increase mainstream performance applications like rendering, generative AI, engineering simulations, molecular dynamics, and workloads using the NVIDIA Omniverse platform for 3D creation.

With the increase in memory, cores, and accelerators, RTX 4500 showcases a very healthy performance leap: 1.6x increase in graphics performance, 2.7x increased Omniverse rendering workload, 1.5x in classic rendering workloads, and 1.5x in AI training and inferencing over the last generation RTX A4500.

Model/Specs RTX 4500 RTX A4500
Architecture Ada Lovelace Ampere
GPU Memory 24GB GDDR6 ECC 20GB GDDR6 ECC
Bus Interface 192-bit 320-bit
Memory Bandwidth 432 GB/s 640 GB/s
CUDA Cores 7,680 7168
Ray Tracing Cores 60 (3rd Gen) 56 (2nd Gen)
Tensor Cores 240 (4th Gen) 224 (3rd Gen)
Single Precision Perf. 39.6 TFLOPS 23.7 TFLOPS
RT Core Performance 91.6 TFLOPS 46.2 TFLOPS
Tensor Core Performance 634.0 TFLOPS (FP8) 317.0 TFLOPS (FP16) 189.2 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Dual Slot, Full Length Dual Slot, Full Length
Power Consumption 210W 200W

RTX 4000 Ada Generation vs. RTX A4000

The RTX 4000 will be the single-slot GPU of choice for accelerating graphics, rendering, and high-performance compute workloads. Both RTX 4000 and RTX A4000 showcase the same amount of CUDA cores, RT Cores, and Tensor Cores yet they’re based on district architectures while maintaining similar thermal ratings. Let’s see the gen-over-gen improvements.

The RTX 4000 has a 1.5x performance uplift in graphics and 1.7x rendering performance, and 1.2x generative AI performance, with up to double the ray-tracing throughput and up to triple the Tensor Core throughput. RTX 4000 tackles demanding creative workloads, design, and engineering workflows in a compact single-slot design, perfect for multi-GPU workstation and server configurations.

Model/Specs RTX 4000 Ada RTX A4000
Architecture Ada Lovelace Ampere
GPU Memory 20GB GDDR6 ECC 16GB GDDR6 ECC
Bus Interface 160-bit 256-bit
Memory Bandwidth 360 GB/s 448 GB/s
CUDA Cores 6,144 6,144
Ray Tracing Cores 48 (3rd Gen) 48 (2nd Gen)
Tensor Cores 192 (4th Gen) 192 (3rd Gen)
Single Precision Perf. 26.7 TFLOPS 19.2 TFLOPS
RT Core Performance 61.8 TFLOPS 37.4 TFLOPS
Tensor Core Performance 427.6 TFLOPS (FP8) 213.8 TFLOPS (FP16) 153.4 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Single Slot, Full Length Single Slot, Full Length
Power Consumption 130W 140W

Where to Get the New NVIDIA Ada Generation GPUs

At Exxact Corporation, we deliver high-performance computing platforms and solutions featuring NVIDIA GPUs; you can expect the RTX 5000 and NVIDIA L40S to show up in the Exxact System Configurator starting today! The RTX 4500 and RTX 4000 will follow shortly after their release.

We strive to provide the computing resources to determine the optimal system for your workload so you can solve complex computing challenges, accelerate groundbreaking discoveries, maintain a competitive edge, and fuel innovation. From a fleet of small form factor workstations all the way to full-scale multi-node clusters, build or upgrade your computing infrastructure with the new NVIDIA Ada Generation GPUs.


Have any questions about the new NVIDIA GPUs? Want to stay up to date on when they become available?
Contact us today for more information!


EXX-Blog-New-NVIDIA-GPUs-RTX5000-RTX4500-RTX4000ada-L40S.jpg
News

Exxact Features New NVIDIA Ada Generation GPUs - RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S

August 8, 20239 min read

Newest NVIDIA Ada Generation GPUs Available in Exxact System Configurators Today

Exxact will feature four new NVIDIA Ada Generation GPUs – the NVIDIA RTX 5000, RTX 4500, RTX 4000, and NVIDIA L40S – announced by NVIDIA at SIGGRAPH 2023, in our workstations, servers, and clusters. The new NVIDIA GPUs deliver compelling gen-over-gen performance for a competitive edge in real-time rendering environments, 3D graphics acceleration and development, high-performance scientific computing, AI and deep learning capabilities, and more. 

Previous generation NVIDIA RTX A5500, A5000, A4500, and A4000 GPUs are still crowd favorites for high-performance computing and the NVIDIA A100 Tensor Core GPU is in high demand to power the artificial intelligence boom. The new Ada Lovelace GPU lineup delivers the performance professionals need to innovate. 

The NVIDIA RTX 5000 and NVIDIA L40S are available to order now in Exxact configurable systems, while the RTX 4500 and RTX 4000 will follow in the fall. Let's delve in to see what to expect from these new GPUs.

The New RTX 5000 Ada Generation vs. RTX A5500

The NVIDIA RTX 5000 improves upon the last generation RTX A5500 with more GPU memory, CUDA cores, RT Cores, and Tensor Cores. And with the newer generation Ada Lovelace architecture, the RTX 5000 is 1.5x faster in AI training, has 2x more throughput in AI Inference, and is 2x more performant in graphics and rendering over the RTX A5500, potentially outperforming the last generation RTX A6000. 

The gen-over-gen performance increases enable every industry to achieve more with RTX 5000. For architectural engineering, product design, and game development workloads, users will be able to build highly complex 3D models, render large environments faster, power immersive high-fidelity VR, and create more with fewer limitations.

RTX 5000 powers research and data science with faster time to completion of complex simulations, quick prototyping and training of dense AI models, deployment of LLMs and generative AI, and execution of scientific discovery through molecular dynamics and computer-aided drug design.

Model/Specs RTX 5000 RTX A5500
Architecture Ada Lovelace Ampere
GPU Memory 32GB GDDR6 ECC 24GB GDDR6 ECC
Bus Interface 256-bit 384-bit
Memory Bandwidth 576 GB/s 768 GB/s
CUDA Cores 12,800 10,240
Ray Tracing Cores 100 (3rd Gen) 80 (2nd Gen)
Tensor Cores 400 (4th Gen) 320 (3rd Gen)
Single Precision Perf. 65.3 TFLOPS 34.1 TFLOPS
RT Core Performance 151.0 TFLOPS 66.6 TFLOPS
Tensor Core Performance 1044.4 TFLOPS (FP8) 522.2 TFLOPS (FP16) 272.8 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 2x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Dual Slot, Full Length Dual Slot, Full Length
Power Consumption 250W 230W

NVIDIA L40S vs. NVIDIA A100

With increased demand in training the next impactful AI model, many startups and corporations gravitate towards NVIDIA DGX and HGX systems as AI complexity increases by the second. The NVIDIA L40S delivers comparable AI performance versus the last generation flagship NVIDIA A100 Tensor Core GPU. The L40S slots in as a mainstream, low barrier of entry, easy-to-implement GPU accelerator for any user to get started and boost their computing power to handle any workload. 

NVIDIA L40S enables 1.2x more generative AI performance and 1.7x more inference performance over the NVIDIA A100. For LLM models, bring 1.2X the performance of an HGX A100 system to the table using an 8-GPU L40S configuration. With performance numbers close to the NVIDIA A100, the L40S’s availability, fast deployment time, and versatility become the deciding factor.

While scalability is a highlight for DGX and HGX systems, the L40S is a competitive option for those on a tighter budget and even tighter timeframe looking to train and deploy AI. And with 48GB of GPU memory, 142 RT Cores, and exceptional single precision performance, the L40S provides versatility as the most powerful universal GPU for the data center, accelerating performance for graphics rendering, video applications, and other various HPC workloads aside from AI. NVIDIA L40S scales best for small and mid-scale operations for training AI, LLM inferencing, and multi-workload environments.

Model/Specs NVIDIA L40S NVIDIA A100 Tensor Core GPU
Architecture Ada Lovelace Ampere
GPU Memory 48GB GDDR6 80GB HBM2e
Bus Interface 384-bit 5,120-bit
Memory Bandwidth 864 GB/s 2,039 GB/s
L2 Cache 96MB 40MB
CUDA Cores 18,176 6,912
Ray Tracing Cores 142 (3rd Gen) N/A
Tensor Cores 568 (4th Gen) 432 (3rd Gen)
Single Precision Perf. 91.6 TFLOPS 19.5 TFLOPS
RT Core Performance 212.0 TFLOPS N/A
Tensor Core Performance 1466 TFLOPS (FP8) 733 TFLOPS (FP16) 624 TFLOPS (FP16)
Encode Decode 3x NVENC, 3x NVDEC (+AV1 E&D) 0x NVENC, 5x NVDEC
System Interconnect PCIe 4.0 x16 SXM4 over PCIe 4.0 x16
Form Factor Dual Slot, Full Length SXM4 8-Way HGX
Power Consumption 350W 400W

RTX 4500 Ada Generation vs. RTX A4500

The NVIDIA RTX A4500 GPUs are Exxact customer favorites to slot into workstations and servers because of their top-tier performance for the cost. The RTX 4500 Ada Generation, set to release later this year, extends this performance prowess to increase mainstream performance applications like rendering, generative AI, engineering simulations, molecular dynamics, and workloads using the NVIDIA Omniverse platform for 3D creation.

With the increase in memory, cores, and accelerators, RTX 4500 showcases a very healthy performance leap: 1.6x increase in graphics performance, 2.7x increased Omniverse rendering workload, 1.5x in classic rendering workloads, and 1.5x in AI training and inferencing over the last generation RTX A4500.

Model/Specs RTX 4500 RTX A4500
Architecture Ada Lovelace Ampere
GPU Memory 24GB GDDR6 ECC 20GB GDDR6 ECC
Bus Interface 192-bit 320-bit
Memory Bandwidth 432 GB/s 640 GB/s
CUDA Cores 7,680 7168
Ray Tracing Cores 60 (3rd Gen) 56 (2nd Gen)
Tensor Cores 240 (4th Gen) 224 (3rd Gen)
Single Precision Perf. 39.6 TFLOPS 23.7 TFLOPS
RT Core Performance 91.6 TFLOPS 46.2 TFLOPS
Tensor Core Performance 634.0 TFLOPS (FP8) 317.0 TFLOPS (FP16) 189.2 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Dual Slot, Full Length Dual Slot, Full Length
Power Consumption 210W 200W

RTX 4000 Ada Generation vs. RTX A4000

The RTX 4000 will be the single-slot GPU of choice for accelerating graphics, rendering, and high-performance compute workloads. Both RTX 4000 and RTX A4000 showcase the same amount of CUDA cores, RT Cores, and Tensor Cores yet they’re based on district architectures while maintaining similar thermal ratings. Let’s see the gen-over-gen improvements.

The RTX 4000 has a 1.5x performance uplift in graphics and 1.7x rendering performance, and 1.2x generative AI performance, with up to double the ray-tracing throughput and up to triple the Tensor Core throughput. RTX 4000 tackles demanding creative workloads, design, and engineering workflows in a compact single-slot design, perfect for multi-GPU workstation and server configurations.

Model/Specs RTX 4000 Ada RTX A4000
Architecture Ada Lovelace Ampere
GPU Memory 20GB GDDR6 ECC 16GB GDDR6 ECC
Bus Interface 160-bit 256-bit
Memory Bandwidth 360 GB/s 448 GB/s
CUDA Cores 6,144 6,144
Ray Tracing Cores 48 (3rd Gen) 48 (2nd Gen)
Tensor Cores 192 (4th Gen) 192 (3rd Gen)
Single Precision Perf. 26.7 TFLOPS 19.2 TFLOPS
RT Core Performance 61.8 TFLOPS 37.4 TFLOPS
Tensor Core Performance 427.6 TFLOPS (FP8) 213.8 TFLOPS (FP16) 153.4 TFLOPS (FP16)
Encode Decode 2x NVENC, 2x NVDEC (+AV1 encode & decode) 1x NVENC, 1x NVDEC (+AV1 decode)
System Interconnect PCIe 4.0 x16 PCIe 4.0 x16
Form Factor Single Slot, Full Length Single Slot, Full Length
Power Consumption 130W 140W

Where to Get the New NVIDIA Ada Generation GPUs

At Exxact Corporation, we deliver high-performance computing platforms and solutions featuring NVIDIA GPUs; you can expect the RTX 5000 and NVIDIA L40S to show up in the Exxact System Configurator starting today! The RTX 4500 and RTX 4000 will follow shortly after their release.

We strive to provide the computing resources to determine the optimal system for your workload so you can solve complex computing challenges, accelerate groundbreaking discoveries, maintain a competitive edge, and fuel innovation. From a fleet of small form factor workstations all the way to full-scale multi-node clusters, build or upgrade your computing infrastructure with the new NVIDIA Ada Generation GPUs.


Have any questions about the new NVIDIA GPUs? Want to stay up to date on when they become available?
Contact us today for more information!