Engineering MPD

Ansys HPC Pack - Ansys Licensing for CPUs and GPUs Explained

May 2, 2024
12 min read
EXX-Blog-Ansys-HPC-pack-liscensing-for-cpu-gpu.jpg

Ansys has a quite complex licensing format where enabling more compute nodes or higher performance hardware will require more licensing. Ansys takes into account the number of cores that are enabled for a specific workload called HPC Packs. Price per license fluctuates so we will not be including price here.

What are Ansys HPC Packs?

HPC Packs are license(s) that define the total supported cores enabled for your Ansys calculations, most notably, Fluent, Mechanical, HFSS, and more. The quantity of HPC packs purchased determines the designated number of CPU cores you can use in a single calculation.

The licensing model typically involves purchasing a certain number of HPC Packs for a single system, which then allows a determined amount of compute to be used whether that is CPU cores or GPUs. The more HPC Packs purchased, the more cores enabled for faster performance. Determining the number of packs required depends on factors such as the size and complexity of the simulations being performed, as well as the desired speed and efficiency of the analysis.

The number of cores enabled by an HPC pack is calculated with this simple formula where h = number of HPC Pack(s). These cores often refer to physical CPU cores or GPU Streaming Multiprocessors (SMs). More on SMs later. There is also a default of 4 solver cores offered regardless of HPC packs purchased. The formula for total cores offered in a specific HPC pack is as follows, where h is the HPC packs purchased:

EXX-Blog-Ansys-HPC-pack-liscensing-for-cpu-gpu-total-cores-formula.png

The table, as follows, will only go up to 5 HPC Packs but you can calculate the next tier for more allocation if necessary, using the formula.

CPU Core Count & Ansys HPC Pack Matrix

HPC Packs HPC Pack Cores Solver Cores Total Cores
0 0 4 4
1 8 4 12
2 32 4 36
3 128 4 132
4 512 4 516
5 2048 4 2052

Ansys HPC Packs are an exponential function gain so the increase from 20 to 2000 cores wouldn’t incur a 100x price cost. These total cores mean that even if your system houses more cores than the given HPC pack amount, those cores will be inactive for the calculation. For example, an individual with 2 HPC packs can only use 36 cores, so if your system has 48 total cores it would only use 36 cores with the rest left idle.

Organizations can choose to split up or consolidate their HPC pack licenses. However, the calculation of cores per system differs. For example, an organization that wants 4 HPC packs distributed between 2 separate and individual systems (2 Packs + 2 Packs) will have 36 + 36 = 72 total cores. If they did not distribute the packs and consolidated them onto a single system 4 HPC packs would yield 516 cores.

  • Sequential Jobs on Single Machine - Individualized workstations can benefit organizations with multiple departments running simulations and don’t need/want to manage compute resources. This computing format would isolate data to the individual machines and require more licenses.
  • Sequential Jobs on Multi-Node Machine - A consolidated system can benefit organizations that run extremely large simulations that require massive computing. Jobs and workflows that don't mind being queued for simulation calculation can utilize the most cores per license.
  • Simultaneous Jobs on Multi-Node Machine - Users still can distribute their simulations across a large compute node running multiple simulations on the same machine in parallel but require additional solver licenses or Ansys HPC Workgroup.

Both methods accelerate the simulation process, enabling you to tackle engineering problems efficiently, leading to increased productivity, faster time-to-market for products, and better products through iterative design.

Ansys HPC Packs for GPUs

Ansys HPC Packs don’t just define licensing costs for CPUs; they also define the cost of running Ansys on GPUs. Many Ansys simulation applications are still CPU-reliant, but GPUs are paramount in modern high-performance computing. Some Ansys simulation solvers, including Fluent, HFSS SBR+, and Mechanical ADPL, are accelerated or native to GPUs (with certain solvers requiring FP64-capable GPUs).

GPU microarchitecture differs from CPUs housing hundreds of times more cores than CPUs. So instead of using core counts as the variable, they use SMs or Streaming Multiprocessors. Not to go into full detail about GPU microarchitecture, but Streaming Multiprocessors or SMs are the building blocks of a GPU that store multiple cores, cache, and controllers. Think of SMs as a group of workers tasked with computations, memory management, instruction pipelines and whatnot and a GPU has a factory of these groups. These SMs enable parallelized computing thus, are what Ansys counts as “cores” in this case.

SMs HPC workgroup HPC Packs
1-40 0 0
41-48 1-8 1
49-72 9-32 2
73-168 33-128 3
169-552 129-512 4
553-2088 513-2048 5

Finding a GPU's SM count is not as easy as looking at the spec sheet. We listed current-gen and popular GPU models and their SM counts found by scouring GPU microarchitecture whitepapers and third-party sources like techpowerup.com. We will also compare how many HPC packs would equate to each GPU. We also added the GPU memory to help gauge which GPUs suit your potential model size (this does not affect the HPC pack licensing).

GPU SM Count HPC Packs Needed GPU Memory
RTX 2000 Ada 22 0 16GB GDDR6
RTX 4000 Ada 48 1 20GB GDDR6
RTX 4500 Ada 60 2 24GB GDDR6
RTX 5000 Ada 100 3 32GB GDDR6
RTX 6000 Ada 142 3 48GB GDDR6
NVIDIA A800 40GB Active 108 3 40GB HBM2e
NVIDIA A100 108 3 80GB HBM2e
NVIDIA H100 114 3 80GB HBM2e

Unlike how HPC packs work for CPUs if the GPU configuration exceeds the SM count threshold for HPC packs, additional purchase of the next tier up is required since SMs cannot be disabled by will. Therefore, depending on configuration, careful consideration of single or multiple GPUs is necessary for optimizing each HPC pack license. Some GPUs should be prioritized over others on this list since they maximize the number of available SMs per HPC Pack.

For example, a single RTX 5000 Ada (100 SMs) requires 3 HPC packs (up to 168 SMs). The faster and larger RTX 6000 Ada (142 SMs) would also cost 3 HPC packs. By paying the price of the higher tier card, you can maximize the price you’re paying for Ansys HPC pack licensing while getting better performance. But for multi-GPU configurations, the story is a little different:

GPU SM Count HPC Packs GPU Memory
1x RTX 6000 Ada 142 3 48GB GDDR6
2x RTX 6000 Ada 284 4 96GB GDDR6
3x RTX 6000 Ada 426 4 144GB GDDR6
4x RTX 6000 Ada 568 5 192GB GDDR6
1x RTX 5000 Ada 100 3 32GB GDDR6
2x RTX 5000 Ada 200 4 64GB GDDR6
3x RTX 5000 Ada 300 4 96GB GDDR6
4x RTX 5000 Ada 400 4 128GB GDDR6
1x NVIDIA A800 108 3 40GB HBM2e
2x NVIDIA A800 216 4 80GB HBM2e
3x NVIDIA A800 324 4 120GB HBM2e
4x NVIDIA A800 432 4 160GB HBM2e

In a 4x GPU configuration, the RTX 6000 Ada may not be a smart investment due to the extra license needed to be purchased for you to be able to run it. Be strategic with licensing by maximizing per core or per SM limit without spilling over to the next bracket. The RTX 5000 Ada may be a bit slower, but if the simulation size calls for 4x GPUs, purchasing an extra GPU without the need for an extra license can influence budget and performance considerations.

Also, for 4x GPU configurations, the NVIDIA A800 only requires 4x HPC packs and has native double-precision FP64. The A800 is a more expensive GPU compared to the RTX 6000 Ada, but the A800’s FP64 capability, HBM2e GPU memory, and an unneeded 5th HPC pack can influence decisions depending on the type of simulations and solvers. Of course, there are other considerations like GPU performance between RTX 6000 Ada versus the NVIDIA A800, benchmarks coming soon…

Conclusion

Accelerating computations in Ansys is paramount for engineers in perfecting the product design before putting it on the assembly line and eventually in the hands of the consumer. By running simulations faster, refinements can happen quicker, and innovative products can go to market sooner.

Configuring a high-performance Ansys system has many factors including hardware, licensing costs, performance, and infrastructure. Optimizing price to performance is essential for maximizing value in a large investment like this. Exxact is here to deliver the computing systems and tools for enabling reduced simulation time and success. If you have any questions regarding hardware configurations, contact our team and we will do our best to help guide you in the right direction.

Accelerate Simulations in Ansys with GPUs

With the latest CPUs and most powerful GPUs available, accelerate your Ansys simulation and CFD project optimized to your deployment, budget, and desired performance!

Configure Now

Topics

EXX-Blog-Ansys-HPC-pack-liscensing-for-cpu-gpu.jpg
Engineering MPD

Ansys HPC Pack - Ansys Licensing for CPUs and GPUs Explained

May 2, 202412 min read

Ansys has a quite complex licensing format where enabling more compute nodes or higher performance hardware will require more licensing. Ansys takes into account the number of cores that are enabled for a specific workload called HPC Packs. Price per license fluctuates so we will not be including price here.

What are Ansys HPC Packs?

HPC Packs are license(s) that define the total supported cores enabled for your Ansys calculations, most notably, Fluent, Mechanical, HFSS, and more. The quantity of HPC packs purchased determines the designated number of CPU cores you can use in a single calculation.

The licensing model typically involves purchasing a certain number of HPC Packs for a single system, which then allows a determined amount of compute to be used whether that is CPU cores or GPUs. The more HPC Packs purchased, the more cores enabled for faster performance. Determining the number of packs required depends on factors such as the size and complexity of the simulations being performed, as well as the desired speed and efficiency of the analysis.

The number of cores enabled by an HPC pack is calculated with this simple formula where h = number of HPC Pack(s). These cores often refer to physical CPU cores or GPU Streaming Multiprocessors (SMs). More on SMs later. There is also a default of 4 solver cores offered regardless of HPC packs purchased. The formula for total cores offered in a specific HPC pack is as follows, where h is the HPC packs purchased:

EXX-Blog-Ansys-HPC-pack-liscensing-for-cpu-gpu-total-cores-formula.png

The table, as follows, will only go up to 5 HPC Packs but you can calculate the next tier for more allocation if necessary, using the formula.

CPU Core Count & Ansys HPC Pack Matrix

HPC Packs HPC Pack Cores Solver Cores Total Cores
0 0 4 4
1 8 4 12
2 32 4 36
3 128 4 132
4 512 4 516
5 2048 4 2052

Ansys HPC Packs are an exponential function gain so the increase from 20 to 2000 cores wouldn’t incur a 100x price cost. These total cores mean that even if your system houses more cores than the given HPC pack amount, those cores will be inactive for the calculation. For example, an individual with 2 HPC packs can only use 36 cores, so if your system has 48 total cores it would only use 36 cores with the rest left idle.

Organizations can choose to split up or consolidate their HPC pack licenses. However, the calculation of cores per system differs. For example, an organization that wants 4 HPC packs distributed between 2 separate and individual systems (2 Packs + 2 Packs) will have 36 + 36 = 72 total cores. If they did not distribute the packs and consolidated them onto a single system 4 HPC packs would yield 516 cores.

  • Sequential Jobs on Single Machine - Individualized workstations can benefit organizations with multiple departments running simulations and don’t need/want to manage compute resources. This computing format would isolate data to the individual machines and require more licenses.
  • Sequential Jobs on Multi-Node Machine - A consolidated system can benefit organizations that run extremely large simulations that require massive computing. Jobs and workflows that don't mind being queued for simulation calculation can utilize the most cores per license.
  • Simultaneous Jobs on Multi-Node Machine - Users still can distribute their simulations across a large compute node running multiple simulations on the same machine in parallel but require additional solver licenses or Ansys HPC Workgroup.

Both methods accelerate the simulation process, enabling you to tackle engineering problems efficiently, leading to increased productivity, faster time-to-market for products, and better products through iterative design.

Ansys HPC Packs for GPUs

Ansys HPC Packs don’t just define licensing costs for CPUs; they also define the cost of running Ansys on GPUs. Many Ansys simulation applications are still CPU-reliant, but GPUs are paramount in modern high-performance computing. Some Ansys simulation solvers, including Fluent, HFSS SBR+, and Mechanical ADPL, are accelerated or native to GPUs (with certain solvers requiring FP64-capable GPUs).

GPU microarchitecture differs from CPUs housing hundreds of times more cores than CPUs. So instead of using core counts as the variable, they use SMs or Streaming Multiprocessors. Not to go into full detail about GPU microarchitecture, but Streaming Multiprocessors or SMs are the building blocks of a GPU that store multiple cores, cache, and controllers. Think of SMs as a group of workers tasked with computations, memory management, instruction pipelines and whatnot and a GPU has a factory of these groups. These SMs enable parallelized computing thus, are what Ansys counts as “cores” in this case.

SMs HPC workgroup HPC Packs
1-40 0 0
41-48 1-8 1
49-72 9-32 2
73-168 33-128 3
169-552 129-512 4
553-2088 513-2048 5

Finding a GPU's SM count is not as easy as looking at the spec sheet. We listed current-gen and popular GPU models and their SM counts found by scouring GPU microarchitecture whitepapers and third-party sources like techpowerup.com. We will also compare how many HPC packs would equate to each GPU. We also added the GPU memory to help gauge which GPUs suit your potential model size (this does not affect the HPC pack licensing).

GPU SM Count HPC Packs Needed GPU Memory
RTX 2000 Ada 22 0 16GB GDDR6
RTX 4000 Ada 48 1 20GB GDDR6
RTX 4500 Ada 60 2 24GB GDDR6
RTX 5000 Ada 100 3 32GB GDDR6
RTX 6000 Ada 142 3 48GB GDDR6
NVIDIA A800 40GB Active 108 3 40GB HBM2e
NVIDIA A100 108 3 80GB HBM2e
NVIDIA H100 114 3 80GB HBM2e

Unlike how HPC packs work for CPUs if the GPU configuration exceeds the SM count threshold for HPC packs, additional purchase of the next tier up is required since SMs cannot be disabled by will. Therefore, depending on configuration, careful consideration of single or multiple GPUs is necessary for optimizing each HPC pack license. Some GPUs should be prioritized over others on this list since they maximize the number of available SMs per HPC Pack.

For example, a single RTX 5000 Ada (100 SMs) requires 3 HPC packs (up to 168 SMs). The faster and larger RTX 6000 Ada (142 SMs) would also cost 3 HPC packs. By paying the price of the higher tier card, you can maximize the price you’re paying for Ansys HPC pack licensing while getting better performance. But for multi-GPU configurations, the story is a little different:

GPU SM Count HPC Packs GPU Memory
1x RTX 6000 Ada 142 3 48GB GDDR6
2x RTX 6000 Ada 284 4 96GB GDDR6
3x RTX 6000 Ada 426 4 144GB GDDR6
4x RTX 6000 Ada 568 5 192GB GDDR6
1x RTX 5000 Ada 100 3 32GB GDDR6
2x RTX 5000 Ada 200 4 64GB GDDR6
3x RTX 5000 Ada 300 4 96GB GDDR6
4x RTX 5000 Ada 400 4 128GB GDDR6
1x NVIDIA A800 108 3 40GB HBM2e
2x NVIDIA A800 216 4 80GB HBM2e
3x NVIDIA A800 324 4 120GB HBM2e
4x NVIDIA A800 432 4 160GB HBM2e

In a 4x GPU configuration, the RTX 6000 Ada may not be a smart investment due to the extra license needed to be purchased for you to be able to run it. Be strategic with licensing by maximizing per core or per SM limit without spilling over to the next bracket. The RTX 5000 Ada may be a bit slower, but if the simulation size calls for 4x GPUs, purchasing an extra GPU without the need for an extra license can influence budget and performance considerations.

Also, for 4x GPU configurations, the NVIDIA A800 only requires 4x HPC packs and has native double-precision FP64. The A800 is a more expensive GPU compared to the RTX 6000 Ada, but the A800’s FP64 capability, HBM2e GPU memory, and an unneeded 5th HPC pack can influence decisions depending on the type of simulations and solvers. Of course, there are other considerations like GPU performance between RTX 6000 Ada versus the NVIDIA A800, benchmarks coming soon…

Conclusion

Accelerating computations in Ansys is paramount for engineers in perfecting the product design before putting it on the assembly line and eventually in the hands of the consumer. By running simulations faster, refinements can happen quicker, and innovative products can go to market sooner.

Configuring a high-performance Ansys system has many factors including hardware, licensing costs, performance, and infrastructure. Optimizing price to performance is essential for maximizing value in a large investment like this. Exxact is here to deliver the computing systems and tools for enabling reduced simulation time and success. If you have any questions regarding hardware configurations, contact our team and we will do our best to help guide you in the right direction.

Accelerate Simulations in Ansys with GPUs

With the latest CPUs and most powerful GPUs available, accelerate your Ansys simulation and CFD project optimized to your deployment, budget, and desired performance!

Configure Now

Topics