• NVIDIA A2 PCIe 16GB GPU

NVIDIA A2 PCIe 16GB GPU

  • Released June 28th, 2021
  • GA107 Graphic Processor
  • 1280 Cores
  • 40 TMUS
  • 32 ROPS
  • 16GB Memory Size
  • GDDR6 Memory Type
  • 128 bit BUS Width

  • $USD $2,402.00

    *RRP Pricing*

    To View Channel Discounts Please Login


NVIDIA A2 PCIe 16GB GPU

NVIDIA A2 TENSOR CORE GPU Entry-level GPU that brings NVIDIA AI to any server.Start configuring your GP-GPU Server now!S5N | D43N-3UHSGP1GZ2 | G492-ZD2Versatile Entry-Level InferenceThe NVIDIA A2 Tensor Core GPU provide

NVIDIA A2 TENSOR CORE GPU

Entry-level GPU that brings NVIDIA AI to any server.


Start configuring your GP-GPU Server now!


S5N | D43N-3U


HSGP1


GZ2 | G492-ZD2

Versatile Entry-Level Inference

The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power, a small footprint, and high performance for NVIDIA AI at the edge. Featuring a low-profile PCIe Gen4 card and a low 40-60W configurable thermal design power (TDP) capability, the A2 brings versatile inference acceleration to any server for deployment at scale.

Up to 20X More Inference Performance

AI inference is deployed to enhance consumer lives with smart, real-time experiences and to gain insights from trillions of end-point sensors and cameras. Compared to CPU-only servers, edge and entry-level servers with NVIDIA A2 Tensor Core GPUs offer up to 20X more inference performance, instantly upgrading any server to handle modern AI.

Higher IVA Performance for the Intelligent Edge

Servers equipped with NVIDIA A2 GPUs offer up to 1.3X more performance in intelligent edge use cases, including smart cities, manufacturing, and retail. NVIDIA A2 GPUs running IVA workloads deliver more efficient deployments with up to 1.6X better price-performance and 10 percent better energy efficiency than previous GPU generations.

Optimized for Any Server

NVIDIA A2 is optimized for inference workloads and deployments in entry-level servers constrained by space and thermal requirements, such as 5G edge and industrial environments. A2 delivers a low-profile form factor operating in a low-power envelope, from a TDP of 60W down to 40W, making it ideal for any server.  

Leading AI Inference Performance Across Cloud, Data Center, and Edge

AI inference continues to drive breakthrough innovation across industries, including consumer internet, healthcare and life sciences, financial services, retail, manufacturing, and supercomputing. A2’s small form factor and low power combined with the NVIDIA A100 and A30 Tensor Core GPUs deliver a complete AI inference portfolio across cloud, data center, and edge. A2 and the NVIDIA AI inference portfolio ensure AI applications deploy with fewer servers and less power, resulting in faster insights with substantially lower costs.


The A2 is a professional graphics card by NVIDIA, launched on November 10th, 2021. Built on the 8 nm process, and based on the GA107 graphics processor, the card supports DirectX 12 Ultimate. Unlike the fully unlocked GeForce RTX 3050 8 GB GA107, which uses the same GPU but has all 2560 shaders enabled, NVIDIA has disabled some shading units on the A2 to reach the product's target shader count. It features 1280 shading units, 40 texture mapping units, and 32 ROPs. Also included are 40 tensor cores which help improve the speed of machine learning applications. The card also has 10 raytracing acceleration cores. NVIDIA has paired 16 GB GDDR6 memory with the A2, which are connected using a 128-bit memory interface. The GPU is operating at a frequency of 1440 MHz, which can be boosted up to 1770 MHz, memory is running at 1563 MHz (12.5 Gbps effective).

Being a single-slot card, the NVIDIA A2 does not require any additional power connector, its power draw is rated at 60 W maximum. This device has no display connectivity, as it is not designed to have monitors connected to it. A2 is connected to the rest of the system using a PCI-Express 4.0 x8 interface.

Start configuring your GP-GPU Server now!



S5N | D43N-3U


HSGP1


GZ2 | G492-ZD2

Peak FP324.5 TF
TF32 Tensor Core9 TF | 18 TF¹
BFLOAT16 Tensor Core18 TF | 36 TF¹
Peak FP16 Tensor Core18 TF | 36 TF¹
Peak INT8 Tensor Core36 TOPS | 72 TOPS¹
Peak INT4 Tensor Core72 TOPS | 144 TOPS¹
RT Cores10
Media engines1 video encoder
2 video decoders (includes AV1 decode)
GPU memory16GB GDDR6
GPU memory bandwidth200GB/s
InterconnectPCIe Gen4 x8
Form factor1-slot, low-profile PCIe
Max thermal design power (TDP)40–60W (configurable)
Virtual GPU (vGPU) software support²NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS)
Datasheet 1
Title Version Date Size
NVIDIA A2 DS 1 628KB

Tags: NVIDIA, GP, GPU, A12, 16GB, PCIe, Ampere, HPC, AI, Deep Learning, TESLA