GPU architecture NVIDIA Ampere architecture
GPU memory 48 GB GDDR6 with ECC
Memory bandwidth 696 GB/s
Interconnect interface NVIDIA® NVLink® 112.5 GB/s (bidirectional)³; PCIe Gen4: 64 GB/s
NVIDIA Ampere architecture-based CUDA Cores 10,752
NVIDIA second-generation RT Cores 84
NVIDIA third-generation Tensor Cores 336
Peak FP32 TFLOPS (non-Tensor) 37.4
Peak FP16 Tensor TFLOPS with FP16 Accumulate 149.7 | 299.4*
Peak TF32 Tensor TFLOPS 74.8 | 149.6*
RT Core performance TFLOPS 73.1
Peak BF16 Tensor TFLOPS with FP32 Accumulate 149.7 | 299.4*
Peak INT8 Tensor TOPS 299.3 | 598.6*
Peak INT4 Tensor TOPS 598.7 | 1,197.4*
Form factor 4.4" (H) x 10.5" (L), dual slot
Display ports 3x DisplayPort 1.4**; supports NVIDIA Mosaic and Quadro® Sync⁴
Max power consumption 300 W
Power connector 8-pin CPU
Thermal solution Passive
Virtual GPU (vGPU) software support NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server
vGPU profiles supported See the Virtual GPU Licensing Guide
NVENC | NVDEC 1x | 2x (includes AV1 decode)
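
The peak Tensor figures in the table follow a regular pattern that can be checked with a little arithmetic. The Python sketch below copies the values from the table and computes two sets of ratios: the starred value against the unstarred value for each format, and each format's unstarred peak against the next wider format's. It also divides the CUDA Core and Tensor Core counts by the RT Core count. The names PEAK_TENSOR, starred_ratios, and ladder_ratios are illustrative only. The footnote that defines the asterisk is not reproduced in this section, so the code treats the starred column simply as a second measurement of the same quantity; that reading is an assumption.

```python
# Peak Tensor throughput copied from the table: (unstarred, starred).
# Units are TFLOPS for floating-point formats and TOPS for integer formats.
PEAK_TENSOR = {
    "TF32": (74.8, 149.6),
    "FP16": (149.7, 299.4),
    "BF16": (149.7, 299.4),
    "INT8": (299.3, 598.6),
    "INT4": (598.7, 1197.4),
}

# Core counts copied from the table.
CUDA_CORES = 10_752
RT_CORES = 84
TENSOR_CORES = 336


def starred_ratios(table):
    """Return the starred/unstarred ratio per format, rounded to 2 decimals."""
    return {name: round(starred / base, 2) for name, (base, starred) in table.items()}


def ladder_ratios(table, order=("TF32", "FP16", "INT8", "INT4")):
    """Return the ratio of each format's unstarred peak to the previous one's."""
    return {
        f"{nxt}/{prev}": round(table[nxt][0] / table[prev][0], 2)
        for prev, nxt in zip(order, order[1:])
    }


if __name__ == "__main__":
    print(starred_ratios(PEAK_TENSOR))
    print(ladder_ratios(PEAK_TENSOR))
    print(CUDA_CORES // RT_CORES, TENSOR_CORES // RT_CORES)
```

With the table's numbers, every ratio rounds to 2.0: each starred figure is double its unstarred counterpart, and each narrower format roughly doubles the peak of the one before it. The core counts divide evenly into 128 CUDA Cores and 4 Tensor Cores per RT Core.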