
GPU Selection Primer

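Consumer GPUs

| Model | VRAM | FP16 (TFLOPS) | FP32 (TFLOPS) | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|
| RTX 5090 | 32GB | 209.6 | 104.8 | 21760 | 680 (3352 AI TOPS) | Blackwell 2.0 | GDDR7 |
| RTX 5090D | 32GB | 209.6 | 104.8 | 21760 | 680 (2375 AI TOPS) | Blackwell 2.0 | GDDR7 |
| RTX 5080 | 16GB | 112.56 | 56.28 | 10752 | 336 (1801 AI TOPS) | Blackwell 2.0 | GDDR7 |
| RTX 5070 Ti | 16GB | 88.7 | 44.35 | 8960 | 280 (1406 AI TOPS) | Blackwell 2.0 | GDDR7 |
| RTX 5070 | 12GB | 61.68 | 30.84 | 6144 | 192 (988 AI TOPS) | Blackwell 2.0 | GDDR7 |
| RTX 4090 | 24GB | 165.16 | 82.58 | 16384 | 512 (1321 AI TOPS) | Ada Lovelace | GDDR6X |
| RTX 4090D | 24GB | 147.08 | 73.54 | 14592 | 456 (1177 AI TOPS) | Ada Lovelace | GDDR6X |
| RTX 4080 | 16GB | 97.48 | 48.74 | 9728 | 304 (780 AI TOPS) | Ada Lovelace | GDDR6X |
| RTX 4070 Ti | 12GB | 80.18 | 40.09 | 7680 | 240 (641 AI TOPS) | Ada Lovelace | GDDR6X |
| RTX 4070 | 12GB | 58.30 | 29.15 | 5888 | 184 (466 AI TOPS) | Ada Lovelace | GDDR6X |
| RTX 4060 Ti | 16GB | 44.12 | 22.06 | 4352 | 136 (353 AI TOPS) | Ada Lovelace | GDDR6 |
| RTX 4060 Ti | 8GB | 44.12 | 22.06 | 4352 | 136 (353 AI TOPS) | Ada Lovelace | GDDR6 |
| RTX 4060 | 8GB | 30.22 | 15.11 | 3072 | 96 (242 AI TOPS) | Ada Lovelace | GDDR6 |
| RTX 3090 Ti | 24GB | 80.00 | 40.00 | 10752 | 336 (320 AI TOPS) | Ampere | GDDR6X |
| RTX 3090 | 24GB | 71.16 | 35.58 | 10496 | 328 (285 AI TOPS) | Ampere | GDDR6X |
| RTX 3080 Ti | 12GB | 68.20 | 34.10 | 10240 | 320 | Ampere | GDDR6X |
| RTX 3080 | 12GB | 61.28 | 30.64 | 8960 | 280 | Ampere | GDDR6X |
| RTX 3080 | 10GB | 59.54 | 29.77 | 8704 | 272 | Ampere | GDDR6X |
| RTX 3070 Ti | 8GB | 43.50 | 21.75 | 6144 | 192 | Ampere | GDDR6X |
| RTX 3070 | 8GB | 40.62 | 20.31 | 5888 | 184 | Ampere | GDDR6 |
| RTX 3060 Ti | 8GB | 33.40 | 16.20 | 4864 | 152 | Ampere | GDDR6X |
| RTX 3060 Ti | 8GB | 33.40 | 16.20 | 4864 | 152 | Ampere | GDDR6 |
| RTX 3060 | 12GB | 25.48 | 12.74 | 3584 | 112 | Ampere | GDDR6 |
| RTX 3060 | 8GB | 25.48 | 12.74 | 3584 | 112 | Ampere | GDDR6 |
| RTX 2080 Ti | 11GB | 26.90 | 13.45 | 4352 | 544 | Turing | GDDR6 |
| GTX 1080 Ti | 11GB | 22.68 | 11.34 | 3584 | - | Pascal | GDDR5X |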

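Professional GPUs

| Model | VRAM | FP16 (TFLOPS) | FP32 (TFLOPS) | FP64 (TFLOPS) | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| NVIDIA RTX A6000 | 48GB | 77.42 | 38.71 | 1.209 | 10752 | 336 | Ampere | GDDR6 |
| NVIDIA RTX A5000 | 24GB | 55.54 | 27.77 | 0.867 | 10752 | 256 | Ampere | GDDR6 |
| NVIDIA RTX A4000 | 16GB | 38.34 | 19.17 | 0.599 | 6144 | 192 | Ampere | GDDR6 |
| Quadro RTX 8000 | 48GB | 32.62 | 16.31 | 0.509 | 4608 | 576 | Turing | GDDR6 |
| Quadro RTX 6000 | 24GB | 32.62 | 16.31 | 0.509 | 4608 | 576 | Turing | GDDR6 |
| Quadro RTX 5000 | 16GB | 22.30 | 11.15 | 0.348 | 3072 | 384 | Turing | GDDR6 |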

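Cloud and Data Center GPUs

NVIDIA Tesla A-Series GPUs

| Model | VRAM | FP16 (TFLOPS) | FP32 (TFLOPS) | FP64 (TFLOPS) | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| NVIDIA A100 SXM4 | 80GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A100 SXM4 | 40GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A100 PCIe | 80GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A100 PCIe | 40GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A800 PCIe | 80GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A800 SXM4 | 80GB | 38.98 | 19.49 | 9.746 | 6912 | 432 | Ampere | HBM2e |
| NVIDIA A40 PCIe | 48GB | 74.84 | 37.42 | 1.169 | 10752 | 336 | Ampere | GDDR6 |
| NVIDIA A30 PCIe | 24GB | 20.64 | 10.32 | 0.322 | 3584 | 224 | Ampere | HBM2e |
| NVIDIA A10 PCIe | 24GB | 62.48 | 31.24 | 0.976 | 9216 | 288 | Ampere | GDDR6 |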

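NVIDIA Tesla V-Series GPUs

| Model | VRAM | FP16 (TFLOPS) | FP32 (TFLOPS) | FP64 (TFLOPS) | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| Tesla V100 PCIe | 16GB | 28.26 | 14.13 | 7.066 | 5120 | 640 | Volta | HBM2 |
| Tesla V100 PCIe | 32GB | 28.26 | 14.13 | 7.066 | 5120 | 640 | Volta | HBM2 |
| Tesla V100 SXM2 | 16GB | 32.71 | 16.35 | 8.177 | 5120 | 640 | Volta | HBM2 |
| Tesla V100 SXM2 | 32GB | 31.33 | 15.67 | 7.834 | 5120 | 640 | Volta | HBM2 |
| Tesla V100 SXM3 | 32GB | 32.71 | 16.35 | 8.177 | 5120 | 640 | Volta | HBM2 |
| Tesla V100S PCIe | 32GB | 32.71 | 16.35 | 8.177 | 5120 | 640 | Volta | HBM2 |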

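NVIDIA Tesla T-Series GPUs

| Model | VRAM | FP16 | FP32 (TFLOPS) | FP64 | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| NVIDIA Tesla T4 | 16GB | 65.13 TFLOPS (8:1) | 8.141 | 254.4 GFLOPS (1:32) | 2560 | 320 | Turing | GDDR6 |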

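NVIDIA Tesla P-Series GPUs

| Model | VRAM | FP16 | FP32 (TFLOPS) | FP64 | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| NVIDIA Tesla P4 | 8GB | 89.12 GFLOPS (1:64) | 5.704 | 178.2 GFLOPS (1:32) | 2560 | - | Pascal | GDDR5 |
| NVIDIA Tesla P40 | 24GB | 183.7 GFLOPS (1:64) | 11.76 | 367.4 GFLOPS (1:32) | 3840 | - | Pascal | GDDR5 |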

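NVIDIA Tesla L-Series GPUs

| Model | VRAM | FP16 (TFLOPS) | FP32 (TFLOPS) | FP64 | CUDA cores | Tensor cores | Architecture | Memory type |
|---|---|---|---|---|---|---|---|---|
| NVIDIA L40 | 48GB | 90.52 | 90.52 | 1414 GFLOPS (1:64) | 18176 | 568 | Ada Lovelace | GDDR6 |
| NVIDIA L40S | 48GB | 91.61 | 91.61 | 1431 GFLOPS (1:64) | 18176 | 568 | Ada Lovelace | GDDR6 |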
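
The T-, P-, and L-series tables annotate some FP16 and FP64 figures with a ratio in parentheses, such as (8:1) or (1:32). The page does not define the notation; the sketch below assumes it expresses throughput relative to the FP32 column, which is consistent with the listed numbers. The helper name `scaled_tflops` is hypothetical and used only for illustration.

```python
def scaled_tflops(fp32_tflops: float, ratio: str) -> float:
    """Scale an FP32 figure by a ratio such as "8:1" or "1:32".

    Assumption: the ratio is read as (other precision):(FP32).
    """
    numerator, denominator = (int(part) for part in ratio.split(":"))
    return fp32_tflops * numerator / denominator


# FP32 figures and ratios copied from the tables on this page.
t4_fp16 = scaled_tflops(8.141, "8:1")            # result in TFLOPS
t4_fp64 = scaled_tflops(8.141, "1:32") * 1000    # converted to GFLOPS
l40_fp64 = scaled_tflops(90.52, "1:64") * 1000   # converted to GFLOPS

print(f"T4 FP16:  {t4_fp16:.2f} TFLOPS")
print(f"T4 FP64:  {t4_fp64:.1f} GFLOPS")
print(f"L40 FP64: {l40_fp64:.0f} GFLOPS")
```

Under that assumption the three results round to the listed 65.13 TFLOPS, 254.4 GFLOPS, and 1414 GFLOPS. Because the FP32 column is itself rounded, other rows (for example the Tesla P40) may differ from the listed value in the last digit.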