Quick Comparison
The GeForce RTX 5090 leads in gaming performance with a score of 100 versus 70 for the GeForce RTX 4070 Ti SUPER, helped by its newer Blackwell 2.0 architecture and larger 32 GB memory capacity.
The GeForce RTX 5090 also leads the GeForce RTX 4070 Ti SUPER in content creation, with a workstation score of 93 versus 72 and a higher overall score (96 versus 69).
The GeForce RTX 5090 excels in AI/ML compute with a score of 95, 36 points higher than the GeForce RTX 4070 Ti SUPER's 59, helped by its Blackwell 2.0 architecture and 32 GB of memory.
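The point gaps quoted above can be recomputed from the category scores in the table that follows. The snippet below is a minimal sketch of that arithmetic; the `scores` dictionary and the `score_gap` helper are illustrative names, not part of any benchmark tool, and the relative lead is simply the gap divided by the GeForce RTX 4070 Ti SUPER score.

```python
# Category scores as listed in the comparison table (higher is better).
# Each tuple is (GeForce RTX 4070 Ti SUPER, GeForce RTX 5090).
scores = {
    "Gaming": (70, 100),
    "Workstation": (72, 93),
    "AI/ML": (59, 95),
    "Energy Efficiency": (46, 49),
    "Quick Comparison Final Score": (69, 96),
}


def score_gap(baseline, other):
    """Return (point difference, relative lead in percent) of other over baseline."""
    diff = other - baseline
    return diff, round(100 * diff / baseline, 1)


for category, (ti_super, rtx_5090) in scores.items():
    diff, pct = score_gap(ti_super, rtx_5090)
    print(f"{category}: +{diff} points ({pct}% higher)")
```

The scores are on the site's own scale, so the percentages describe the size of the gap between scores rather than a measured speedup.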
| GeForce RTX 4070 Ti SUPER | Category | GeForce RTX 5090 |
| 70 | Gaming | 100 |
| 72 | Workstation | 93 |
| 59 | AI/ML | 95 |
| 46 | Energy Efficiency | 49 |
| 69 | Quick Comparison Final Score | 96 |
| AD103 | Graphics Processor | GB202-300 |
| 8448 | Cores | 21760 |
| 264 / 96 | TMUs / ROPs | 680 / 176 |
| 7089.35 | Blender GPU | 15010.34 |
| 150 | Forza Horizon 5 | 244 |
| 198 | The Witcher 3 | 323 |
| 220 | Counter-Strike 2 | 265 |
| 152 | Far Cry 6 | 202 |
| 118 | Hogwarts Legacy | 207 |
| 188 | Call of Duty | 327 |
| 111 | Ghost of Tsushima | 179 |
| 155 | Cyberpunk 2077 | 229 |
| 249 | Tomb Raider | 299 |
| 171 | Average FPS | 253 |
| 200 | 1080p High | 294 |
| 171 | 1080p Ultra | 253 |
| 134 | 1440p Ultra | 207 |
| 79 | 4K Ultra | 124 |
| Low | Margin of Error | Low |
| 237068 | GB6 Compute Score | 419309 |
| 331.7 img/sec | Background Blur | 333 img/sec |
| 211.6 img/sec | Face Detection | 298.5 img/sec |
| 9.63 Gpixels/sec | Horizon Detection | 21.7 Gpixels/sec |
| 13.7 Gpixels/sec | Edge Detection | 33.7 Gpixels/sec |
| 13.9 Gpixels/sec | Gaussian Blur | 34.5 Gpixels/sec |
| 2.15 Gpixels/sec | Feature Matching | 2.53 Gpixels/sec |
| 1210 Gpixels/sec | Stereo Matching | 2850 Gpixels/sec |
| 33566 FPS | Particle Physics | 59146.9 FPS |
| OpenCL | API | OpenCL |
| 31798 | G3D Mark Score | 38955 |
| 1240 | G2D Mark | 1413 |
| 277 FPS | DirectX 11 | 341 FPS |
| 119 FPS | DirectX 12 | 178 FPS |
| 18264 Ops/s | GPU Compute | 24632 Ops/s |
| 32793 | GB6 ML Single Precision | 53861 |
| 47412 | GB6 ML Half Precision | 81080 |
| 24566 | GB6 ML Quantized | 38938 |
| 12970 | Image Classification (SP) | 18948 |
| 26364 | Image Segmentation (HP) | 55109 |
| 37217 | Image Super Resolution (Q) | 58478 |
| 58720 | Face Detection (HP) | 114685 |
| 177604 | Pose Estimation (Q) | 344729 |
| 3815 | Text Classification (SP) | 4464 |
| 5829 | Machine Translation (HP) | 8184 |
| 15675 | Object Detection (SP) | 27802 |
| 62447 | Depth Estimation (Q) | 85572 |
| 357918 | Style Transfer (SP) | 687080 |
| ONNX | Framework | ONNX |
| DirectML | Backend | DirectML |
| 251 GPixel/s | Pixel Fill Rate | 424 GPixel/s |
| 689 GTexel/s | Texture Fill Rate | 1637 GTexel/s |
| 44.1 TFLOPS | FLOPS (FP32) | 104.8 TFLOPS |
| GDDR6X | Memory Type | GDDR7 |
| 16 GB | Memory Size | 32 GB |
| 1313 MHz | Memory Clock | 1750 MHz |
| 21000 Mbps | Effective Memory Speed | 28000 Mbps |
| 256-bit | Bus | 512-bit |
| No | ECC | No |
| 672.3 GB/s | Memory Bandwidth | 1792 GB/s |
| PCIe 4.0 x16 | Interface | PCIe 5.0 x16 |
| 285 W | TGP | 575 W |
| TSMC | Manufacturing | TSMC |
| 5 nm | Fabrication Process | 4 nm |
| 379 mm² | Die Size | 750 mm² |
| 45 billion | Transistor Count | 92.2 billion |
| 118.73 MTr/mm² | Transistor Density | 122.93 MTr/mm² |
| 12 | DirectX | 12.2 |
| 1.3 | Vulkan | 1.4 |
| 4.6 | OpenGL | 4.6 |
| 3 | OpenCL | 3 |
| 8.9 | CUDA Compute Capability | 12.0 |
| Yes | Ray Tracing | Yes |
| DLSS 3 | DLSS | DLSS 4 |
| 1.4a | DisplayPort | 2.1b |
| Nvidia | Vendor | Nvidia |
| Discrete | Build | Discrete |
| January 24, 2024 | Released | January 7, 2025 |
| Desktop | Case | Desktop |
| Gaming | Purpose | Gaming |
| High-end | Segment | High-end |
| Ada Lovelace | Architecture | Blackwell 2.0 |
| AD103 | GPU Codename | GB202-300 |
| Intel Core i7 14700K or above | Recommended CPU | Intel Core Ultra 9 285K or above |
| 25741 | Steel Nomad Lite Score | 52638 |
| 24222 | Time Spy | 46941 |
| 116542 | Solar Bay | 235224 |
| 15802 | Port Royal | 38113 |
| 56340 | Fire Strike | 87433 |
| 49901 | Wild Life Extreme | 108736 |
| 176609 | Night Raid | 207058 |
| 2340 MHz | Base Clock | 2017 MHz |
| 2610 MHz | Boost Clock | 2407 MHz |
| 8448 | Shading Units | 21760 |
| 264 | Texture Mapping Units (TMUs) | 680 |
| 96 | Render Output Units (ROPs) | 176 |
| 66 | Compute Units (Pipelines) | 170 |
| 264 | Tensor Cores | 680 |
| 66 | Ray-tracing Cores | 170 |
| 128 KB per cluster | L1 Cache | 128 KB per cluster |
| 48 MB shared | L2 Cache | 96 MB shared |
| 2 IPC | Instructions Per Cycle | 2 IPC |
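Several of the theoretical figures in the table (fill rates, FP32 throughput, memory bandwidth, transistor density) follow from the unit counts and clocks listed alongside them. The sketch below recomputes them as a sanity check. It assumes the usual conventions: fill rates use the boost clock, FP32 throughput counts 2 floating-point operations per shading unit per clock, and bandwidth is the effective per-pin rate times the bus width. The function names are illustrative only.

```python
def pixel_fill_rate(rops, boost_mhz):
    """Pixel fill rate in GPixel/s: ROPs x boost clock."""
    return rops * boost_mhz / 1000


def texture_fill_rate(tmus, boost_mhz):
    """Texture fill rate in GTexel/s: TMUs x boost clock."""
    return tmus * boost_mhz / 1000


def fp32_tflops(shading_units, boost_mhz):
    """FP32 throughput in TFLOPS, assuming 2 FLOPs per unit per clock."""
    return 2 * shading_units * boost_mhz / 1e6


def memory_bandwidth(effective_mbps, bus_bits):
    """Memory bandwidth in GB/s: per-pin rate (Mbps) x bus width (bits) / 8."""
    return effective_mbps * bus_bits / 8 / 1000


def transistor_density(transistors_billion, die_mm2):
    """Transistor density in MTr/mm^2."""
    return transistors_billion * 1000 / die_mm2


# Inputs taken from the table rows above.
cards = {
    "GeForce RTX 4070 Ti SUPER": dict(rops=96, tmus=264, units=8448, boost=2610,
                                      mbps=21000, bus=256, tr=45.0, die=379),
    "GeForce RTX 5090": dict(rops=176, tmus=680, units=21760, boost=2407,
                             mbps=28000, bus=512, tr=92.2, die=750),
}

for name, c in cards.items():
    print(name)
    print(f"  Pixel fill rate:    {pixel_fill_rate(c['rops'], c['boost']):.0f} GPixel/s")
    print(f"  Texture fill rate:  {texture_fill_rate(c['tmus'], c['boost']):.0f} GTexel/s")
    print(f"  FP32:               {fp32_tflops(c['units'], c['boost']):.1f} TFLOPS")
    print(f"  Memory bandwidth:   {memory_bandwidth(c['mbps'], c['bus']):.1f} GB/s")
    print(f"  Transistor density: {transistor_density(c['tr'], c['die']):.2f} MTr/mm^2")
```

With these inputs the results match the table, except that the GeForce RTX 4070 Ti SUPER bandwidth comes out at 672.0 GB/s rather than the listed 672.3 GB/s. The listed figure may have been derived from the unrounded memory clock rather than the rounded 21000 Mbps.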