The RTX 4090 comes with the first Ada Lovelace GPU of this generation: the AD102. But it's worth noting the chip used in this flagship card is not the full core, despite its already monstrous spec sheet.
Still, at its heart are 16,384 CUDA cores arrayed across 128 streaming multiprocessors (SMs). That represents a 52% increase over the RTX 3090 Ti's GA102 GPU, which was itself the full Ampere core.
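For anyone who wants to check that 52% figure, here's a quick sketch of the arithmetic. The 10,752 CUDA core count for the RTX 3090 Ti is taken from the spec table further down; the function name is just illustrative.

```python
# CUDA core counts as quoted in this article.
rtx_4090_cuda_cores = 16_384
rtx_3090_ti_cuda_cores = 10_752


def percent_increase(new: int, old: int) -> float:
    """Return the relative increase of `new` over `old`, in percent."""
    return (new - old) / old * 100


uplift = percent_increase(rtx_4090_cuda_cores, rtx_3090_ti_cuda_cores)
print(f"CUDA core uplift: {uplift:.1f}%")  # CUDA core uplift: 52.4%
```

That rounds down to the 52% quoted above.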
The full AD102 chip comprises 18,432 CUDA cores and 144 SMs. That also means you're looking at 144 third-gen RT Cores and 576 fourth-gen Tensor Cores, which I guess means there's plenty of room for an RTX 4090 Ti, or even a Titan, should Nvidia wish.
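Those full-chip numbers imply a fixed amount of hardware per SM, and you can use that to see how much of the die the RTX 4090 actually has switched on. The sketch below assumes every SM carries the same number of each unit, which is what the figures above suggest; the dictionary and variable names are my own.

```python
# Full AD102 configuration as quoted in the text.
FULL_AD102 = {
    "sms": 144,
    "cuda_cores": 18_432,
    "rt_cores": 144,
    "tensor_cores": 576,
}
RTX_4090_SMS = 128  # SMs enabled on the RTX 4090

# Units per SM, assuming an even split across all SMs.
per_sm = {
    unit: count // FULL_AD102["sms"]
    for unit, count in FULL_AD102.items()
    if unit != "sms"
}

# Scale the per-SM figures up to the RTX 4090's 128 SMs.
rtx_4090 = {unit: n * RTX_4090_SMS for unit, n in per_sm.items()}

enabled_fraction = RTX_4090_SMS / FULL_AD102["sms"]

print(per_sm)    # {'cuda_cores': 128, 'rt_cores': 1, 'tensor_cores': 4}
print(rtx_4090)  # {'cuda_cores': 16384, 'rt_cores': 128, 'tensor_cores': 512}
print(f"{enabled_fraction:.1%} of SMs enabled")  # 88.9% of SMs enabled
```

So the RTX 4090 leaves 16 SMs, roughly 11% of the full chip, on the table.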
Memory hasn't changed much: it's again 24GB of GDDR6X running at 21Gbps, which delivers 1,008GB/sec of memory bandwidth.
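The bandwidth figure follows from the per-pin data rate multiplied by the width of the memory bus. A minimal sketch, assuming a 384-bit bus (the bus width isn't stated in this article, so treat it as an assumption; the function name is illustrative):

```python
def memory_bandwidth_gb_s(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s.

    data_rate_gbps: per-pin data rate in gigabits per second.
    bus_width_bits: total memory bus width in bits.
    """
    # Multiply out total gigabits per second, then divide by 8 for gigabytes.
    return data_rate_gbps * bus_width_bits / 8


# 21Gbps GDDR6X on an assumed 384-bit bus.
bandwidth = memory_bandwidth_gb_s(21, 384)
print(f"{bandwidth:,.0f}GB/s")  # 1,008GB/s
```

Same memory speed and same bus width means the same bandwidth as the RTX 3090 Ti.
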
Specification | GeForce RTX 4090 | GeForce RTX 3090 Ti |
---|---|---|
Lithography | TSMC 4N | Samsung 8N |
CUDA cores | 16,384 | 10,752 |
SMs | 128 | 84 |
RT Cores | 128 | 84 |
Tensor Cores | 512 | 336 |
ROPs | 176 | 112 |
Boost clock | 2,520MHz | 1,860MHz |
Memory | 24GB GDDR6X | 24GB GDDR6X |
Memory speed | 21Gbps | 21Gbps |
Memory bandwidth | 1,008GB/s | 1,008GB/s |
L1 / L2 cache | 16,384KB / 73,728KB | 10,752KB / 6,144KB |
Transistors | 76.3 billion | 28.3 billion |
Die Size | 608.5mm² | 628.5mm² |
TGP | 450W | 450W |