Saturday, September 24, 2022
HomeComputer HardwareNvidia Reveals Ada Lovelace GPU Secrets and techniques: Excessive Transistor Counts at...

Nvidia Reveals Ada Lovelace GPU Secrets and techniques: Excessive Transistor Counts at Excessive Clocks


When Nvidia launched its Ada Lovelace household of graphics processing models earlier this week, it primarily targeted on its top-of-the-range AD102 GPU and its flagship GeForce RTX 4090 graphics card. It did not launch too many particulars about its AD103 and AD104 graphics chips. Thankfully, Nvidia uploaded its Ada Lovelace whitepaper immediately that comprises a great deal of information concerning the new GPUs and fills in lots of gaps. We have up to date the RTX 40-series GPUs every part we all know hub with the brand new particulars, however this is the overview of the brand new and attention-grabbing data.

Large GPUs for Large Gaming 

We already know that Nvidia’s range-topping AD102 is a 608-mm^2 GPU containing 76.3 billion transistors, 18,432 CUDA cores, and 96MB of L2 cache. We now additionally know that AD103 is a 378.6 mm^2 graphics processor that includes 45.9 billion transistors, 10,240 CUDA cores, and 64MB L2 cache. As for the AD104, it has a die measurement of 294.5 mm^2, 35.8 billion transistors, 7680 CUDA cores, and 48MB of L2.

Nvidia Ada Specs vs. Ampere
GPU/Graphics Card Full AD102 RTX 4090 RTX 4080 16GB RTX 4080 12GB RTX 3090 Ti
Structure AD102 AD102 AD103 AD104 GA102
Course of Know-how TSMC 4N TSMC 4N TSMC 4N TSMC 4N Samsung 8LPP
Transistors (Billion) 76.3 76.3 45.9 35.8 28.3
Die measurement (mm^2) 608 608 378.6 294.5 628.4
Streaming Multiprocessors 144 128 76 60 84
GPU Cores (Shaders) 18432 16384 9728 7680 10752
Tensor Cores 576 512 320 240 336
Ray Tracing Cores 144 144 80 60 84
TMUs 512 512 304? 240 336
ROPs 192 192 112 80 112
L2 Cache (MB) 96 96 64 48 6
Enhance Clock (MHz) ? 2520 2505 2600 1860
TFLOPS FP32 (Enhance) ? 82.6 48.7 40.1 40.0
TFLOPS FP16 (FP8) ? 661 (1321) 390 (780) 319 (639) 320 (N/A)
TFLOPS Ray Tracing ? 191 113 82 78.1
Reminiscence Interface (bit) 384 384 256 192 384
Reminiscence Pace (GT/s) ? 21 22.4 21 21
Bandwidth (GBps) ? 1008 736 504 1008
TDP (watts) ? 450 320 285 450
Launch Date ? Oct 12, 2022 Nov 2022? Nov 2022? Mar 2022
Launch Value ? $1,599 $1,199 $899 $1,999

One of many attention-grabbing issues that Nvidia tells in its whitepaper is that Ada Lovelace GPUs use high-speed transistors in essential paths to spice up most clock speeds. In consequence, its fully-enabled AD102 GPU with 18,432 CUDA cores is ”able to working at clocks over 2.5 GHz, whereas sustaining the identical 450W TGP.” Maintaining this in thoughts, we’re not stunned that the corporate is speaking about 3.0 GHz clocks for the GeForce RTX 4090 (with 16,384 CUDA cores) reached in its labs. At 3.0 GHz, the GeForce RTX 4090 will completely headline our listing of the finest graphics playing cards round. 

(Picture credit score: Nvidia)

Along with excessive clocks, Nvidia’s Ada Lovelace GPU additionally boast huge L2 caches that enhance efficiency in compute intensive workloads (e.g., ray tracing, path tracing, simulations, and so on.) and reduces reminiscence bandwidth necessities. Primarily, Nvidia’s Ada GPUs take a web page from RDNA 2 Infinity Cache’s e book right here, though we imagine that basic targets for the brand new structure had been set nicely earlier than AMD’s Radeon RX 6000-series merchandise debuted in 2020. 

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments