News: EVGA Teases GeForce RTX 3090 FTW3 With a 2.1GHz Manual Overclock

The first screenshot is the settings screen with the sliders; the second is the card running. I think the 5520MHz memory clock is a normal mode. Remember that GDDR6X gives double the bandwidth? So that's 4 times regular GDDR...

No typo here; it's GDDR6X mode.
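For anyone wondering about the bandwidth math: peak bandwidth is just the per-pin data rate times the bus width. A quick sketch using published specs (19.5Gbps/384-bit GDDR6X on the 3090 vs. 14Gbps/352-bit GDDR6 on the 2080 Ti):

```python
# Peak memory bandwidth = per-pin data rate (Gbps) * bus width (bits) / 8.
# GDDR6X's PAM4 signaling moves 2 bits per transfer instead of 1, which is
# where the "double the bandwidth" per pin comes from.

def bandwidth_gb_s(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak bandwidth in GB/s."""
    return data_rate_gbps * bus_width_bits / 8

print(bandwidth_gb_s(19.5, 384))  # RTX 3090, GDDR6X: ~936 GB/s
print(bandwidth_gb_s(14.0, 352))  # RTX 2080 Ti, GDDR6: ~616 GB/s
```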
 
Not sure why 2105MHz is so amazing when my 2080 Super can hit 2100MHz max with MSI Afterburner? These are supposed to be faster than the 2000 series.
 
Not sure why 2105MHz is so amazing when my 2080 Super can hit 2100MHz max with MSI Afterburner? These are supposed to be faster than the 2000 series.

Well, first off, it's a new, unknown node, so no one knows what to expect.
Second, you're looking at a 3090 with 28 billion transistors, whereas the 2080S had 13.6 billion.
Third, the 2080S has a 250W TDP, while the 3090 is 350W.
Fourth, it has 24GB of VRAM.

So basically you have about 2.06x the transistor count, 3x the memory (faster/hotter, at that), with 40% more power usage/heat generation despite the node shrink. And on air, still hitting 2105MHz? If it's even true...it would be an incredible card.
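Spelling those ratios out from the spec-sheet numbers:

```python
# Ratios from the published specs: 28B vs 13.6B transistors,
# 24GB vs 8GB VRAM, 350W vs 250W TDP.
print(28 / 13.6)      # ~2.06x the transistor count
print(24 / 8)         # 3x the memory
print(350 / 250 - 1)  # 40% more power
```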

Remember...the stock boost clock of the 3090 is just 1700MHz, while the 2080S is 1815MHz. So this would be a higher percentage increase. You said your card hit 2100MHz; that's just a 15.7% increase over the stock boost clock. With this card, it would be a 23.8% increase.
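If you want to sanity-check those percentages (stock boost clocks as listed above):

```python
# Overclock headroom as a percentage over the stock boost clock.

def headroom_pct(oc_mhz: float, stock_boost_mhz: float) -> float:
    return (oc_mhz / stock_boost_mhz - 1) * 100

print(f"2080 Super: {headroom_pct(2100, 1815):.1f}%")  # 15.7%
print(f"RTX 3090:   {headroom_pct(2105, 1700):.1f}%")  # 23.8%
```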

I should note...this type of boost clock will NOT be sustainable on the RTX 3090 Founders Edition or reference board design. Maximum power draw for those cards would be 375W (150W per 8-pin connector, x2 = 300W, plus 75W from the PCIe slot). So you have very limited room for overclocking. If what we see regarding the 2100MHz clock is correct, it would be from one of the 3x 8-pin connector card designs. The Asus STRIX, for example, is rated for a 400W TDP. And actually, taking a look at my chart, the entire FTW3 line (FTW3, FTW3 Ultra, FTW3 Hybrid, FTW3 Waterblock) uses 3x 8-pin connector designs. We know the Asus ROG Strix line has 22 power stages, and the EVGA Kingpin card has 23, while the Founders Edition and reference boards (any of the 2x 8-pin designs) have only 18. So the FTW3 line likely also has a 22-23 power stage design, which would allow these clocks.

Believe me...this kind of clock on a 3090 isn't easy, because the Nvidia reference/Founders design has severely hampered its available power. While previous cards like the 1080 Ti or 2080 Ti had 20% more power available to them past the TDP, the 3090 only has 7.1% more. That means lower clocks, and even lower sustained clocks once the card heats up.
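Here's that power math as a quick sketch. The per-connector figures are the PCIe spec limits (75W from the slot, 150W per 8-pin, 75W per 6-pin); the ~20% figure for older cards lines up with a 1080 Ti FE's 8+6 pin layout:

```python
# Board power ceiling from the connector configuration, and the resulting
# overclocking headroom over the card's TDP.

def max_board_power_w(num_8pin: int, num_6pin: int = 0) -> int:
    # 75 W from the PCIe slot, 150 W per 8-pin, 75 W per 6-pin (spec limits)
    return 75 + 150 * num_8pin + 75 * num_6pin

def oc_headroom_pct(max_power_w: float, tdp_w: float) -> float:
    return (max_power_w / tdp_w - 1) * 100

# GTX 1080 Ti FE (8+6 pin, 250 W TDP): 300 W ceiling -> ~20% headroom
print(oc_headroom_pct(max_board_power_w(1, 1), 250))  # 20.0
# RTX 3090 FE/reference (2x 8-pin, 350 W TDP): 375 W ceiling -> ~7.1%
print(oc_headroom_pct(max_board_power_w(2), 350))     # ~7.1
# A 3x 8-pin AIB design like the FTW3 raises the ceiling to 525 W
print(max_board_power_w(3))                           # 525
```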

So again...if these clocks are on air, it will be absolutely amazing.
 
I'm sure it'll use 450+ watts too; there are already reports that AIB 3090s have 3x 8-pin power connectors... Guess Nvidia consumers don't care about efficiency anymore, when it used to be a main part of their battle cry.

P.S. Don't take a shortcut in thinking, commit a false-dichotomy fallacy, and assume any criticism of Nvidia automatically makes me an AMD fan... as if there are only two choices in thought.
 
I'm sure it'll use 450+ watts too; there are already reports that AIB 3090s have 3x 8-pin power connectors... Guess Nvidia consumers don't care about efficiency anymore, when it used to be a main part of their battle cry.

P.S. Don't take a shortcut in thinking, commit a false-dichotomy fallacy, and assume any criticism of Nvidia automatically makes me an AMD fan... as if there are only two choices in thought.

Reference/Founders Edition cards can't surpass 375W. But I'm curious...how are you getting a lack of efficiency? The cards are literally performing that much more. At the same performance level as an RTX 2080, it wins out at 1.9x the performance per watt. So based on their graph, where the RTX 2080 got to 60fps with 240W, the RTX 3080 got to that same 60fps with 126W. Of course, that's not entirely fair, because you're comparing a card running at max clocks (least efficient) against one with a limited load (most efficient). But continue that curve and you end up with the 3080 hitting 105fps at 320W.

So in that example, compared to the 2080, it put out 75% more frames per second, at a 33% increase in power. Or in other words...an increase in performance per watt of 31%. Let me break it down a bit based on that chart:

  • RTX 2080: 4 watts for each frame per second (240W / 60fps)
  • RTX 3080: ~3 watts for each frame per second (320W / 105fps)
  • RTX 3080, capped to match the 2080's 60fps: just 2.1 watts for each frame per second (126W / 60fps)

So at the same level of performance, it uses 2.1W instead of 4W. At maxed-out performance, it uses 3W instead of 4W. You're bringing up total power usage, which has nothing to do with efficiency. If they had doubled the number of cores and VRAM, you'd be looking at a 700W TDP. That wouldn't make it inefficient.
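If you want to check my math, here's a quick sketch using the chart figures quoted above (the 105fps/320W point is the extrapolation I mentioned, so treat it as approximate):

```python
# Watts per frame per second, from the chart figures above.

def watts_per_fps(watts: float, fps: float) -> float:
    return watts / fps

print(watts_per_fps(240, 60))   # RTX 2080 at max load: 4.0 W/fps
print(watts_per_fps(320, 105))  # RTX 3080 at max load: ~3.0 W/fps
print(watts_per_fps(126, 60))   # RTX 3080 capped at 60 fps: 2.1 W/fps

# Perf-per-watt gain at max load: (105/60) / (320/240) - 1 ≈ 31%
print((105 / 60) / (320 / 240) - 1)
```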

I'll give you some more details here. This is made on an 8nm node from Samsung, which is just a refined 10nm. Samsung is also far worse than TSMC. The RTX 3080, which has a TDP of 320W, is an approximately 627mm² die with 28 billion transistors. The Radeon VII is made on TSMC's superior 7nm node. It's also using HBM2 memory, which uses less power than GDDR6X. The die on that card is just 331mm² with a total of 13.2 billion transistors, yet it has a TDP of 300W. And performance? Even less than a 240W RTX 2080 on average across all games.

I'm also seriously doubting AMD's ability to compete against these cards, because for the first time in a long time, Nvidia actually put out a massive GPU die on a brand-new node. Normally it likes to milk people. AMD wouldn't be able to outperform Nvidia directly, so it'd have to go the chiplet route. The problem there is...even if you do get a good solution similar to how it works on their CPUs, the interconnect that lets the chiplets act as one giant chip uses a lot of power too, as is the case on the CPU front right now.

  • Nvidia went from 13.6B transistors to 28B (about 1.70x more when you account for disabled cores)
  • Nvidia went from 60fps to 105fps (about 1.75x performance)
  • Nvidia went from 240W to 320W (about 1.33x more power use)

So the card is not inefficient; it's more efficient. It just happens to be very performant. And in fact, it's more efficient and more performant than an AMD card on a better/more efficient node with more efficient memory. And it's ahead in that category by a mile.
 
This is made on an 8nm node from Samsung, which is just a refined 10nm. Samsung is also far worse than TSMC.

Is Samsung worse than TSMC in practice, though? From what I've read, Samsung 8LPP has more relaxed design rules than TSMC 7N. That might allow you to pack in more transistors than a denser (on paper) but more restrictive process. If your die size for the 3080 is correct, it has around 45 million transistors per mm². Navi, meanwhile, only has 41 million transistors per mm².
 
Is Samsung worse than TSMC in practice, though? From what I've read, Samsung 8LPP has more relaxed design rules than TSMC 7N. That might allow you to pack in more transistors than a denser (on paper) but more restrictive process. If your die size for the 3080 is correct, it has around 45 million transistors per mm². Navi, meanwhile, only has 41 million transistors per mm².

Well, I know when Apple had Samsung make some of their chips alongside TSMC, the Samsung-made ones had higher power usage and throttled more, so they went into the cheaper iPhone versions. The situation might be different with high-performance chips, but that seems unlikely. Samsung is a cheaper option than TSMC. So if it performed as well as or better than TSMC, why would the Ampere A100 cards be made by TSMC?

Samsung’s fab has a lot of issues. Nvidia used them only because it gambled on signing with Samsung instead of TSMC, and TSMC ended up booked solid. Samsung wasn’t chosen for quality or performance; it’s what Nvidia was unfortunately stuck with.

The A100 has 54.2 billion transistors on TSMC 7nm at 826mm². The Samsung-fabbed GA102 has 28 billion transistors in 627mm².

Those numbers alone say it all.

So just as a comparison:

Samsung 8nm (Ampere RTX 3090) = 44.6 million transistors per mm²

TSMC 7nm (Ampere A100 HPC) = 65.6 million transistors per mm²

So TSMC would have 47% more transistors per mm². Or in other words, instead of a 627mm² die on Samsung 8nm, it could have been a ~427mm² die. Now, I know there’s not a linear power reduction from node shrinks at these levels, but it would still have been quite significant.
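For anyone checking the math, a quick sketch (die sizes are approximate, as noted above):

```python
# Transistor density and the hypothetical die-size comparison.

def density_mtr_per_mm2(transistors_bn: float, area_mm2: float) -> float:
    """Millions of transistors per mm²."""
    return transistors_bn * 1000 / area_mm2

ga102 = density_mtr_per_mm2(28.0, 627)   # Samsung 8nm: ~44.7 MTr/mm²
ga100 = density_mtr_per_mm2(54.2, 826)   # TSMC 7nm:    ~65.6 MTr/mm²

print(ga100 / ga102 - 1)    # ~47% denser on TSMC 7nm
print(28.0 * 1000 / ga100)  # GA102 shrunk to TSMC 7nm density: ~427 mm²
```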
 
Samsung 8nm (Ampere RTX 3090) = 44.6 million transistors per mm²

TSMC 7nm (Ampere A100 HPC) = 65.6 million transistors per mm²

The A100 has a massive amount of cache. As we know, memory cells are packed more tightly. The two chips aren't comparable.

And of course, transistors are no good when unused. Nvidia has to disable 15% of the GA100's cores to reach acceptable yield. Now we don't know how many 3080s Nvidia has to make before they get one 3090. The fact that the two cards are launched concurrently does hint at excellent yield.
 
The A100 has a massive amount of cache. As we know, memory cells are packed more tightly. The two chips aren't comparable.

And of course, transistors are no good when unused. Nvidia has to disable 15% of the GA100's cores to reach acceptable yield. Now we don't know how many 3080s Nvidia has to make before they get one 3090. The fact that the two cards are launched concurrently does hint at excellent yield.

Remember, the 3090 isn’t a full-fat die either; it has SMs disabled. I’d expect a similar situation as with the Pascal Titan X and Titan Xp: an eventual refresh with a few percent more cores and the full stack of GDDR6X, bumping it up to 48GB @ 21Gbps.

As for the other part, you’d know more about cache transistor density than I would, so I’ll yield to you on the specifics of the comparison. But with your knowledge, would you not say TSMC’s 7nm appears to be significantly better than Samsung’s 8nm, even if not to the degree I stated?
 
"The RTX 3090 already looks like it will deliver incredible performance. Nvidia says it's 50% faster than the outgoing RTX 2080 Ti "

Oh dear!! -->

"3090 = $1499+ for 50% > 2080Ti"
3080 = $699+ for c25-30% > 2080Ti

So, Nvidia asking buyer to cough up an extra $799+ to get and extra c20-25% over the 3080 (i.e. that's including that extra memory) !!!

3090 price-to-performance is shockingly insane in comparison to 3080 !!!
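Just to put numbers on that, indexing the 2080 Ti at 100 and taking the midpoint of the ~25-30% estimate (rough figures, obviously):

```python
# Relative performance per dollar, with the 2080 Ti indexed at 100.

def perf_per_dollar(rel_perf: float, price_usd: float) -> float:
    return rel_perf / price_usd

rtx_3080 = perf_per_dollar(127.5, 699)   # midpoint of the ~25-30% figure
rtx_3090 = perf_per_dollar(150.0, 1499)

print(rtx_3080 / rtx_3090)  # ~1.8x: the 3080 gives ~80% more perf per dollar
```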
 
Not sure why 2105MHz is so amazing when my 2080 Super can hit 2100MHz max with MSI Afterburner? These are supposed to be faster than the 2000 series.
Having 10,000 CUDA cores @ 2.1GHz is a little different than having 3,000 CUDA cores @ 2.1GHz. 😉
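To put numbers on it: peak FP32 throughput scales with cores x 2 FLOPs (FMA) x clock. Using the actual core counts behind those round numbers (10496 for the 3090, 3072 for the 2080 Super):

```python
# Peak FP32 throughput in TFLOPS: cores * 2 (one FMA = 2 FLOPs) * clock.

def fp32_tflops(cuda_cores: int, clock_ghz: float) -> float:
    return cuda_cores * 2 * clock_ghz / 1000

print(fp32_tflops(3072, 2.1))   # 2080 Super at 2.1 GHz: ~12.9 TFLOPS
print(fp32_tflops(10496, 2.1))  # RTX 3090 at 2.1 GHz:  ~44.1 TFLOPS
```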
 
But with your knowledge, would you not say TSMC’s 7nm appears to be significantly better than Samsung’s 8nm, even if not to the degree I stated?

I think it depends on what you're making. If you're making something like a Ryzen, where you're packing 36MB of cache into 74mm², Samsung would lose hands down. If you're making a gaming GPU, then perhaps the greater flexibility in layout is a serious advantage.
 
"The RTX 3090 already looks like it will deliver incredible performance. Nvidia says it's 50% faster than the outgoing RTX 2080 Ti "

Oh dear!! -->

"3090 = $1499+ for 50% > 2080Ti"
3080 = $699+ for c25-30% > 2080Ti

So, Nvidia asking buyer to cough up an extra $799+ to get and extra c20-25% over the 3080 (i.e. that's including that extra memory) !!!

3090 price-to-performance is shockingly insane in comparison to 3080 !!!

There's nothing really new there; this has always been the Nvidia way.

Want the highest-end card currently available?

It's going to be Nvidia, and it's also going to be insanely priced.

However, this time there will actually be a fairly valid reason for people to buy it. 4K 60Hz+? Get saving.
 
When I woke up this morning, I suddenly remembered that GlobalFoundries had licensed Samsung's tech for 12nm. That means we have witnessed transitions in both directions. If we compare the uplift in each case, we can get a good sense of whose approach is better.

Polaris (GF 12nm) → Navi (TSMC 7nm):
RX 590: 2304 cores on 232 mm², 7.1 TFLOPS, 225 W
RX 5700 XT: 2560 cores on 251 mm², 9.6 TFLOPS, 225 W

Turing (TSMC 12nm) → Ampere (Samsung 8nm):
RTX 2080 Ti: 4352 cores on 754 mm², 13.5 TFLOPS, 250 W
RTX 3090: 10496 cores on 628 mm², 35.6 TFLOPS, 350 W

I think the only reasonable conclusion to draw from these numbers is that Samsung's process is better suited to GPUs. Losing that advantage offset the gain from a smaller node in AMD's case. Nvidia, on the other hand, gained that advantage plus the benefit of a shrink.
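Here's the perf-per-watt uplift in each transition as a quick sketch (theoretical board TFLOPS against typical board power, so treat it as directional rather than exact):

```python
# Efficiency uplift (TFLOPS per watt) across each node transition above.

def tflops_per_watt(tflops: float, watts: float) -> float:
    return tflops / watts

# Polaris (GF 12nm) -> Navi (TSMC 7nm)
amd_uplift = tflops_per_watt(9.6, 225) / tflops_per_watt(7.1, 225)
# Turing (TSMC 12nm) -> Ampere (Samsung 8nm)
nv_uplift = tflops_per_watt(35.6, 350) / tflops_per_watt(13.5, 250)

print(f"AMD:    {amd_uplift:.2f}x")  # ~1.35x from a full node shrink
print(f"Nvidia: {nv_uplift:.2f}x")   # ~1.88x from a smaller nominal shrink
```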