News Nvidia RTX 4060 Is Allegedly up to 20% Faster Than RTX 3060 in Geekbench

D

Deleted member 2731765

Guest
FIRST !

Btw, a 20% boost in performance at a lower price does sound good but that's only if the 3060 Ti doesn't end up being priced close to $300 USD. The 3060 Ti might eat up sales of the RTX 4060 since it offers a wider memory bus.

 
Last edited by a moderator:
D

Deleted member 2731765

Guest
On some Off Topic note, regarding the Ti SKU variant.

For GAME emulation, 128-bit mem bus width creates a huge bottleneck. YUZU developers found the GeForce RTX 4060 Ti to be a serious downgrade for emulation purposes compared to the older GeForce RTX 3060 Ti.

According to them. when using the NVIDIA GeForce RTX 4060 Ti for Switch emulation, you will get slower performance compared to the RTX 3060 Ti due to its narrower 128-bit bus. So this RTX 4060 non-Ti shouldn't be an exception.

Despite the higher L2 cache, using a 2x upscale easily eats up vast pools of cache and that leaves you with the 128-bit wide bus interface which is a downgrade versus the previous Ti offering.


We don’t understand what kind of decisions NVIDIA took when deciding the Ada Lovelace GeForce product stack, but it has been nothing but mistakes.

The RTX 4060 Ti 8GB with only a 128-bit wide memory bus and GDDR6 VRAM is a serious downgrade for emulation when compared to its predecessor, the 256-bit wide equipped RTX 3060 Ti. You will be getting slower performance in Switch emulation if you get the newer product.

We have no choice but to advise users to stick to Ampere products if possible, or aim higher in the product stack if you have to get a 4000 series card for some reason (DLSS3 or AV1 encoding), which is clearly what NVIDIA is aiming for.

The argument in favour of Ada is the increased cache size, which RDNA2 confirmed in the past helps with performance substantially, but it also has a silent warning no review mentions: if you saturate the cache, you’re left with the performance of a 128-bit wide card, and it’s very easy to saturate the cache when using the resolution scaler — just 2X is enough to tank performance.

Spending 400 USD on a card that has terrible performance outside of 1X scaling is, in our opinion, a terrible investment, and should be avoided entirely. We hope the 16GB version at least comes equipped with GDDR6X VRAM, which would increase the available bandwidth and provide an actual improvement in performance for this kind of workload.
via Yuzu
 
On some Off Topic note, regarding the Ti SKU variant.

For GAME emulation, 128-bit mem bus width creates a huge bottleneck. YUZU developers found the GeForce RTX 4060 Ti to be a serious downgrade for emulation purposes compared to the older GeForce RTX 3060 Ti.

According to them. when using the NVIDIA GeForce RTX 4060 Ti for Switch emulation, you will get slower performance compared to the RTX 3060 Ti due to its narrower 128-bit bus. So this RTX 4060 non-Ti shouldn't be an exception.

Despite the higher L2 cache, using a 2x upscale easily eats up vast pools of cache and that leaves you with the 128-bit wide bus interface which is a downgrade versus the previous Ti offering.

I wonder if emulation somehow doesn't get the cache hit rates of regular gaming? Or perhaps it's something to do with the emulation code. I mean, L2 cache is L2 cache. You go from 4MB L2 (3060 Ti) to 36MB L2 (4060 Ti) and you should get substantially higher hit rates. Higher hit rates means fewer memory accesses means higher effective bandwidth.

Given nearly all emulation falls in a gray area (the emulators are fine, the ROMs they run are 99.9999% pirated downloads, even if you originally owned the software), I wouldn't put too much stock in how they run compared to native games. Other settings (scaling) may also be a factor and might be doing things in a way that somehow negates cache benefits.
 

LolaGT

Reputable
Oct 31, 2020
284
256
5,090
We(the consumer) get what we deserve.

The number one thing nvidia has learned over the last couple years is that the cash grab works.
As long as nvidia fangirlz buy, they have no need to do anything more.
 
Pondering over the table's Vulkan API scores, it looks like the new RTX 4060 is about 12% faster than the RTX 3060. In Open CL tests the difference is better, with the new Ada Lovelace card being very nearly 20% faster.
I would assume the Vulcan score might be more on-point for 1080p game performance. That's about on par with the mediocre performance gains that the 4060 Ti saw over the 3060 Ti at that resolution. And the 1440p gains will likely be lower still, again much like what was seen with the 4060 Ti, especially since the 3060 had more VRAM in addition to its faster bus. Going forward, I wouldn't be surprised if the 3060 outperforms the 4060 at 1440p in many future games with higher VRAM requirements. Apparently Nvidia is trying to push $300-$400 as the new price point for mid-range 1080p.

Compared to their similarly-named and priced predecessors, the 4060 Ti saw a reduction of around 11% of its cores and 36% of its memory bandwidth compared to the 3060 Ti, while the 4060 sees a reduction of around 14% of its cores and 24% of its memory bandwidth, in addition to a loss of 33% of its VRAM. And sure, they've moved to a smaller process node that likely costs more, but the graphics chips used in each of these cards are roughly half the size of those used in their predecessors. It's likely that Nividia originally designed the 4060 Ti as a 3060 successor, and the 4060 as a 3050 successor, but decided they could get away with big price hikes for their lineup by shifting around product names yet again.
 
D

Deleted member 2731765

Guest
I wonder if emulation somehow doesn't get the cache hit rates of regular gaming? Or perhaps it's something to do with the emulation code.

It must definitely be the emulation code. I presume the Resolution Scaler mode in Yuzu doesn't play nice with the extra L2 cache. Without the scaler, there won't be any issues as such, imo.

Because it is very easy to fill up the cache pool when running a resolution scaler. Using a 2x upscale will easily eat up vast pools of cache, and, since scaling affects memory bandwidth the most, so the performance number is VRAM clock speed x VRAM bus width.

The extra cache on the Nvidia card is getting mitigated with the scaling.
 
Last edited by a moderator:
My Geekbench 6 score for my 4.5 year old RTX 2060 (6 GB) is 85,781 for OpenCL and 78,490 for Vulkan. i.e. the RTX 4060 8GB, with 4.5 years of development and launching at roughly the same price, is 23% faster for OpenCL and 27% faster for Vulkan. Incredible! But we do get a whopping 2GB VRAM increase. Remember, it's been four and half years since the RTX 2060.
 
Last edited:
Here's all you need: Forget all the RX cards, toss the 4060 4050 Ti into the trash can and get yourself a 3060 Ti.
I'am building the lowest power consumption gaming pc . Even a fan will be disable. max 100W entire system idle 28W max 5w CPU + 6w chipset + 2/4w graphics 10w other things (one exaust fan corsair ml 120 running at 430 RPM) The rx 7600 or 4060 will be locked 60fps max.
 

mwm2010

Prominent
May 3, 2023
58
27
560
I like DLSS and RT, but i'm stuck with Radeon 6000(not that that's a bad thing) until a non-piece of crap NVIDIA midrange GPU comes out with 12 GB VRAM. I know I could just get a 4070 or a 3060, but those aren't appetizing to me.
Sigh...