News: Nvidia GeForce Driver Promises Doubled Stable Diffusion Performance

The 2x performance claim is certainly interesting, given that we can already get roughly a 50% uplift with SD using hardware-specific optimized models.

Though it looks like this optimized version has to be run through an ONNX pipeline, which the SD webui doesn't support out of the box as of now. I'm also interested in the relative memory requirements of the optimized pipeline versus the current SD pipeline, since they haven't mentioned that.
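For anyone who wants to poke at an ONNX path outside the webui in the meantime, here's a rough sketch using Hugging Face's Optimum wrapper around ONNX Runtime. To be clear, this is just a generic ONNX Runtime pipeline, not the Olive-optimized model Nvidia is describing, and the model ID and execution provider below are assumptions.

```python
from optimum.onnxruntime import ORTStableDiffusionPipeline

# Export the PyTorch weights to ONNX on the fly and run them through ONNX Runtime.
# The provider depends on which onnxruntime build you have (CUDA vs. DirectML).
pipe = ORTStableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # assumed checkpoint; swap in whatever you use
    export=True,
    provider="CUDAExecutionProvider",
)

image = pipe(
    "a photo of an astronaut riding a horse",
    num_inference_steps=50,
).images[0]
image.save("astronaut.png")
```

Running something like this side by side with the regular diffusers pipeline would also give a rough answer to the memory question above.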

Somewhat off topic, and not directly related to this news, but for anyone interested in doing some preliminary testing of generative AI models, an Olive-optimized version of the Dolly 2.0 large language model is now available on Hugging Face.
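If you just want to kick the tires on Dolly 2.0, the standard (non-Olive) checkpoint loads with the regular transformers pipeline, roughly as below; the 12B model ID follows the Databricks model card, and you'll need a GPU with enough VRAM for it in bf16.

```python
import torch
from transformers import pipeline

# Standard Dolly 2.0 checkpoint from Hugging Face (not the Olive-optimized build
# mentioned above). trust_remote_code pulls in Databricks' instruction pipeline.
generate = pipeline(
    model="databricks/dolly-v2-12b",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
)

print(generate("Explain what an ONNX execution provider is.")[0]["generated_text"])
```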

NVIDIA's NeMo LLM for conversational AI is also coming soon, although there is no ETA.


It'd be interesting to see whether these older-architecture cards that don't have any Tensor cores see a similar 2x performance improvement in ML acceleration.

I don't think the older GTX cards can get a 2x performance uplift the way the RTX GPUs can, since they lack the dedicated hardware for it.
 
Looks like I have something else to try and figure out. Gotta talk to Nvidia and see what exactly is needed to get this speedup with Automatic1111's WebUI.

As for the older GTX cards, I'm also curious if there's any benefit from these drivers. Not even 2X, but maybe 10-25% faster? But the last time I tried Automatic1111 on a GTX 1660 Super, it was horribly slow — far slower than it ought to be!



Considering the RX 6600 manages over two images per minute, I'd expect a GTX 1660 Super to at least be able to do maybe one per minute. Last time I tried, I think I got about one 512x512 image every two and a half minutes!
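If anyone wants to get a rough images-per-minute number on their own card for comparison, a quick-and-dirty timing loop with the stock diffusers pipeline looks something like the sketch below. The model, step count, and prompt are assumptions, not necessarily the settings behind the numbers above.

```python
import time
import torch
from diffusers import StableDiffusionPipeline

# Quick throughput check with the stock fp16 SD 1.5 pipeline.
# On GTX 16-series cards you may need torch_dtype=torch.float32 to avoid black images.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

prompt = "a city skyline at sunset, highly detailed"
pipe(prompt, height=512, width=512, num_inference_steps=50)  # warm-up pass

n = 5
start = time.time()
for _ in range(n):
    pipe(prompt, height=512, width=512, num_inference_steps=50)
elapsed = time.time() - start

print(f"{n * 60 / elapsed:.2f} images per minute")
```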
 
If you figure it out, I for one would love a "how to" article so I can follow along at home.
 
The training speed will actually depend on many other factors as well. But I will let Jarred confirm this.
 
I tested it on my GTX 1080: installed the new drivers and optimized the ONNX model, but the generation time didn't change. Testing at 512x512, 50 steps, 1 image, it was 24 seconds before and 24 seconds after.

I created a question on GitHub; hopefully someone there can explain why there's no change, or whether the model just doesn't improve without RT cores.
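One thing that might be worth checking alongside the GitHub question is whether ONNX Runtime actually bound the model to a GPU execution provider, or silently fell back to something slower. A minimal check, with a placeholder path for wherever Olive wrote the optimized UNet, could look like this:

```python
import onnxruntime as ort

# Which providers this onnxruntime build supports at all (CUDA, DirectML, CPU, ...).
available = ort.get_available_providers()
print("Available providers:", available)

# Load one of the exported models and see which provider it actually bound to.
# "optimized/unet/model.onnx" is just a placeholder path.
preferred = ["DmlExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"]
session = ort.InferenceSession(
    "optimized/unet/model.onnx",
    providers=[p for p in preferred if p in available],
)
print("Session providers:", session.get_providers())
```

If the session only reports the CPU provider, that alone would point at where the time is going.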

 