News Tesla's Dojo system-on-wafer is in production — a serious processor for serious AI workloads

Admin · May 2, 2024

Additional details about Tesla's Dojo SoW emerged at TSMC's North American Technology Symposium.

Tesla's Dojo system-on-wafer is in production — a serious processor for serious AI workloads : Read more

subspruce · May 2, 2024

pipe dream lmao, did you know the last company that tried to make wafer-scale processing? Cerberas and they failed hard to make anything, yield is just too low for something that huge

bit_user · May 2, 2024

subspruce said:
pipe dream lmao, did you know the last company that tried to make wafer-scale processing? Cerberas and they failed hard to make anything, yield is just too low for something that huge

LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!

"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."

https://www.anandtech.com/show/1662...ne-two-wse2-26-trillion-transistors-100-yield

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.

ames61 · May 4, 2024

bit_user said:
LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!

"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."

https://www.anandtech.com/show/1662...ne-two-wse2-26-trillion-transistors-100-yield

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.

To be accurate, that's no 100% yield if you sacrifice 1.5% cores and probably you have some other fault correction schemes to tackle D0. You can't avoid it.
That said, WSE-2 has no significant sales\deployment and probably goes nowhere for other reasons.

bit_user · May 4, 2024

ames61 said:
To be accurate, that's no 100% yield if you sacrifice 1.5% cores

No, it's 100% yield because no wafers go in the trash bin. That's what yield actually means, not that there are zero defects.

ames61 said:
WSE-2 has no significant sales\deployment and probably goes nowhere for other reasons.

How do you know? Cerebras is a private company. They're not obligated to report any of their sales or deployments.

ames61 · May 4, 2024

bit_user said:
No, it's 100% yield because no wafers go in the trash bin. That's what yield actually means, not that there are zero defects.

How do you know? Cerebras is a private company. They're not obligated to report any of their sales or deployments.

By your definition even if you have only 30% working cores then you have 100% yield if you don't scrap a wafer. On this token, every process node which starts with, say 50% GDPW, has 100% yield.

bit_user · May 4, 2024

ames61 said:
By your definition even if you have only 30% working cores then you have 100% yield if you don't scrap a wafer.

The reason I used the term "wafer" is that their product is an entire wafer. If we're talking about a normal chip, where you have multiple of them per wafer, then 100% yield would mean that all of your chips are working to spec. If you incorporated enough redundancy into your design, then you get higher yield.

GPUs are a good example of this, where the the big ones ship with some functional units disabled. That doesn't mean they didn't yield, just that the design was de-rated in order to increase the yield.

Of course, one can talk about other kinds of yields, like yield of individual cores, but I think the industry standard definition of yield is essentially the proportion of chips that are manufactured to usable quantity.

And yes, it makes sense to talk about the amount of redundancy needed to achieve a given yield, on a given process. However, now we're getting into the details rather than talking about that top-line figure.

subspruce · May 4, 2024

bit_user said:
LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!

"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."

https://www.anandtech.com/show/1662...ne-two-wse2-26-trillion-transistors-100-yield

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.

i stand corrected

Joe Physics · May 8, 2024

I can see no physicist posted here.

Power = Voltage Times Current =
1.2v TSMC times 18,000 amps = 21,600 watts, not 15,000 watts.
The announcement is a bit phony.

HotSlaw · May 29, 2024

Why is putting chips on a wafer and using a semiconductor as the interconnect better than using say a decent conductor like copper. I could understand if it was all contained on the wafer and not having to deal with connections but it seems like that is lost when placing the silicon on top of another silicon that goes to another piece of silicon. Is there some type of length reduction happening here? Less metalization layer??

Joe Physics · May 29, 2024

HotSlaw said:
Why is putting chips on a wafer and using a semiconductor as the interconnect better than using say a decent conductor like copper. I could understand if it was all contained on the wafer and not having to deal with connections but it seems like that is lost when placing the silicon on top of another silicon that goes to another piece of silicon. Is there some type of length reduction happening here? Less metalization layer??

Silicon has the same coefficient of expansion as silicon. A copper substrate would expand at a different rate and may have issues.

Search

News Tesla's Dojo system-on-wafer is in production — a serious processor for serious AI workloads

Admin

Administrator

subspruce

Prominent

bit_user

Titan

ames61

bit_user

Titan

ames61

bit_user

Titan

subspruce

Prominent

Joe Physics

HotSlaw

Joe Physics

TRENDING THREADS

Latest posts

Moderators online

Share this page