News Tesla's Dojo system-on-wafer is in production — a serious processor for serious AI workloads

Status
Not open for further replies.

subspruce

Proper
Jan 14, 2024
149
36
110
pipe dream lmao, did you know the last company that tried to make wafer-scale processing? Cerberas and they failed hard to make anything, yield is just too low for something that huge
 

bit_user

Titan
Ambassador
pipe dream lmao, did you know the last company that tried to make wafer-scale processing? Cerberas and they failed hard to make anything, yield is just too low for something that huge
LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!

"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."

https://www.anandtech.com/show/1662...ne-two-wse2-26-trillion-transistors-100-yield

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.
 
May 4, 2024
2
0
10
LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!
"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."​

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.
To be accurate, that's no 100% yield if you sacrifice 1.5% cores and probably you have some other fault correction schemes to tackle D0. You can't avoid it.
That said, WSE-2 has no significant sales\deployment and probably goes nowhere for other reasons.
 

bit_user

Titan
Ambassador
To be accurate, that's no 100% yield if you sacrifice 1.5% cores
No, it's 100% yield because no wafers go in the trash bin. That's what yield actually means, not that there are zero defects.

WSE-2 has no significant sales\deployment and probably goes nowhere for other reasons.
How do you know? Cerebras is a private company. They're not obligated to report any of their sales or deployments.
 
  • Like
Reactions: jbo5112
May 4, 2024
2
0
10
No, it's 100% yield because no wafers go in the trash bin. That's what yield actually means, not that there are zero defects.


How do you know? Cerebras is a private company. They're not obligated to report any of their sales or deployments.
By your definition even if you have only 30% working cores then you have 100% yield if you don't scrap a wafer. On this token, every process node which starts with, say 50% GDPW, has 100% yield.
 

bit_user

Titan
Ambassador
By your definition even if you have only 30% working cores then you have 100% yield if you don't scrap a wafer.
The reason I used the term "wafer" is that their product is an entire wafer. If we're talking about a normal chip, where you have multiple of them per wafer, then 100% yield would mean that all of your chips are working to spec. If you incorporated enough redundancy into your design, then you get higher yield.

GPUs are a good example of this, where the the big ones ship with some functional units disabled. That doesn't mean they didn't yield, just that the design was de-rated in order to increase the yield.

Of course, one can talk about other kinds of yields, like yield of individual cores, but I think the industry standard definition of yield is essentially the proportion of chips that are manufactured to usable quantity.

And yes, it makes sense to talk about the amount of redundancy needed to achieve a given yield, on a given process. However, now we're getting into the details rather than talking about that top-line figure.
 
  • Like
Reactions: jbo5112

subspruce

Proper
Jan 14, 2024
149
36
110
LOL, where the heck did you hear that??

In fact, Cerebras achieved 100% yield of their WSE-2!
"Cerebras achieves 100% yield by designing a system in which any manufacturing defect can be bypassed – initially Cerebras had 1.5% extra cores to allow for defects, but we’ve since been told this was way too much as TSMC's process is so mature."​

Do you honestly believe anyone smart enough to make a go of such an epic undertaking would be too naive to account for defects? Do you really think any VCs would provide enough funding to someone so naive that they could even afford to have a go at it?

Lastly, did you not read the part of the article where Tesla is pre-validating the chips they're stacking on the carrier wafer? That's another way to do it.
i stand corrected
 
May 8, 2024
2
0
10
I can see no physicist posted here.

Power = Voltage Times Current =
1.2v TSMC times 18,000 amps = 21,600 watts, not 15,000 watts.
The announcement is a bit phony.
 
May 29, 2024
1
0
10
Why is putting chips on a wafer and using a semiconductor as the interconnect better than using say a decent conductor like copper. I could understand if it was all contained on the wafer and not having to deal with connections but it seems like that is lost when placing the silicon on top of another silicon that goes to another piece of silicon. Is there some type of length reduction happening here? Less metalization layer??
 
May 8, 2024
2
0
10
Why is putting chips on a wafer and using a semiconductor as the interconnect better than using say a decent conductor like copper. I could understand if it was all contained on the wafer and not having to deal with connections but it seems like that is lost when placing the silicon on top of another silicon that goes to another piece of silicon. Is there some type of length reduction happening here? Less metalization layer??
Silicon has the same coefficient of expansion as silicon. A copper substrate would expand at a different rate and may have issues.
 
Status
Not open for further replies.