That's because Intel introduced it at 14 nm. People tested Intel's implementation in Alder Lake (back when you still could) and found neither a significant clock penalty, nor excessive temperatures. 14 nm was just too soon to introduce it into a high-clocking CPU (though fine for Xeon Phi, which only boosted up to 1.7 GHz (1.5 GHz base)).
IMO, the main thing AMD got right was to wait until TSMC N5 to do it. Sure, they also had half the FMA ports and everything in their backed basically retained the same widths as Zen 3.
In the N4P-based Zen 5 (desktop/server), not only did they double the width of the pipes, but they also doubled the number of FMA ports, to equal Golden Cove Server's AVX-512 pipeline arrangement.
Compare to:
Last year, Intel’s Golden Cove brought the company back to competitive against AMD.
chipsandcheese.com
Lion Cove already increased the FP pipelines by 33%. It'd be interesting if their first core to implement AVX10/256 had double the issue ports at half the width of their Golden Cove server cores.