News Xilinx One-Ups Intel With PCIe 4.0 Alveo U50 Data Center Card

bit_user

"25x lower latency and 10x higher throughput in speech translation compared to Tesla T4"
Nice try, but these benchmarks typically use a batch size of 1, which puts GPUs at an unrealistic disadvantage. I'm also curious whether their benchmark used the T4's Tensor cores and any of its integer functionality.

Using a realistic batch size, I'd be really surprised if the U50 could beat the T4.
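
To show why batch size matters so much here, this is a toy model and not a measurement of either card. It assumes each batch pays a fixed overhead plus a small per-sample cost. The constants `FIXED_OVERHEAD_MS` and `PER_SAMPLE_MS` are made-up numbers picked only for illustration.

```python
# Toy model: every number here is an assumption for illustration,
# not a measurement of the T4, the U50, or any real workload.
FIXED_OVERHEAD_MS = 2.0  # hypothetical per-batch cost (launches, weight loads)
PER_SAMPLE_MS = 0.05     # hypothetical incremental cost per sample in a batch


def batch_latency_ms(batch_size: int) -> float:
    """Time to process one batch under the toy model."""
    return FIXED_OVERHEAD_MS + PER_SAMPLE_MS * batch_size


def throughput_per_s(batch_size: int) -> float:
    """Samples per second when batches run back to back."""
    return batch_size / (batch_latency_ms(batch_size) / 1000.0)


if __name__ == "__main__":
    for bs in (1, 8, 64):
        print(
            f"batch={bs:3d}  "
            f"latency={batch_latency_ms(bs):.2f} ms  "
            f"throughput={throughput_per_s(bs):.0f} samples/s"
        )
```

Under these assumptions, a larger batch spreads the fixed overhead over more samples, so throughput climbs steeply while per-batch latency grows only slowly. A batch-size-1 comparison mostly measures that fixed overhead, which is where a GPU looks worst.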