How do you know that? For certain, I mean? Your screenshot assumes it "steals" from Tom's because Tom's has "comfortable keyboard" in its review. But if you google
"Lenovo Thinkpad X13" "comfortable keyboard", you get 63,200 Google results. You also take something like "excels in productivity" as potential plagiarism, when the actual quote, "
designed for productivity", has 23,400 results. And that probably doesn't include countless user reviews on some websites and some other sources.
Since I only have a very superficial knowledge of neural networks (especially ones of this magnitude), I'm not saying that you are necessarily wrong. I'm just saying that you may be mistaken about at least some aspects of it and therefore confusing correlation with causation. This is what I was talking about with confirmation bias and Bing Chat. From my understanding about how these neural networks process, evaluate, and organize data (and again, I have some perfunctory knowledge but I'm not an expert) , there is a higher chance that this is simply a mish-mash of all Lenovo Thinkpad X13 reviews (likely including last-gen and possibly translated ones as well) that bears some similarity to some of them, than there is one that this quote was taken verbatim from Tom's and intentionally rephrased to avoid copyright infringement.
As for the sources provided by Google, again, I'm not even sure that it has anything to do with the models to begin with. There is a good chance that their algorithm simply looks up relevant keywords in the output and presents the top domains that have the most occurrences of the phrase - as well as some sponsored links - as the "sources".
"best graphics card" site:quora.com has 16,200 results, roughly twice as many as Reddit (9,460) or Tom's (6,780). As for the other two links in your example, I'm not familiar with either, but the fact that one of them is a retailer and another is simply a non-player in the global traffic rankings makes me think that these are both sponsored links masqueraded as sources.
As you can see, I'm not defending either Google or Microsoft. If my assumptions about them making up the sources on the fly and presenting ads as sources are correct, that's not exactly vindication by any means. What I'm trying to say here is that we need more tangible facts, more objective research, and a hell of a lot less guesswork when it comes to how exactly these companies train and deploy their models before making up our minds and reaching for the pitchforks.