How did they train their model? Did they leverage OpenAI models? China never developed anything from scratch. No R&D.
The training code is open source, that is why it's called OpenAI.
You build a model by feeding it massive amounts of data, which takes equally massive amounts of computational power. The Chinese company alleges it spent only ~$6 million on this phase, which is incredibly low given that most western companies have to spend $40-60 million. Assuming this is true and the method works "well enough", it means we can build AI models for a fraction of the previously anticipated cost. This is very bad news for any company that invested billions in building new datacenters and purchasing hardware: their expected profitability just tanked.
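To put "a fraction of the cost" into numbers, here is a quick back-of-the-envelope sketch. It only uses the figures quoted above (the alleged ~6 million versus the 40-60 million range), assumes both are in US dollars, and takes them at face value; the variable and function names are just made up for illustration.

```python
# Figures as quoted in this thread; none of them are independently verified.
claimed_cost_musd = 6.0            # alleged training spend, millions of USD (assumed currency)
typical_cost_musd = (40.0, 60.0)   # range cited here for western companies, millions of USD


def cost_ratio(claimed, typical):
    """Return the claimed cost as a fraction of a typical cost."""
    return claimed / typical


low, high = typical_cost_musd

# Share of a typical budget that the claimed figure represents.
share_vs_low = cost_ratio(claimed_cost_musd, low)    # 6 / 40
share_vs_high = cost_ratio(claimed_cost_musd, high)  # 6 / 60

print(f"claimed cost is {share_vs_high:.0%} to {share_vs_low:.0%} of the typical cost")
print(f"that is roughly {low / claimed_cost_musd:.1f}x to {high / claimed_cost_musd:.1f}x cheaper")
```

If the claim holds, that is somewhere between a tenth and a sixth of the usual training bill, which is why the hardware-heavy investments suddenly look hard to justify.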
As for "what data did they use", I would imagine they used the same data OpenAI did.
