China can’t buy the best Nvidia chips. So Alibaba has decided to connect theirs and sell them as if they were one
Alibaba does not want its infrastructure artificial intelligence (AI) continues to depend on Nvidia technologies. Little by little, the largest technology companies in China are assuming the request that Xi Jinping’s government made them at the beginning of October 2024: as far as possible They had to use chips produced in China. Ten months later this recommendation became a requirement. And the data centers that belong to the State throughout the country had to use at least 50% Chinese integrated circuits on their servers. This scenario especially favors Huawei, Moore Threads and Cambricon Technologies because they are Top AI GPU Manufacturers from China, but it also works great for Alibaba. In fact, Alibaba Cloud, its cloud computing subsidiary, has taken a very important step forward. A few days ago it presented a new chip for AI, the Zhenwu M890, and made official a very ambitious itinerary that describes what solutions it will develop over the next three years. This GPU has been designed by T-Head, the semiconductor division that Alibaba founded in 2018. It incorporates 144 GB of HBM3 memory and achieves an interconnection transfer speed between chips of up to 800 GB/s. As we are about to discover, this last feature is essential in the strategy that Alibaba has developed to compete in the AI hardware market. Alibaba is going to spend $53 billion on its infrastructure According to Alibaba, the performance of its Zhenwu M890 chip is triple that of its predecessor. Additionally, it has been designed to perform well both during training of cutting-edge AI models and during inference. An important note: inference is broadly the computational process carried out by language models with the purpose of generating responses that correspond to the requests they receive. Alibaba wants to compete face to face with Nvidia in the deployment of infrastructure for data centers However, there is another relevant fact that is worth not overlooking: in medium precision operations (FP16) the Zhenwu M890 chip reaches 0.6 petaflops, a performance comparable to that of Nvidia’s A100 GPU and three times higher than that of the H20 chip. On the other hand, the ICN Switch interconnection chip allows link up to 128 GPUs M890 so that they work in unison. Alibaba assures that this architecture makes these GPUs work as a single chip, which, on paper, will allow it to compete head-to-head with Nvidia in the deployment of infrastructure for data centers. Regarding the itinerary that will follow until 2028, this Chinese company has anticipated that it plans to launch the Zhenwu V900 during the third quarter of 2027. According to Alibaba, it will implement its own significantly improved parallel computing architecture, will have three times the performance of the M890 chip, will be supported by 216 GB of memory and will reach an interconnection transfer speed of 1,200 GB/s. The Zhenwu J900 will arrive during the third quarter of 2028 with another major architectural leap. This roadmap It reflects that Alibaba goes all out. In fact, it has also announced that it will support this plan with an investment in 380 billion yuan (about $53 billion) over the next three years. Is the largest engagement of its kind in history of the company. Additionally, T-Head is planning its IPO to fund a more aggressive infrastructure investment program, which would put it in direct competition with Cambricon Technologies and Huawei’s Ascend line in the domestic AI chip market. Image | Alibaba More information | Alibaba | ChinaDaily In Xataka | Nvidia has to deal with the absolute distrust of several US legislators. Your plan in China is in danger In Xataka | The US wants to end Chinese AI chips sold abroad. And China knows how to defend itself