Shenzhen Team Completes Full-Parameter Post-Training of 1.6-Trillion-Parameter DeepSeek-V4-Pro Model on Homegrown Ascend 910C Cluster

According to Shenzhen Release, on June 5 a project team from Shenzhen Hezhou Academy, working with Harbin Institute of Technology (Shenzhen), Shenzhen Big Data Institute, Huawei, and Deep Intelligence City's AI computing platform, completed full-parameter post-training of the 1.6-trillion-parameter DeepSeek-V4-Pro model on a domestic Ascend 910C AI computing cluster. The report describes this as one of the first instances of a third-party organization completing post-training at this scale on a Chinese domestic computing platform, and presents it as evidence that domestic AI chips can support training of world-class, very large models.