Huawei reshaping industries with AI cloud services and upgraded Pangu models

Huawei unveiled its next-generation Huawei Cloud AI Service as well as its latest Pangu 5.5 models at its Developer Conference 2025 in Dongguan, China.

At Huawei’s Developer Conference 2025 in Dongguan, China, Huawei unveiled its next-generation Huawei Cloud AI Service. The new service is built on CloudMatrix 384 supernodes, offering robust compute for advanced AI model applications. Huawei also unveiled its latest Pangu Models 5.5 with significant upgrades across five capabilities that include natural language processing (NLP), computer vision (CV), multi-modal, prediction, and scientific computing.

According to Zhang Ping'an, Executive Director of Huawei, CEO of Huawei Cloud, the unprecedented advancement of AI technologies is causing explosive growth in compute requirements for foundation model training and inference, leaving traditional computing architectures struggling to keep pace.

Taking a deeper look at Huawei Cloud's new-generation AI Cloud Service, the CloudMatrix 384 supernodes that it is based on is the industry's first to implement peer-to-peer interconnection of 384 proprietary NPUs and 192 Kunpeng CPUs through a high-speed MatrixLink network to form a super AI server, which increases the inference throughput of a single card to 2,300 tokens/s, a near 4-fold improvement over that of non-supernodes.

With its capability to better support the inference of mixture of experts (MoE) models, the supernode also allocates resources flexibly, improves parallel task processing, reduces waiting time, and improves model FLOPS utilization (MFU) by more than 50%. Additionally, the supernodes support integrated deployment of compute for training and inference, performing tasks such as inference by day and training by night. The compute resources for training and inference can be flexibly allocated to help customers optimize resource usage.

According to Huawei, the AI Cloud Service has emerged as the preferred choice for AI infrastructure supporting more than 1,300 customers, such as Sina, SiliconFlow, ModelBest, Chinese Academy of Sciences (CAS), and 360, accelerating intelligent upgrade across industries.

Pangu Models 5.5 reshaping industries

Apart from the new 718B deep thinking model unveiled, which is a MoE model consisting of 256 experts, the upgraded Pangu Models 5.5 include capabilities to improve user experience in long-sequence processing, low hallucinations, integrated fast and slow thinking, and agents.

Pangu Models aim to help industry customers build their own models without "reinventing the wheel". Huawei Cloud provides enterprises with six core capabilities: Pangu foundation and industry-specific models, pre-training and post-training corpus, data engineering tool sets, model training tool sets, industry-specific judge models, and industry-specific evaluation platforms.

Huawei Cloud also released a new model – the Pangu World Model based on the Pangu Multimodal Model. The Pangu World Model generates digital physical spaces for training intelligent driving and embodied AI robots, which also supports continuous optimization and iteration.

Another release is the CloudRobo Embodied AI Platform based on the multimodal and thinking capabilities of Pangu Models. The platform integrates end-to-end capabilities, such as data synthesis, data labeling, model development, simulation verification, cloud-edge synergetic deployment, and sensing and security monitoring.

Meanwhile, the Pangu Prediction Model uses the industry's first triplet transformer unified pre-training architecture, which realizes unified triplet encoding of data from different industries, including table data from manufacturing-process parameters, time series data from device-running logs, and image data from product inspections. The model efficiently processes and pre-trains this data within the same framework, greatly improving the accuracy of prediction and providing better generalization capabilities for predictions across different industries and scenarios.

Other models released include the Pangu Scientific Computing Model as well as the Pangu CV Model, a 30B-parameter CV model based on the new MoE architecture, which is also the largest CV model in the industry and supports multi-dimensional, pan-vision perception, analysis, and decision-making, with pan-vision meaning that it supports identification of images, infrared, lidar-generated point clouds, light spectrum, and radar.

Cloud services with AI

While some businesses are now looking to move back on-premises as they continue to develop and deploy more AI workloads, Huawei Cloud CTO Zhang Yuxin unveiled a few new AI enhancements to its cloud services as well.

This includes ModelArts Versatile, the optimal AI agent platform for enterprises that offers experience templates designed for diverse service needs, empowering businesses and developers to create professional, productive, and proactive enterprise-level AI agents. The platform also revolutionizes AI agent generation with its intelligent toolchain, transforming what once took days into mere minutes. By streamlining the process, it slashes both the complexity and expertise traditionally required for agent development.

Therea are also enhancements to Pangu Doer, Huawei Cloud’s intelligent assistant. The tool is now leverages cutting-edge AI compute, advanced Pangu models, and robust agents for unparalleled performance. The Pangu Deep Reasoning Model elevates Pangu Doer's intelligence by enhancing its intent understanding, task planning, and execution precision. With tailored professional domain models that leverage specialized knowledge to boost expertise, customer’s agentic workflows can now be streamlined to address critical pain points.

Meanwhile, CodeArts Doer, the intelligent assistant is Huawei Cloud's software development pipeline, is equipped with six specialized agents, which streamlines every stage of the R&D lifecycle—project management, product management, build, testing, and deployment—boosting efficiency by over 40%. There is also GaussDB Doer which empowers enterprises to develop their own Database Administrator (DBA) capabilities with comprehensive upgrades across three key areas.

Other enhancements are the MetaStudio, which revolutionizes virtual human creation with next-level ease and precision. Its advanced TTS voice synthesis delivers unmatched realism in tone and articulation, while more accurate lip sync and more dynamic gestures bring characters to life. Huawei Cloud also announced Model Application Firewall (MAF), designed to fortify model inference security by preventing prompt injections and detecting non-compliant content in real time. This system effectively counters common injections like jailbreaking, role playing, and malicious instructions. With a preconfigured library housing millions of prompt rules, it detects over 95% of prompt injections, boosting the overall model security score by more than 20%.