🎤 Cheer for Your Idol · Gate Takes You Straight to Token of Love! 🎶
Fam, head to Gate Square now and cheer for #TokenOfLove# — 20 music festival tickets are waiting for you! 🔥
HyunA / SUECO / DJ KAKA / CLICK#15 — Who are you most excited to see? Let’s cheer together!
📌 How to Join (the more ways you join, the higher your chance of winning!)
1️⃣ Interact with This Post
Like & Retweet + vote for your favorite artist
Comment: “I’m cheering for Token of Love on Gate Square!”
2️⃣ Post on Gate Square
Use hashtags: #ArtistName# + #TokenOfLove#
Post any content you like:
🎵 The song you want to he
6000-word detailed explanation of the Pangu model: Can it support the other pole of the world's AI?
Huawei has shown off its "muscles" in the field of large models.
On July 7, the 2023 Huawei Developer Conference (HDC 2023) opened. In the keynote speech of more than two hours in the afternoon, HUAWEI CLOUD disclosed the progress of the Pangu large model in detail for the first time. It not only released the industry-oriented Pangu large model 3.0, but also introduced in detail the basic technical capabilities of Huawei to develop large models.
The Pangu Large Model 3.0 includes a "5+N+X" three-tier structure. The three layers refer to the five basic large models of the L0 layer, the N industry-wide large models of the L1 layer, and the L2 layer that allows users to independently train more Refine the scene model. It adopts a complete layered decoupling design, and enterprise users can choose a suitable large-scale model development, upgrade or fine-tuning based on their own business needs, so as to adapt to the changing needs of thousands of industries.
Huawei is one of the earliest cloud service providers in China to deploy large-scale models, and has released the Pangu large-scale model as early as 2021. On the road to developing large-scale models, Huawei has built an AI computing power cloud platform based on Kunpeng and Ascend from the bottom layer, as well as technical capabilities such as the heterogeneous computing architecture CANN, the full-scenario AI framework MindSpore, and the AI development production line ModelArts. .
In addition to the large model and computing power base, at the meeting, HUAWEI CLOUD also highlighted typical cases of the combination of the Pangu large model and specific industries. The industries involved include government affairs, meteorology, railways, manufacturing, and finance, as well as multiple upgrades and reshaping of Huawei. Application cases of cloud software products and services.
Whether it is basic technical capabilities, AI**+ cloud product service system, or application cases in specific industries, HUAWEI CLOUD has demonstrated highly mature and systematic business capabilities, which really impresses the industry. Bring surprises. **While everyone is still arguing about who is China's OpenAI, HUAWEI CLOUD has opened up a fairly mature development path for large-scale models.
Huawei is using its own practice to prove that large-scale models are important, but more importantly, it is to use large-scale models to solve the pain points of industries and products, to make products and services that can make enterprises and users pay, and to truly create value for thousands of industries.
01 Pangu Large Model 3.0: Layered Decoupling Architecture
Decoupling is the keyword of the Pangu Model 3.0 released today. This is also a common appeal of industry customers who have actually invoked large models in the past few months.
A leading SaaS vendor said when releasing its own large-scale model upgrade application, "We do not develop large-scale models by ourselves, but in different business scenarios, which large-scale model is good at what, we take that model." In order to be able to Switching between different large models, "Our own product architecture must be independent of the underlying large model, or loosely coupled."
"The decoupling design of Pangu's large model is for the sake of the industry." At the Huawei Developer Conference, Zhang Pingan, Huawei's executive director and CEO of Huawei Cloud, gave the differentiated route of Pangu's large model. Its core is to decouple the various layers and capabilities of the Pangu model, allowing industry users to develop according to their own needs.
“5” represents the five basic large models of the L0 layer: including natural language, vision, multimodal, forecasting, and scientific computing large models, which provide and meet the needs of various skills in industry scenarios.
Pangu 3.0 provides customers with serialized basic large models with 10 billion parameters, 38 billion parameters, 71 billion parameters and 100 billion parameters, matching the diversified needs of customers in different scenarios, different delays, and different response speeds. At the same time, it provides a new set of capabilities, including knowledge question answering, copy generation, and code generation for NLP large models, as well as image generation and image understanding for multi-modal large models. These skills can be directly used by customers and partner companies. Regardless of the size of the large model, Pangu provides a consistent set of capabilities.
The "N" in the "5+N+X" three-tier structure represents N large industry models at the L1 level. There are two ways to provide industry large models: on the one hand, HUAWEI CLOUD can provide general industry large models trained using industry public data, including government affairs, finance, manufacturing, mining, weather and other large models; on the other hand, it can be based on industry customers With its own data, on the L0 and L1 layers of the Pangea large model, it trains its own proprietary large model for customers.
Zhang Pingan said: "Pangu was born to serve the industry, providing a variety of large-scale model deployment, development, and reasoning forms. It can generate its own large-scale industry model just like Huawei's large-scale model of Pangu, and only needs to input its own private data. .” Moreover, the training data is also decoupled from the large model.
The X in "5+N+X" means that the L2 layer provides customers with more detailed scene models, focusing more on government affairs hotlines, network assistants, leading drug screening, foreign object detection on conveyor belts, and typhoon paths Provide customers with "out-of-the-box" model services for specific industry applications or specific business scenarios such as forecasting.
Through the three-layer large model of "**5+N+X", HUAWEI CLOUD built its own large model base.
At yesterday's World Artificial Intelligence Conference, Hu Houkun, Huawei's rotating chairman, explained vividly: "The most basic level of benchmarking is the general large-scale model, which we call the basic large-scale model. Our image at this level is called reading thousands of books, which is to do well. A large amount of basic knowledge is learned. On this layer, industry models and scene models are also created, called traveling thousands of miles. There are still many challenges to overcome from reading thousands of books to traveling thousands of miles. The key point is to Huawei is working with partners from various industries to fully match and integrate knowledge from various industries with large models.”
**In addition, the innovation of the large model is not only the innovation of the model itself, but also depends on the innovation of various root technologies of AI. At the meeting, Yao Jun, director of Huawei's Noah's Ark Laboratory, introduced the technical base of the Pangu model.
Huawei has built an AI computing power cloud platform based on Kunpeng and Ascend at the bottom layer, as well as the heterogeneous computing architecture CANN, the full-scenario AI framework MindSpore, and the AI development production line ModelArts, etc., to provide distributed solutions for the development and operation of large models. Key capabilities such as parallel acceleration, operator and compilation optimization, and cluster-level communication optimization. Based on Huawei's AI root technology, the performance of large model training can be adjusted to 1.1 times that of mainstream GPUs in the industry.
At the same time, 90% of operators in these frameworks can be smoothly migrated to the Ascend platform through Huawei's end-to-end migration tool. For example, Meitu migrated 70 models to Ascend in just 30 days. At the same time, HUAWEI CLOUD and the Meitu team jointly optimized more than 30 operators and accelerated the process in parallel. Compared with the original solution, the AI performance improved by 30% .
In addition, GPU failures are often encountered during large model training, and developers have to restart training frequently, which takes a long time and costs a lot. Ascend AI cloud service can provide more stable AI computing service. The long-term stability rate of 30-day kilocalorie training reaches 90%, and the breakpoint recovery time does not exceed 10 minutes.
02 Empower thousands of industries
Ren Zhengfei previously said, "The direct contribution of artificial intelligence software platform companies to human society may be less than 2%, and 98% is the promotion of industrial society and agricultural society. But the application platform is not our option, we will be the bottom layer of AI Computing power platform."
Letting large models into thousands of industries has become the focus of Huawei's development of large models. At the meeting, HUAWEI CLOUD introduced the application cases of the Pangu large model in seven fields including government affairs, railways, meteorology, and finance.
Government Affairs
According to Huawei Cloud, the core of Pangu's large model of government affairs is cognitive ability. Let the urban public system be seen and understood, and complete the closed loop from perception to cognition and disposal. And according to different scenarios, it provides different capabilities such as question answering, copy generation, video perception, and multimodal understanding.
railway
Traditional train inspectors have to inspect millions of train pictures every day to detect whether there are faults in the freight cars running on the railway network. After the introduction of the Pangu large model, it can accurately identify 67 kinds of trucks running on the live network and more than 430 kinds of faults, and the screening rate of non-faulty pictures is as high as 95%. In other words, train inspectors only need to detect 1/20 of the train pictures in the past, which is equivalent to a 20-fold increase in work efficiency.
coal mine
In the field of coal mines, the large-scale model of Pangu Mine has been used in 8 mines across the country. A large model can cover more than 1,000 subdivided scenarios under business processes such as mining, excavation, machinery, transportation, transportation, and washing of coal mines, allowing more More coal miners can work on the ground, which not only makes the working environment of coal miners more comfortable, but also greatly reduces safety accidents.
meteorological
finance
In the field of finance, Pangu Large Model cooperated with ICBC to create a series of exploratory applications.
One of the typical scenarios is to improve the work efficiency of bank tellers. ICBC has tens of thousands of outlets across the country and 200,000 outlet tellers. They need to switch between various services, which will waste a lot of time.
And this is only the most basic application. Huawei is exploring with the financial industry to apply the large model to more financial scenarios such as credit analysis in the future.
manufacturing
Huawei itself is also a manufacturing company. The hardware products it manufactures involve communication base stations, mobile phones, automobiles, chips and other fields. Based on the experience accumulated in the past, Huawei introduced the Pangu large model into the field of production and manufacturing.
Drug Discovery
In the field of drug research and development, the original research and development of a new drug takes an average of 10 years and costs 1 billion US dollars. The large molecular model of Pangu drugs helped the team of Professor Liu Bing of the First Affiliated Hospital of Xi'an Jiaotong University discover the world's first new target and new class of antibiotics in 40 years, and shorten the lead drug development cycle to one month and reduce the development cost by 70%.
03 Large model integrated into Huawei Cloud product system
In addition to the practice in thousands of industries, the HUAWEI CLOUD Pangu model has also been deeply integrated into HUAWEI CLOUD's product services to restructure product innovation.
Pangu Large Model + Huawei Cloud Service
With the blessing of the Pangu model, a series of B-end products and services of Huawei Cloud have been upgraded and reconstructed. At the meeting, HUAWEI CLOUD introduced the details of four service upgrades: data service, cloud customer service, BI, and cloud search.
Pangu large model + CodeArts code tool
The tool has trained 76 billion lines of selected codes and 13 million technical documents. It has three core functions of intelligent generation, intelligent question and answer, and intelligent collaboration. It can realize code generation in one sentence of dialogue, automatic annotation and generation of test cases in one click. One command can be deployed intelligently, so that every software developer has his own programming assistant.
Pangu Large Model + Digital Man
Based on these two major services, developers can quickly generate and drive digital human models, empowering online education, entertainment live broadcast, corporate conferences and other industry applications, so that every enterprise employee can realize "digital human freedom". For example, users only need to upload a 20-second personal video on the service page of HUAWEI CLOUD MetaStudio to quickly generate a personalized digital human explanation video. The work completed by three R&D personnel in three days in the past can now be completed in only three minutes .
Pangu Large Model + Embodied Intelligence
At the meeting, Huawei Cloud also mentioned the application of the Pangu model in the field of robotics and demonstrated a video.
According to Huawei, the above demonstration is not a concept video, but a real product, which was exhibited at the venue during the HDC conference.
**04 Summary and thinking: Can Huawei become the other pole of AI? **
Zhang Pingan said, “In order to help global customers, partners, and developers train and use large models, we are committed to creating a world for global customers AI **Another pole, providing new AI developers s Choice". **
Even earlier, as early as March this year, Ren Zhengfei had expressed a similar meaning within the company. He said that there will be a surge in AI models, not just Microsoft. Ren Zhengfei's reason is actually the direction of Huawei Cloud's efforts today, that is, the direct contribution of artificial intelligence software platform companies to human society may be less than 2%, and 98% is the promotion of industrial society and agricultural society.
For example, factories in China and Germany are promoting the promotion of artificial intelligence to the industry, so as to realize unmanned production; for example, the wharf in Tianjin Port has also tried unmanned cargo loading and unloading. Once the code is entered, the container will be automatically removed from the ship. Carry it over and then transport it away by car; for example, in the coal mine in Shanxi, after adopting 5G+ artificial intelligence underground, the number of personnel has been reduced by 60-70%, and most people work in suits in the control room on the ground.
These are examples where AI has been applied to the industrial side on a large scale in the past few years. What these industries have in common is that they have huge scale and output value, and a little improvement in efficiency can bring huge benefits.
**The emergence of large models essentially provides more efficient productivity tools. **On the one hand, for these industries that are already embracing AI, it means higher efficiency and faster transformation process; and higher efficiency also means that it is easier for more industries to calculate the "economic account" ", AI has the potential to transform from a few so-called major industries to transforming thousands of industries.
This is the reason why Huawei resolutely enters the industry. In fact, major domestic cloud service companies such as Alibaba Cloud, Tencent Cloud, Volcano Cloud, and Baidu Cloud have similar ideas. In the case of the same direction and close starting point, who can run the fastest in this competition is the whole chain capability from computing power, large model base, platform, products to specific solutions.
Due to well-known reasons, Huawei cannot obtain the world's most advanced computing chip, which is currently recognized, and it seems that it is inherently insufficient in this competition. But judging from today's press conference, Huawei can't see that it is lagging behind due to the constraints of the upper reaches. In the key chain of the large model, it has come up with mature products and cases, and the decoupled Pangu large model architecture is even more It is eye-catching. **In fact, considering the needs of localization today, Huawei, which does not lag behind in terms of computing power, is likely to become an independent and controllable advantage. **
Large models have become a new opportunity for Huawei, and it looks like it is becoming a reality.