Pangu LM

10mos agoupdate 377 0 0

Huawei has developed an industry-leading, ultra-large-scale pre-trained model with powerful natural language processing, visual processing, and multimodal capabilities that can be widely used in multiple industry scenarios.

Location:
China
Language:
zh
Collection time:
2024-06-01
盘古大模型Pangu LM

Pangu (creator of the universe in Chinese mythology)Large ModelIt is an ultra-large scale pre-training model developed by Huawei, which has demonstrated excellent performance and wide application potential in several fields. The following is a detailed description of the Pangu Large Model:

  1. model composition::
    • Pangu Big Model contains several sub-models such as NLP Big Model, CV Big Model, Scientific Computing Big Model and so on.
    • The NLP Big Model is the industry's first Chinese pre-training Big Model with over 100 billion parameters, and is considered to be the closest to human Chinese comprehension.AI macromodel.
  2. Technical characteristics::
    • high performance: The Pangu large model performs well in several NLP, CV and other tasks, with performance metrics better than industry SOTA models. The performance metrics outperform other models in 16 downstream tasks, including several leads in zero-sample, single-sample, and small-sample learning tasks.
    • computational fusion: The application of graph-computing fusion technique in the Pangu model reduces the overall training time by more than 201 TP4T, optimizing the training performance of the model.
    • Large-scale pre-training: The NLP Big Model has learned more than 40TB of industry text data and 4 million hours of industry speech data in the pre-training phase, and has a strong reserve of generalized Chinese knowledge.
  3. Industry Applications::
    • The Pangu Big Model can be applied to multiple industry scenarios, such as government, finance, manufacturing, medicine, mining, railroad, meteorology, and so on.
    • In the financial industry, intelligent customer service can be realized to answer the user's banking, insurance and other questions.
    • In the e-commerce industry, it can achieve product recommendation, intelligent customer service and other functions to provide a personalized shopping experience.
    • In education, it can be used in intelligent tutoring systems to answer students' questions and provide personalized learning guidance.
  4. Architecture and Hierarchy::
    • Pangu Big Model 3.0 adopts a three-layer architecture: L0 Basic Big Model, L1 Industry Big Model and L2 Scenario Model.
      • Layer L0: contains five basic macromodels, including natural language macromodels, vision macromodels, multimodal macromodels, prediction macromodels, and scientific computing macromodels.
      • Layer L1: is a large model for each industry, trained based on industry public data or customer-owned data.
      • Layer L2: Models for more refined scenarios, providing out-of-the-box modeling services.
  5. modeling capability::
    • The Pangu Big Model provides a rich set of capabilities, including knowledge quiz, copy generation, and code generation for the Natural Language Processing Big Model, and image generation and image understanding for the Multimodal Big Model.
  6. latest developments::
    • Huawei has released Pangu Grand Model 5.0 and debuted it alongside HarmonyOS NEXT Hongmeng Xinghe Edition at the Huawei Developer Conference.
    • In the field of mining, Huawei Pangu Big Model has realized the first commercialization, solving the problem of the difficulty of landing artificial intelligence in the field of mining.

As an important technological achievement of Huawei, Pangu Big Model is becoming an important force in promoting the development of the AI field with its excellent performance and wide application potential.

data statistics

Relevant Navigation

No comments

none
No comments...