
Zidong Taichu is a cross-modal general artificial intelligence platform developed by the Institute of Automation of the Chinese Academy of Sciences (CASIA). Its core is OPT (Omni-Perception Pre-Trainer), described as the world's first tri-modal pre-training model covering images, text, and speech.
Background and Significance:
- The development of Zidong Taichu marks a breakthrough in artificial intelligence, particularly in pre-trained models.
- The platform is built around a multimodal large model and runs on a fully domestic, full-stack software and hardware foundation, which allows it to support AI applications across all scenarios.
Core technology features:
- Cross-modal understanding and generation: Zidong Taichu supports both cross-modal understanding and cross-modal generation. It can learn multiple tasks jointly without supervision and transfer quickly to data from different domains.
- Unified representation of the three modalities: By introducing the speech modality, Zidong Taichu represents images, text, and speech in a common semantic space, directly achieving a unified representation of all three modalities.
- Unique application scenarios: Zidong Taichu made "generating sound from images" and "generating images from sound" a reality for the first time, providing model-based support for more diverse scenarios such as video dubbing, voice broadcasting, headline summarization, and poster creation.
Milestones:
- On July 9, 2021, Zidong Taichu was officially unveiled at the Ascend AI Summit of the World Artificial Intelligence Conference (WAIC) 2021.
- On June 16, 2023, the Institute of Automation of the Chinese Academy of Sciences (CASIA) released Zidong Taichu 2.0 in Shanghai, with significant improvements in decision-making and judgment capabilities compared to the first generation.
- As of March 5, 2024, the Zidong Taichu large model, independently developed by the Wuhan Institute of Artificial Intelligence and the Institute of Automation of the Chinese Academy of Sciences (CASIA), had been iterated to version 2.0, and Zidong Taichu 3.0 was expected to be released in the first half of 2024.
Markets & Applications:
- The Zidong Taichu large model has completed filing under the Interim Measures for the Administration of Generative Artificial Intelligence Services and can officially go online to provide services to the public.
- The platform has broad application prospects in medical care, transportation, and industrial production, and is expected to play a greater role in these fields in the future.
Partners and impact:
- Newland, one of the founding partners of Zidong Taichu, ranks first in algorithm quality in the relevant field.
- The Zidong Taichu 2.0 omnimodal large model won the "Excellence in Artificial Intelligence Leadership Award", the highest award at the World Artificial Intelligence Conference (WAIC) 2022, reflecting its leadership and influence in the field of artificial intelligence.
To summarize, Zidong Taichu, a flagship project of the Institute of Automation of the Chinese Academy of Sciences, has achieved notable technical breakthroughs and has shown broad application prospects and strong market potential.
