GPT-4o OpenAI introduces a multimodal, all-inclusive AI model that supports text, audio and image input and output with fast response and advanced features, and is free and open to the public to provide a natural and smooth interactive experience. 01,2240 Large Model# GPT-4o# OpenAI
Congrong LM The multimodal large model independently developed by CloudScience has the ability of real-time learning, synchronous feedback, cross-modal interaction, etc. It is widely used in many industries such as finance, security, government affairs, etc., to promote the popularization and development of AI applications. 01,3460 Large Model# Multimodal
Guangyu LM An innovative big model that combines big language and symbolic reasoning, designed to enhance the credibility and accuracy of applications in finance, healthcare, and other fields. 07360 Large Model
XiHu LM Westlake HeartStar's self-developed universal big model, which integrates multimodal capabilities and possesses high IQ and EQ, has been widely used in many fields. 01,0140 Large Model# Universal Large Model
Yan model Rockchip has developed the first non-Transformer architecture generalized natural language model with high performance, low cost, multimodal processing capability and private deployment security. 09000 Large Model# Multimodal
Blue Heart Large Model Vivo's self-developed generalized big model matrix contains several self-developed big models covering core scenarios, providing intelligent assistance, dialog bots, and other functions with powerful language understanding and generation capabilities. 01,1070 Large Model
Qwen2.5-Max The mega-scale Mixture of Experts model introduced by AliCloud's Tongyi Thousand Questions team stands out in the AI field for its excellent performance and wide range of application scenarios. 02,5750 Large Model# Large model# A Thousand Questions on Tongyi
OpenAI o3-mini OpenAI introduces small AI models with inference capabilities and cost-effective pricing, designed for developers and users to optimize application performance and efficiency. 01,2800 Large Model# OpenAI# Large model# Reasoning Model
Gemini 2.0 Flash Google introduced a new generation of AI models that support multimodal inputs and outputs and natively integrate intelligent tools to provide developers with powerful and flexible assistant functions. 01,1970 Large Model# Gemini 2.0 Flash# Google# Multimodal
Gemini 2.0 Pro Google released a high-performance AI model with strong coding performance and the ability to handle complex cues with a contextual window of 2 million tokens. 06,5690 Large Model# 2.0# Gemini# Google
s1 An AI model developed by Fei-Fei Li's team that achieves superior inference performance at a very low training cost. 06,6490 Large ModelOpen Source Project# Reasoning Model# Model Distillation
DeepSeek-V3 Hangzhou Depth Seeker has launched an efficient open source language model with 67.1 billion parameters, using a hybrid expert architecture that excels at handling math, coding and multilingual tasks. 06,4980 Large ModelOpen Source Project# DeepSeek# Open Source Large Model
DeepSeek-VL2 Developed by the DeepSeek team, it is an efficient visual language model based on a hybrid expert architecture with powerful multimodal understanding and processing capabilities. 06,8450 Large ModelOpen Source Project# Visual Language Model
WebLI-100B Google DeepMind launches a 100 billion visual language dataset designed to enhance the cultural diversity and multilingualism of AI models. 06,3580 Large Model# Visual Language Model
Confucius-o1 NetEaseYouDao launched the first 14B lightweight model in China that supports step-by-step reasoning and explanation, designed for educational scenarios, which can help students efficiently understand complex math problems. 06,2080 Large ModelOpen Source Project# Reasoning Model# Netease Youtube
Grok 3 The third generation of artificial intelligence models developed by Musk's xAI company, with superior computational and reasoning capabilities, can be applied to a variety of fields such as 3D model generation and game production, which is an important innovation in the field of AI. 06,4140 Large Model# xAI
Ovis2 Alibaba's open source multimodal large language model with powerful visual understanding, OCR, video processing and reasoning capabilities, supporting multiple scale versions. 06,8390 Large ModelOpen Source Project# Multimodal Large Model
Claude 3.7 Sonnet Anthropic has released the world's first hybrid reasoning model that demonstrates superior performance and flexibility by being able to flexibly switch between rapid response and deeper reflection based on different needs. 06,4600 Large Model# Hybrid Reasoning Model
GPT-4.5 OpenAI's large-scale language model, officially launched on February 28, 2025, is an upgraded version of GPT-4. 06,2580 Large Model# OpenAI
QwQ-32B Alibaba released a high-performance inference model with 32 billion parameters that excels in mathematics and programming for a wide range of application scenarios. 06,3910 Large ModelOpen Source Project# Reasoning Model# A Thousand Questions on Tongyi
R1-Omni Alibaba's open-source multimodal large language model uses RLVR technology to achieve emotion recognition and provide an interpretable reasoning process for multiple scenarios. 06,6240 Large ModelOpen Source Project# Multimodal# Emotion Recognition
Gemma 3 Google launched a new generation of open source AI models with multi-modal, multi-language support and high efficiency and portability, capable of running on a single GPU/TPU for a wide range of application scenarios. 06,6430 Large Model
Seedream 2.0 Byte Jump launched a native bilingual image generation model with excellent comprehension and rendering capabilities for a wide range of creative design scenarios. 07,2340 AI image generationLarge Model# Image Generation
Chitu The Tsinghua University team and Qingcheng Jizhi jointly launched an open source large model inference engine, aiming to realize efficient model inference across chip architectures through underlying technological innovations and promote the widespread application of AI technology. 07,3920 Large ModelOpen Source Project# Large model
Bunshin Big Model 4.5 Baidu's self-developed native multimodal basic big model, with excellent multimodal understanding, text generation and logical reasoning capabilities, using a number of advanced technologies, the cost is only 1% of GPT4.5, and plans to be fully open source. 06,1840 Large Model# Multimodal Large Model
Bunshin Big Model X1 Baidu launched an advanced large language model with deep thinking, multi-modal support and multi-tool invocation capabilities to meet the needs of multiple domains with excellent performance, affordable price and rich functionality. 06,3250 Large Model# Deep Thinking Model
Claude 3.7 Max Anthropic's top-of-the-line AI models for hardcore developers tackle ultra-complex tasks with powerful code processing and a 200k context window. 06,5440 AI programmingLarge Model
o1-pro High-performance inference models from OpenAI with enhanced multimodal inference capabilities, structured outputs, and function call support, designed to handle complex professional problems with high pricing but high performance. 06,3080 Large Model# OpenAI
Command A Cohere released a lightweight AI model with powerful features such as efficient processing, long context support, multi-language and enterprise-grade security, designed for small and medium-sized businesses to achieve superior performance with low-cost hardware. 06,4750 Large Model
Hunyuan T1 Tencent's self-developed deep thinking models with fast response, ultra-long text processing and strong reasoning capabilities have been widely used in intelligent Q&A, document processing and other fields. 06,4480 Large ModelOpen Source Project# Deeper Thinking