R1-Omni: Alibaba's open-source multimodal large language model, which uses RLVR (reinforcement learning with verifiable rewards) for emotion recognition and provides an interpretable reasoning process across multiple scenarios. Large Model, Open Source Project #Multimodal #Emotion Recognition
Gemini 2.0 Flash: Google's new-generation AI model, which supports multimodal input and output and natively integrates tool use, giving developers powerful and flexible assistant capabilities. Large Model #Gemini 2.0 Flash #Google #Multimodal
Yan model: RockAI's general-purpose natural language model, described as the first built on a non-Transformer architecture, offering high performance, low cost, multimodal processing, and secure private deployment. Large Model #Multimodal
Congrong LM: CloudWalk's independently developed multimodal large model, featuring real-time learning, synchronous feedback, and cross-modal interaction. It is used in industries such as finance, security, and government affairs, with the aim of broadening the adoption of AI applications. Large Model #Multimodal
iFlytek Spark: iFlytek's large language model with strong semantic understanding and knowledge reasoning, widely used in enterprise services, intelligent hardware, and smart government. Large Model, Hot Products #Code Capabilities #Multimodal #Math Skills
Kling LM: Kuaishou's self-developed video generation model, which generates high-quality videos from text descriptions, helping users create artistic video content efficiently. AI Video, Domestic Selection #Multimodal #Video Generation