R1-Omni Alibaba's open-source multimodal large language model uses RLVR technology to achieve emotion recognition and provide an interpretable reasoning process for multiple scenarios. 08170 Large ModelOpen Source Project# Multimodal# Emotion Recognition
Gemini 2.0 Flash Google introduced a new generation of AI models that support multimodal inputs and outputs and natively integrate intelligent tools to provide developers with powerful and flexible assistant functions. 03090 Large Model# Gemini 2.0 Flash# Google# Multimodal
Yan model Rockchip has developed the first non-Transformer architecture generalized natural language model with high performance, low cost, multimodal processing capability and private deployment security. 03020 Large Model# Multimodal
Congrong LM The multimodal large model independently developed by CloudScience has the ability of real-time learning, synchronous feedback, cross-modal interaction, etc. It is widely used in many industries such as finance, security, government affairs, etc., to promote the popularization and development of AI applications. 04270 Large Model# Multimodal
IFlytek Spark The large-scale language model with powerful semantic understanding and knowledge reasoning capabilities introduced by KU Xunfei is widely used in many fields such as enterprise services, intelligent hardware, and smart government. 01,0580 Large ModelHot Products# Code Capabilities# Multimodal# Math skills
Kling LM Racer's self-developed advanced video generation model supports the generation of high-quality videos based on text descriptions, helping users to efficiently create artistic video content. 05480 AI VideoDomestic Selection# Multimodal# Video Generation