
SenseNovaLarge ModelIt is a comprehensive big modeling system launched by ShangTech.
Background & Release:
- Day by day new SenseNova is a big modeling system announced by ShangTech in April 2023 by Chairman and CEO Xu Li.
- The system was approved to go live in August 2023, marking its official availability to the public.
- On May 29th, 2024, ShangTech announced that its "DayDayNew" Big Model will soon undergo a major upgrade, and officially released the DayDayNew Big Model 5.0 Cantonese version to the public.
Main Functions and Features::
- natural language processing (NLP): Automatically transforms data into meaningful analytics and visualization results through a combination of code generation and automated execution through capabilities such as natural language generation, intent recognition, logic understanding and code interpreters.
- Vincentian graphic ability (geology):: Includesdigital personThe video generation platform "SenseAvatar" (SenseAvatar) and other features can provide users with rich visual content generation services.
- Model Development Functions: Support users to develop models according to their needs and provide customized AI solutions.
Technical characteristics::
- Hybrid Expert Architecture (MoE): Nisshin SenseNova 5.0 utilizes the MOE Hybrid Expert Architecture, an architecture that allows the model to complete inference with a small number of parameters activated, improving the model's processing efficiency and responsiveness.
- Volume of training data: Based on more than 10TB tokens of training data, it ensures that the model has a robust knowledge base and a wide range of applications.
- inference context window: Reaching around 200K allows the model to handle longer text sequences and more complex contextual relationships.
Performance benchmarking::
Rizhixin SenseNova 5.0 fully benchmarks the GPT-4 Turbo in terms of comprehensive performance and meets or exceeds the GPT-4 Turbo in mainstream objective reviews, especially in terms of natural language capability, text-to-graph capability, multimodal and data analysis capability.
application scenario::
Risen SenseNova big models have been widely used in many fields such as finance, healthcare, education, etc., such as intelligent customer service, intelligent marketing, investment research analysis, research report writing, medical and healthcare language big models, etc., which provide powerful AI support for the industry.
language version::
In addition to the standard version, ShangTech has also released the Day Day New Big Model 5.0 Cantonese version, which further extends the model's language support capabilities.
Rizhixin SenseNova Big Model is a comprehensive big model system launched by ShangTech, with powerful natural language processing, text-born graph capabilities and model development functions. Through advanced hybrid expert architecture and a large amount of training data, it realizes performance comparable to GPT-4 Turbo, and is widely used in a variety of fields, providing users with efficient and flexible AI services.
data statistics
Relevant Navigation

Vivo's self-developed generalized big model matrix contains several self-developed big models covering core scenarios, providing intelligent assistance, dialog bots, and other functions with powerful language understanding and generation capabilities.

Yan model
Rockchip has developed the first non-Transformer architecture generalized natural language model with high performance, low cost, multimodal processing capability and private deployment security.

Kling LM
Racer's self-developed advanced video generation model supports the generation of high-quality videos based on text descriptions, helping users to efficiently create artistic video content.

EmaFusion
Ema introduces a hybrid expert modeling system that dynamically combines multiple models to accomplish enterprise-class AI tasks at low cost and high accuracy.

Pangu LM
Huawei has developed an industry-leading, ultra-large-scale pre-trained model with powerful natural language processing, visual processing, and multimodal capabilities that can be widely used in multiple industry scenarios.

DeepSeek-VL2
Developed by the DeepSeek team, it is an efficient visual language model based on a hybrid expert architecture with powerful multimodal understanding and processing capabilities.

Congrong LM
The multimodal large model independently developed by CloudScience has the ability of real-time learning, synchronous feedback, cross-modal interaction, etc. It is widely used in many industries such as finance, security, government affairs, etc., to promote the popularization and development of AI applications.

Ovis2
Alibaba's open source multimodal large language model with powerful visual understanding, OCR, video processing and reasoning capabilities, supporting multiple scale versions.
No comments...