
The SenseNova large model is a comprehensive large model system launched by SenseTime.
Background & Release:
- SenseNova (Chinese name Ririxin, "new day by day") is a large model system announced in April 2023 by SenseTime Chairman and CEO Xu Li.
- The system was approved to go live in August 2023, marking its official availability to the public.
- On May 29, 2024, SenseTime announced that the SenseNova large model would soon receive a major upgrade, and officially released the Cantonese version of SenseNova 5.0 to the public.
Main Functions and Features:
- Natural language processing (NLP): Capabilities such as natural language generation, intent recognition, logical understanding, and a code interpreter combine code generation with automated execution to turn data into meaningful analysis and visualization results.
- Text-to-image capability: Includes the digital human video generation platform "SenseAvatar" and other features that provide users with rich visual content generation services.
- Model development: Lets users develop models according to their needs and provides customized AI solutions.
Technical Characteristics:
- Mixture-of-Experts (MoE) architecture: SenseNova 5.0 uses an MoE architecture, which lets the model complete inference with only a small fraction of its parameters activated, improving processing efficiency and response speed.
- Training data volume: The model is trained on more than 10TB tokens of data, which gives it a broad knowledge base and a wide range of applications.
- Inference context window: The window reaches about 200K, which lets the model handle longer text sequences and more complex contextual relationships.
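The idea behind a Mixture-of-Experts layer (only a few experts run for each input) can be illustrated with a minimal, generic top-k routing sketch. This is not SenseNova's implementation: the dimensions, the number of experts, the random weights, and the names `make_expert` and `moe_forward` are all assumptions chosen for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(rng, dim):
    """Build a hypothetical expert: a small dim x dim linear map."""
    weights = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]

    def expert(x):
        return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

    return expert


def moe_forward(x, gate_weights, experts, top_k=2):
    """Route x to the top_k highest-scoring experts and mix their outputs."""
    # Router: one score per expert (a linear gate, assumed here).
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in gate_weights]
    # Keep only the top_k experts; the others are never evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Renormalize the gate scores over the chosen experts.
    probs = softmax([scores[i] for i in chosen])
    output = [0.0] * len(x)
    for p, i in zip(probs, chosen):
        y = experts[i](x)
        output = [o + p * yi for o, yi in zip(output, y)]
    return output, chosen


rng = random.Random(0)
DIM, NUM_EXPERTS = 4, 8
experts = [make_expert(rng, DIM) for _ in range(NUM_EXPERTS)]
gate_weights = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

output, chosen = moe_forward([0.5, -1.0, 0.25, 2.0], gate_weights, experts, top_k=2)
print(f"experts used: {sorted(chosen)} of {NUM_EXPERTS}")
```

With `top_k=2` and 8 experts, only a quarter of the expert parameters are touched for this input, which is the source of the efficiency gain described above. Production MoE models typically add load-balancing terms and batched routing that this sketch omits.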
Performance Benchmarking:
SenseNova 5.0 is benchmarked against GPT-4 Turbo in overall performance and matches or exceeds GPT-4 Turbo in mainstream objective evaluations, particularly in natural language, text-to-image, multimodal, and data analysis capabilities.
Application Scenarios:
SenseNova large models are widely used in fields such as finance, healthcare, and education, in applications such as intelligent customer service, intelligent marketing, investment research analysis, research report writing, and medical and healthcare large language models, providing AI support for these industries.
Language Versions:
In addition to the standard version, SenseTime has also released a Cantonese version of SenseNova 5.0, which further extends the model's language support.
The SenseNova large model is a comprehensive large model system launched by SenseTime, with strong natural language processing, text-to-image, and model development capabilities. Using a Mixture-of-Experts architecture and a large volume of training data, it achieves performance comparable to GPT-4 Turbo and is widely used in a variety of fields, providing users with efficient and flexible AI services.
Relevant Navigation

Developed by the DeepSeek team, it is an efficient vision-language model based on a Mixture-of-Experts architecture, with strong multimodal understanding and processing capabilities.

Guangyu LM
An innovative large model that combines large language models with symbolic reasoning, designed to improve the credibility and accuracy of applications in finance, healthcare, and other fields.

Xiaomi MiMo
Xiaomi's open-source 7-billion-parameter reasoning model, which slightly outperforms models such as OpenAI o1-mini in mathematical reasoning and code competitions.

Zidong Taichu
A cross-modal general artificial intelligence platform developed by the Institute of Automation of the Chinese Academy of Sciences. It includes the world's first tri-modal pre-training model covering images, text, and audio, with cross-modal understanding and generation capabilities. It supports AI applications across all scenarios and is described as a major step towards general artificial intelligence.

DeepSeek-V3
An efficient open-source language model launched by Hangzhou-based DeepSeek, with 671 billion parameters and a Mixture-of-Experts architecture, which excels at math, coding, and multilingual tasks.

360Brain
A comprehensive large model independently developed by 360. It integrates multimodal technology and offers strong content generation, logical reasoning, and other capabilities, providing enterprises with a full range of AI services.

GraphRAG
Microsoft's open-source retrieval-augmented generation approach based on knowledge graphs and graph machine learning techniques, designed to improve the understanding and reasoning of large language models when working with private data.

SKYMEDIA
Developed by Wondershare (Wanxing Technology), this is China's first vertical large model for audio and video multimedia creation. It integrates video, audio, image, and language processing capabilities to provide AI creation support for the digital creative field.