No launch event, no overwhelming publicity, on March 24th DeepSeek version V3-0324 went live quietly. https://huggingface.co/deepseek-ai/DeepSeek-V3-032...
Blackwell has not yet been delivered on a large scale, and NVIDIA has already laid out two generations of successors. On Tuesday, March 18 local time, NVIDIA CEO Jen-Hsun Huang delivered a keynote speech at the GTC25 conference, announcing the 2026-2027 data center GPU roadmap, Rubin and Ru...
March 16, Baidu officially released Wenxin Big Model 4.5 and Wenxin Big Model X1, which can be used free of charge on the official website of Wenxin Yiyin. According to reports, Wenxin Big Model 4.5 is Baidu's first native multimodal big model, and its multimodal comprehension, text and logical reasoning capabilities have been significantly improved in a number of measurement...
In recent times, the inference model DeepSeek-R1 has arguably been the number one topic in AI. Those who have used it know that the model outputs a piece of thought chain content before outputting the final answer. Doing so improves the accuracy of the final answer. Today's post will take you through the chain of thought...
The Gemma series of macromodels is a series of lightweight macromodels open-sourced by Google. Just a few moments ago (March 12, 2025), Google open-sourced the third generation of the Gemma series of macromodels, which contains a total of four different parameter scale versions.The third generation of the Gemma 3 series is multi...
At present, the application of big models in the field of government affairs has become an important hand of the government to improve the level of service, DeepSeek deployment and application in the field of government affairs around China is advancing at an unprecedented speed.DeepSeek series of models, by virtue of its advantages in terms of cost and performance, in the field of government services, public...
In the early morning of March 6, another sleepless night in the tech circle after DeepSeek, everyone was screened by a product called Manus.The AI circle was boiling, and the AI intelligent body plate soared.The release of Manus made Chinese AI technology shock the world once again. According to its team...
With the rapid development of artificial intelligence technology, large-scale language modeling (LLM) has become a key force driving this field forward. In order to better master and utilize LLM technology, it is particularly important to understand its core parameters. In this paper, we will take an in-depth look at three key parameters in large-scale language modeling: the Toke...
In today's era of rapid development of artificial intelligence, DeepSeek, as a leading AI model, has become the first choice of many enterprises due to its powerful functions and wide range of application areas. On the one hand, R1, V3 and other versions of the model, with the label of "performance comparable to GPT-4, cost only 10%", have pushed ...
"OpenAI is not Open, DeepSeek is Deep". This week, the "Open Source Week" activities in full swing, DeepSeek every day from time to time on the new "black technology", so that programmers around the world called out: this wave is simply in the atmosphere! From computing to communication to storage, De...
What is the hybrid Turbo S The hybrid Turbo S is a new generation of Tencent hybrid self-research fast thinking model, on February 27, 2025 officially released. The model is designed to solve the shortcomings of the slow thinking model in response speed, through technological innovation to realize the "second back" ability, doubling the speed of words....
On the evening of the 25th, Alibaba announced that it had fully open-sourced its video generation model, the Ten Thousand Phases 2.1 model, a move that triggered widespread concern among AI developers around the world. Tongyi Wanphase 2.1 model is based on the Apache 2.0 protocol, and opens up all the inference code and weights of the two parameter specifications 14B and 1.3B...
There are many options for individual developers or tasters who want to deploy DeepSeek locally, but when it comes to enterprise deployment, the steps will be much more cumbersome. If a simple fine-tuning of the model can fit our business needs, then using Ollama, LM S...
Microsoft Corp. recently officially unveiled Majorana 1, its first self-developed quantum computing chip, and Muse, a generative AI tool designed for video game scenario creation. Majorana 1: A Breakthrough in Quantum Computing Microsoft Corp. announced the launch of...
At noon on February 18, Musk's XAI held a Grok 3 launch event, which was watched by more than 1 million people online, and was praised by Musk as "the smartest AI on the planet". The launch demonstration showed that in mathematical reasoning, scientific logical reasoning and other aspects of performance performance, Grok3 and...
Article Summary - WeChat Accesses DeepSeek to Provide AI Search and Social Sharing Features. - WeChat users can use DeepSeek without downloading a new app. - WeChat and DeepSeek combine to deeply utilize WeChat ecological resources. - WeChat accesses DeepSe...
Core viewpoints: ・Algorithm efficiency improvement has not suppressed the demand for arithmetic power ・Artificial Intelligence applications to accelerate the landing ・Artificial Intelligence arithmetic development demand changes February 14 (Yan Yiyi) International Data Corporation (IDC) and Wave Information jointly released the "2025 China's Artificial Intelligence Computing Power Development Evaluation Report" ...
Introduction: ・Wenxin Big Model open source background: AI model development trend, technology accumulation and breakthroughs; ・Significance of open source and openness: promoting technological progress, lowering the threshold, and ecological construction; ・Contents of open source and openness: models and frameworks, open source protocols; ・Impacts of open source and openness: industry applications, international competitiveness...
Article Summary: DeepSeek Transforms AI Chain with Significant Low-Cost Advantages - 🚀 DeepSeek Launches Bigger Models with Better Performance at Less Than a Tenth of the Competitor's Costs - 🌐 Global tech giants are deploying the DeepSeek API, creating a wave of low-cost AI ...
On February 13, the AI industry ushered in a major turning point. Wenxin Yiyin and ChatGPT announced almost simultaneously that they would launch free services, a move that not only triggered widespread attention in the industry, but also marked a new stage in the AI market competition. Wenxin Yiyin: Completely free and in-depth search function...