New MoE architecture! Ali open source Qwen3-Next, training costs straight down 90%!
The Large Language Model (LLM), is entering the Next Level. In the early morning of Friday, Ali Tongyi team officially released and open-sourced the next-generation base model architecture Qwen3-Next. 80B of total parameters of the model is only activated 3B, the performance can be comparable to the flagship version of the Qianqian 3 235B model...