HunyuanVideo-Avatar: Tencent Hunyuan's open source voice-driven digital human model. Upload an image and an audio clip to generate multi-style, highly dynamic personalized video. Tags: AI Digital Human, Open Source Project, #Digital Human
DeepSeek-Math-V2: The world's first open source large model for mathematical reasoning to reach gold-medal level at the International Mathematical Olympiad (IMO). It uses a self-verification framework to achieve rigorous reasoning and solve difficult mathematical problems. Tags: Large Model, Open Source Project, #Mathematical Reasoning
SeekDB: OceanBase's open source AI-native database, described as the world's first. It focuses on multimodal hybrid search, minimal development effort, and strong security, and aims to help developers build high-performance intelligent applications with one click. Tags: AI Data Processing, Open Source Project, #Database
SAM 3D: Meta's open source single-image 3D generation model. It generates high-fidelity, interactive 3D models from 2D photos in one click, covers objects, human bodies, and scenes, and helps e-commerce, AR/VR, film and television, and other industries cut costs and improve efficiency. Tags: Open Source Project, #3D Model, #3D Generation
SmartResume: Alibaba's open source high-precision resume parsing system based on OCR and lightweight large models. It converts resumes in 12 formats, such as PDF and images, into structured data in seconds, with an accuracy of 93.1%. Tags: Open Source Project, #Resume Parsing
PaddleOCR-VL: Baidu's lightweight multimodal document parsing model. With 0.9B parameters, it achieves accurate recognition and structured output of complex documents in 109 languages, with world-leading performance. Tags: AI Document Assistant, Open Source Project, #Document Parsing
SongBloom: An open source song generation model jointly developed by Tencent AI Lab and others. It turns 10 seconds of audio plus lyrics into 2 minutes 30 seconds of high-quality music, comparable to commercial standards. Tags: AI Music Composition, Open Source Project, #Song Generation
PromptEnhancer: Tencent's open source prompt enhancement framework for Chinese text-to-image generation. It optimizes user-input prompts to improve the image quality and semantic accuracy of the generation model. Tags: AI Assistant, Open Source Project, #Prompt Enhancement
HunyuanImage2.1: Tencent's open source text-to-image model. It natively supports 2K HD image generation, accurately parses complex semantics, and efficiently generates high-quality images that mix Chinese and English. Tags: AI Video Creation, Large Model, #Image Generation Model
HunyuanWorld-Voyager: Tencent's open source world model, described as the industry's first to support native 3D reconstruction and ultra-long-range roaming. It rapidly generates interactive, immersive 3D scenes from a single image or text. Tags: Open Source Project, #Virtual Worlds
Waver 1.0: An open source, full-featured video generation model that creates HD video from text or images efficiently, conveniently, and with outstanding quality. Tags: AI Video Creation, Open Source Project, #Video Generation
Seed-OSS: ByteDance's open source 36-billion-parameter long-context large language model. It supports 512K tokens and a controllable thinking budget, excels at reasoning, code, and agent tasks, and is free for commercial use under the Apache-2.0 license. Tags: Large Model, Open Source Project, #Large Model
KittenTTS: An open source lightweight text-to-speech model under 25 MB that runs in real time on ordinary CPUs, supports a variety of natural voices, and works offline. Tags: AI Speech Generation, Open Source Project, #TTS, #Video Generation
Qwen-Image: Alibaba Tongyi Qianwen's open source 20-billion-parameter image generation model. It specializes in high-fidelity Chinese and English text rendering and detailed complex scenes, and supports multi-style image generation. Tags: AI Image Generation, Open Source Project, #Image Generation
Qwen3-Coder: Alibaba's open source large code model. It supports end-to-end programming and complex task planning, with performance exceeding GPT-4.1 at lower cost. Tags: AI Programming, Open Source Project, #AI Programming
FLUX.1-Kontext: A multimodal model that supports text-to-image generation and image editing, with strong contextual understanding and creation capabilities. Tags: AI Image Processing, AI Image Generation, #Image Generation, #Image Editing
Gemma 3n: Google's lightweight open source large language model, combining high performance with easy deployment. It is suited to local development and multi-scenario applications. Tags: Large Model, Open Source Project, #Large Language Model
Xiaomi MiMo: Xiaomi's open source 7-billion-parameter large reasoning model, which outperforms models such as OpenAI o1-mini by a small margin in mathematical reasoning and code competitions. Tags: Large Model, Open Source Project, #Reasoning Model
SkyReels-V2: An unlimited-duration film generation model from the Kunlun Wanwei team. It breaks through the bottlenecks of existing video generation technology to achieve high-quality, high-consistency, high-fidelity video creation. Tags: AI Video Creation, Open Source Project, #Video Generation Model
Krillin AI: An AI video subtitle translation and dubbing tool that supports multi-language input and translation, providing a one-stop workflow from video acquisition to subtitle translation and dubbing. Tags: AI Translation, AI Video Applications
BabelDOC: An open source AI translation tool that supports bilingual side-by-side display, multi-engine translation, format preservation, and batch processing, helping researchers read foreign-language literature efficiently. Tags: AI Translation, Open Source Project, #Translation Tool
ChatAnyone: A real-time portrait video generation tool developed by Alibaba DAMO Academy. It uses a hierarchical motion diffusion model to generate highly realistic, style-controllable portrait video efficiently in real time, and is suited to video chat, virtual streaming, and digital entertainment. Tags: AI Digital Human, Open Source Project
Vibe Draw: An open source AI-assisted drawing tool that converts hand-drawn sketches and text descriptions into 3D models, supporting real-time collaboration and creative expression. Tags: AI Teamwork, Open Source Project, #Whiteboard Tool
Hunyuan T1: Tencent's self-developed deep thinking model, with fast response, ultra-long text processing, and strong reasoning capabilities. It is widely used in intelligent Q&A, document processing, and other fields. Tags: Large Model, Open Source Project, #Deep Thinking
AlphaDrive: An autonomous driving framework that combines vision-language models with reinforcement learning, offering strong planning reasoning and multimodal planning capabilities for handling complex and rare traffic scenarios. Tags: Open Source Project, #Autonomous Driving
Chitu: An open source large model inference engine jointly launched by a Tsinghua University team and Qingcheng Jizhi. It aims to achieve efficient model inference across chip architectures through low-level technical innovation and to promote the widespread application of AI. Tags: Large Model, Open Source Project, #Large Model
MIDI: An AI 3D scene generation tool that efficiently generates complete 3D environments containing multiple objects from a single image. It is widely used in VR/AR, game development, film and television production, and other fields. Tags: AI Image Generation, Open Source Project, #3D Scene Generation
Open-Sora 2.0: A new open source video generation model from Luchen Technology, offering high performance at low cost and moving open source video generation into a new stage. Tags: AI Video Creation, Open Source Project, #Video Generation
R1-Omni: Alibaba's open source multimodal large language model, which uses RLVR to perform emotion recognition and provides an interpretable reasoning process across multiple scenarios. Tags: Large Model, Open Source Project, #Multimodal, #Emotion Recognition
OpenManus: An open source AI agent framework that supports local deployment and multi-agent collaboration to complete complex tasks efficiently. Tags: AI Assistant, Open Source Project, #AI Agent