LiveTalking An open source digital human production platform designed to help users quickly create naturalistic digital human characters, dramatically reduce production costs and increase work efficiency. 03,7160 AI Digital PersonOpen Source Project# Open Source# Digital People
Xiaomi MiMo Xiaomi's open-sourced 7 billion parameter inference macromodel, which outperforms models such as OpenAI o1-mini in mathematical reasoning and code competitions by a small margin. 03,4890 Large ModelOpen Source Project# Reasoning Model
HunyuanVideo-Avatar Tencent hybrid open source voice digital human model, upload pictures and audio that generate multi-style, highly dynamic personalized dynamic video. 02,9971 AI Digital PersonOpen Source Project# Digital People
KittenTTS An open source lightweight text-to-speech model that is less than 25 MB and can run in real time on ordinary CPUs, supports a variety of natural tones and can be used offline. 02,9060 AI speech generationOpen Source Project# TTS# Video Generation
SkyReels-V2 The unlimited duration movie generation model introduced by KunlunWanwei team breaks through the bottleneck of the existing video generation technology and realizes high-quality, high-consistency and high-fidelity video creation. 02,6290 AI Video CreationOpen Source Project# Video Generation Model
FaceFusion AI face swap open source project that uses deep learning techniques to achieve high quality face replacement and image processing . 02,5730 Open Source Project# AI face swap
Qwen3-Coder Ali open source code big model, support full-flow programming and complex task planning, performance over GPT-4.1, lower cost. 02,4140 AI programmingOpen Source Project# AI Programming
Waver 1.0 Waver 1.0 is an open source full-featured video generation model that makes it easy to create text/images to HD video with efficiency, convenience and outstanding quality. 02,2490 AI Video CreationOpen Source Project# Video Generation
PromptEnhancer Tencent's open source Chinese text-to-image prompt word enhancement framework that optimizes user-input prompts and improves the image quality and semantic accuracy of the generated model. 02,2480 AI assistantOpen Source Project# Cue word enhancement
Dify AI A next-generation large-scale language modeling application development framework for easily building and operating generative AI native applications. 02,1950 Open Source Project# Application Development Framework
Deep-Live-Cam Python-based open source AI real-time face replacement tool that supports millisecond face replacement effects and can be used in a variety of fields such as entertainment, art creation and education. 02,1230 Open Source Project# Real-time face change
Kolors Racer has open-sourced a text-to-image generation model called Kolors (Kotu), which has a deep understanding of English and Chinese and is capable of generating high-quality, photorealistic images. 02,1090 Open Source Project
OmAgent Device-oriented open-source smart body framework designed to simplify the development of multimodal smart bodies and provide enhancements for various types of hardware devices. 02,0820 Open Source Project# Smart Body Frame
Gemma Google's lightweight, state-of-the-art open-source models, including Gemma 2B and Gemma 7B scales, each available in pre-trained and instruction-fine-tuned versions, are designed to support developer innovation, foster collaboration, and lead to responsible use of the models through their powerful language understanding and generation capabilities. 01,9890 Open Source Project# Gemma# Open Source
ChatGLM-6B An open source generative language model developed by Tsinghua University, designed for Chinese chat and dialog tasks, demonstrating powerful Chinese natural language processing capabilities. 01,9140 Open Source Project# ChatGLM-6B# Large model# Open Source
OpenHands Open source software development agent platform designed to improve developer efficiency and productivity through features such as intelligent task execution and code optimization. 01,9070 Open Source Project# Development Agent
Meta Llama 3 Meta's high-performance open-source large language model, with powerful multilingual processing capabilities and a wide range of application prospects, especially in the conversation class of applications excel. 01,9000 Open Source Project# Open Source Large Model
ChatTTS An open source text-to-speech model optimized for conversational scenarios, capable of generating high-quality, natural and smooth conversational speech. 01,8990 AI audioOpen Source Project# Conversational TTS
InternLM Shanghai AI Lab leads the launch of a comprehensive big model research and development platform, providing an efficient tool chain and rich application scenarios to support multimodal data processing and analysis. 01,8680 Open Source Project# InternLM# Scholar# Large model
Qwen-Image Ali Tongyi Thousand Questions open source 20 billion parameter image generation model , specializing in Chinese and English high fidelity text rendering and complex scene detail processing , support for multi-style image generation . 01,8270 AI image generationOpen Source Project# Image Generation
PaddleOCR-VL Baidu's lightweight multimodal document parsing model, with 0.9B parameters, achieves accurate recognition and structured output of complex documents in 109 languages, with world-leading performance. 01,7850 AI document assistantOpen Source Project# Document Analysis
Grok-1 xAI released an open source large language model based on hybrid expert system technology with 314 billion parameters designed to provide powerful language understanding and generation capabilities to help humans acquire knowledge and information. 01,7810 Open Source Project# Open Source
MetaGPT Multi-intelligent body collaboration open source framework, through the simulation of software company operation process, to achieve efficient collaboration and automation of GPT model in complex tasks. 01,7650 Open Source Project# Multi-intelligence
TeleChat The 7 billion parameter semantic grand model based on the Transformer architecture launched by China Telecom has powerful natural language understanding and generation capabilities, and is applicable to multiple AI application scenarios such as intelligent dialog and text generation. 01,7650 Open Source Project# TeleChat# Open Source
Laminar An open source AI engineering optimization platform focused on AI engineering from first principles. It helps users collect, understand and use data to improve the quality of LLM (Large Language Model) applications. 01,7580 Open Source Project# Analysis Tool
Skywork-13B Developed by Kunlun World Wide Web, the open source big model, with 13 billion parameters and 3.2 trillion high-quality multi-language training data, has demonstrated excellent natural language processing capabilities in Chinese and other languages, especially in the Chinese environment, and is applicable to a number of domains. 01,7580 Open Source Project# Skywork# Skywork-13B# Open Source
Mistral Small 3 Open source AI model with 24 billion parameters featuring low-latency optimization and imperative task fine-tuning for conversational AI, low-latency automation, and domain-specific expertise applications. 01,7290 Open Source Project# Mistral.AI# Small 3
Mistral 7B A powerful large-scale language model with about 7.3 billion parameters, developed by Mistral.AI, demonstrates excellent multilingual processing power and reasoning performance. 01,7180 Open Source Project# Mistral 7B# Mistral.AI# Open Source
AutoGPT Based on the GPT-4 open-source project, integrating Internet search, memory management, text generation and file storage, etc., it aims to provide a powerful digital assistant to simplify the process of user interaction with the language model. 01,7160 Open Source Project# Digital Assistant
Emu3 Beijing Zhiyuan Artificial Intelligence Research Institute launched a large model containing several series with large-scale, high-precision, emergent and universal characteristics, and has been fully open-sourced. 01,6860 Open Source Project# Open Source Large Model