QwQ-32B Alibaba released a high-performance inference model with 32 billion parameters that excels in mathematics and programming for a wide range of application scenarios. 06,6990 Large ModelOpen Source Project# Reasoning Model# A Thousand Questions on Tongyi
SpeciesNet Google open-sourced a model that uses artificial intelligence technology to analyze camera trap photos to automatically identify animal species. 06,7490 Open Source Project# Image Recognition
CogView4 The open-source text-to-graphics model released by Wisdom Spectrum AI supports bilingual input, generates high-quality images and is the first to generate Chinese characters in the screen, which is widely used in advertising, short videos, art creation and other fields. 06,5440 AI image generationOpen Source Project# Image Generation
Wan2.1 Alibaba launched an efficient video generation model that can accurately simulate complex scenes and actions, support Chinese and English special effects, and lead a new era of AI video creation. 07,0130 AI Video CreationLarge Model# Video Creation# Video Generation Model
FacePoke Open source real-time facial expression editing tool that allows users to adjust facial expressions and head orientation in static images in real time with simple operations. 06,8480 Open Source Project# Expression Editor
AingDesk Open source one-click deployment tool for AI models, which provides users with a convenient platform to run and share a variety of big AI models. 06,8760 AI assistantOpen Source Project# model deployment
Ovis2 Alibaba's open source multimodal large language model with powerful visual understanding, OCR, video processing and reasoning capabilities, supporting multiple scale versions. 07,2000 Large ModelOpen Source Project# Multimodal Large Model
SkyReels-V1 The open source video generation model of AI short drama creation by Kunlun World Wide has film and TV level character micro-expression performance generation and movie level light and shadow aesthetics, and supports text-generated video and graph-generated video, which brings a brand-new experience to the creation of AI short dramas. 07,0790 Open Source Project# Video Generation
OmniParser V2.0 Microsoft has introduced a Visual Agent parsing framework that transforms large language models into intelligences that can manipulate computers, enabling efficient automated interactions. 07,2230 AI assistantOpen Source Project# Agent parsing framework
Confucius-o1 NetEaseYouDao launched the first 14B lightweight model in China that supports step-by-step reasoning and explanation, designed for educational scenarios, which can help students efficiently understand complex math problems. 06,4520 Large ModelOpen Source Project# Reasoning Model# Netease Youtube
DeepClaude An open source AI application development platform that combines the strengths of DeepSeek R1 and the Claude model to provide high-performance, secure and configurable APIs for a wide range of scenarios such as smart chat, code generation, and inference tasks. 06,6600 Open Source Project# Application Development
Eino Eino is byte jumping open source, based on componentized design and graph orchestration engine of the large model application development framework. 07,1440 Open Source Project# Application Development Framework
InspireMusic Open source AIGC toolkit with integrated music generation, song generation, and audio generation capabilities. 06,7010 AI music compositionOpen Source Project# Music Generation
DeepSeek-VL2 Developed by the DeepSeek team, it is an efficient visual language model based on a hybrid expert architecture with powerful multimodal understanding and processing capabilities. 07,2760 Large ModelOpen Source Project# Visual Language Model
DeepSeek-V3 Hangzhou Depth Seeker has launched an efficient open source language model with 67.1 billion parameters, using a hybrid expert architecture that excels at handling math, coding and multilingual tasks. 06,7640 Large ModelOpen Source Project# DeepSeek# Open Source Large Model
s1 An AI model developed by Fei-Fei Li's team that achieves superior inference performance at a very low training cost. 06,9730 Large ModelOpen Source Project# Reasoning Model# Model Distillation
Tülu 3 405B Allen AI introduces a large open source AI model with 405 billion parameters that combines multiple LLM training methods to deliver superior performance and a wide range of application scenarios. 01,0160 Open Source Project# Ai2# Open Source Model
Mistral Small 3 Open source AI model with 24 billion parameters featuring low-latency optimization and imperative task fine-tuning for conversational AI, low-latency automation, and domain-specific expertise applications. 01,2430 Open Source Project# Mistral.AI# Small 3
Shortest An end-to-end testing framework based on natural language processing and AI technologies which streamlines the testing process, increases testing efficiency, and lowers the testing threshold. 01,0460 Open Source Project# AI testing framework
OmAgent Device-oriented open-source smart body framework designed to simplify the development of multimodal smart bodies and provide enhancements for various types of hardware devices. 01,4690 Open Source Project# Smart Body Frame
Dify AI A next-generation large-scale language modeling application development framework for easily building and operating generative AI native applications. 01,6190 Open Source Project# Application Development Framework
LiveTalking An open source digital human production platform designed to help users quickly create naturalistic digital human characters, dramatically reduce production costs and increase work efficiency. 02,6650 AI Digital PersonOpen Source Project# Open Source# Digital People
ChatTTS An open source text-to-speech model optimized for conversational scenarios, capable of generating high-quality, natural and smooth conversational speech. 01,3200 AI audioOpen Source Project# Conversational TTS
Deep-Live-Cam Python-based open source AI real-time face replacement tool that supports millisecond face replacement effects and can be used in a variety of fields such as entertainment, art creation and education. 01,5050 Open Source Project# Real-time face change
GraphRAG Microsoft's open-source retrieval-enhanced generative model based on knowledge graph and graph machine learning techniques is designed to improve the understanding and reasoning of large language models when working with private data. 01,1590 Open Source Project# Large model
OpenHands Open source software development agent platform designed to improve developer efficiency and productivity through features such as intelligent task execution and code optimization. 01,2760 Open Source Project# Development Agent
LangChain An open source framework for building large-scale language modeling application designs, providing modular components and toolchains to support the entire application lifecycle from development to production. 01,2180 Open Source Project# Large model
MetaGPT Multi-intelligent body collaboration open source framework, through the simulation of software company operation process, to achieve efficient collaboration and automation of GPT model in complex tasks. 01,2410 Open Source Project# Multi-intelligence
FaceFusion AI face swap open source project that uses deep learning techniques to achieve high quality face replacement and image processing . 01,8240 Open Source Project# AI face swap
GPT-SoVITS Open source sound cloning tool focused on enabling high quality, cross-language sound (especially singing) conversion. 02,7420 Open Source Project# sound clone