Open Source Project

Total 106 articles 网址

Hot Products Domestic Selection Overseas Selection Category Recommendation Industrial Integration Courses of Study Open Source Project Large Model Large model evaluation AI Company Selection Latest Collections

Sorting

release update Views Like

GPT-SoVITS

Open source sound cloning tool focused on enabling high quality, cross-language sound (especially singing) conversion.

05,9530

Open Source Project # sound clone

Xiaomi MiMo

Xiaomi's open-sourced 7 billion parameter inference macromodel, which outperforms models such as OpenAI o1-mini in mathematical reasoning and code competitions by a small margin.

05,1030

Large Model Open Source Project # Reasoning Model

Toonflow

An open-source AI short-form drama production platform—a fully automated pipeline that takes a novel and turns it into a finished video. It features skeleton binding to prevent face distortion, produces an episode in 8 minutes, and keeps costs down to just over ten yuan.

03,6000

AI Video Creation Open Source Project # Short Drama Creation

HunyuanVideo-Avatar

Tencent hybrid open source voice digital human model, upload pictures and audio that generate multi-style, highly dynamic personalized dynamic video.

03,5611

AI Digital Person Open Source Project # Digital People

PromptEnhancer

Tencent's open source Chinese text-to-image prompt word enhancement framework that optimizes user-input prompts and improves the image quality and semantic accuracy of the generated model.

03,4600

AI assistant Open Source Project # Cue word enhancement

KittenTTS

An open source lightweight text-to-speech model that is less than 25 MB and can run in real time on ordinary CPUs, supports a variety of natural tones and can be used offline.

03,3730

AI speech generation Open Source Project # TTS # Video Generation

SkyReels-V2

The unlimited duration movie generation model introduced by KunlunWanwei team breaks through the bottleneck of the existing video generation technology and realizes high-quality, high-consistency and high-fidelity video creation.

03,1690

AI Video Creation Open Source Project # Video Generation Model

FaceFusion

AI face swap open source project that uses deep learning techniques to achieve high quality face replacement and image processing .

03,1630

Open Source Project # AI face swap

Qwen3-Coder

Ali open source code big model, support full-flow programming and complex task planning, performance over GPT-4.1, lower cost.

02,8280

AI programming Open Source Project # AI Programming

PaddleOCR-VL

Baidu's lightweight multimodal document parsing model, with 0.9B parameters, achieves accurate recognition and structured output of complex documents in 109 languages, with world-leading performance.

02,7590

AI document assistant Open Source Project # Document Analysis

Waver 1.0

Waver 1.0 is an open source full-featured video generation model that makes it easy to create text/images to HD video with efficiency, convenience and outstanding quality.

02,7370

AI Video Creation Open Source Project # Video Generation

Dify AI

A next-generation large-scale language modeling application development framework for easily building and operating generative AI native applications.

02,6080

Open Source Project # Application Development Framework

Deep-Live-Cam

Python-based open source AI real-time face replacement tool that supports millisecond face replacement effects and can be used in a variety of fields such as entertainment, art creation and education.

02,5650

Open Source Project # Real-time face change

OmAgent

Device-oriented open-source smart body framework designed to simplify the development of multimodal smart bodies and provide enhancements for various types of hardware devices.

02,5590

Open Source Project # Smart Body Frame

Kolors

Racer has open-sourced a text-to-image generation model called Kolors (Kotu), which has a deep understanding of English and Chinese and is capable of generating high-quality, photorealistic images.

02,4810

Open Source Project

Paper2Any

An AI tool developed by Peking University can automatically convert papers and text into editable PowerPoint presentations and structural diagrams. Supporting multimodal input, it efficiently addresses the challenges of scientific diagramming and converting lengthy documents into reports.

02,4600

AI efficiency tools AI document assistant # PPT generation # Document Generation

ChatTTS

An open source text-to-speech model optimized for conversational scenarios, capable of generating high-quality, natural and smooth conversational speech.

02,4010

AI audio Open Source Project # Conversational TTS

Gemma

Google's lightweight, state-of-the-art open-source models, including Gemma 2B and Gemma 7B scales, each available in pre-trained and instruction-fine-tuned versions, are designed to support developer innovation, foster collaboration, and lead to responsible use of the models through their powerful language understanding and generation capabilities.

02,3660

Open Source Project # Gemma # Open Source

Qwen-Image

Ali Tongyi Thousand Questions open source 20 billion parameter image generation model , specializing in Chinese and English high fidelity text rendering and complex scene detail processing , support for multi-style image generation .

02,3610

AI image generation Open Source Project # Image Generation

Meta Llama 3

Meta's high-performance open-source large language model, with powerful multilingual processing capabilities and a wide range of application prospects, especially in the conversation class of applications excel.

02,3520

Open Source Project # Open Source Large Model

InternLM

Shanghai AI Lab leads the launch of a comprehensive big model research and development platform, providing an efficient tool chain and rich application scenarios to support multimodal data processing and analysis.

02,3240

Open Source Project # InternLM # Scholar # Large model

ChatGLM-6B

An open source generative language model developed by Tsinghua University, designed for Chinese chat and dialog tasks, demonstrating powerful Chinese natural language processing capabilities.

02,2980

Open Source Project # ChatGLM-6B # Large model # Open Source

OpenHands

Open source software development agent platform designed to improve developer efficiency and productivity through features such as intelligent task execution and code optimization.

02,2830

Open Source Project # Development Agent

Mistral 7B

A powerful large-scale language model with about 7.3 billion parameters, developed by Mistral.AI, demonstrates excellent multilingual processing power and reasoning performance.

02,1480

Open Source Project # Mistral 7B # Mistral.AI # Open Source

SongBloom

Tencent AI Lab and other joint research and development of open source song generation model, 10 seconds of audio + lyrics into 2 minutes 30 seconds of high-quality music, comparable to commercial standards.

02,1460

AI music composition Open Source Project # Song Generation

Grok-1

xAI released an open source large language model based on hybrid expert system technology with 314 billion parameters designed to provide powerful language understanding and generation capabilities to help humans acquire knowledge and information.

02,1430

Open Source Project # Open Source

Mistral Small 3

Mistral Small 3

Open source AI model with 24 billion parameters featuring low-latency optimization and imperative task fine-tuning for conversational AI, low-latency automation, and domain-specific expertise applications.

02,1350

Open Source Project # Mistral.AI # Small 3

TeleChat

The 7 billion parameter semantic grand model based on the Transformer architecture launched by China Telecom has powerful natural language understanding and generation capabilities, and is applicable to multiple AI application scenarios such as intelligent dialog and text generation.

02,1320

Open Source Project # TeleChat # Open Source

MetaGPT

Multi-intelligent body collaboration open source framework, through the simulation of software company operation process, to achieve efficient collaboration and automation of GPT model in complex tasks.

02,1120

Open Source Project # Multi-intelligence

Skywork-13B

Developed by Kunlun World Wide Web, the open source big model, with 13 billion parameters and 3.2 trillion high-quality multi-language training data, has demonstrated excellent natural language processing capabilities in Chinese and other languages, especially in the Chinese environment, and is applicable to a number of domains.

02,0990

Open Source Project # Skywork # Skywork-13B # Open Source