Open Source Project

Total 95 articles 网址

Hot Products Domestic Selection Overseas Selection Category Recommendation Industrial Integration Courses of Study Open Source Project Large Model Large model evaluation AI Company Selection Latest Collections

Sorting

release update Views Like

HunyuanVideo-Avatar

Tencent hybrid open source voice digital human model, upload pictures and audio that generate multi-style, highly dynamic personalized dynamic video.

02,9971

AI Digital Person Open Source Project # Digital People

DeepSeek-V4

The new generation of domestic open-source flagship big model has become one of the strongest all-around AIs on the ground with millions of ultra-long contexts, performance comparable to the top international closed-source models, and extreme cost-effectiveness.

0500

Large Model Open Source Project # DeepSeek

NVIDIA Ising

The world's first open-source quantum AI model series, through AI-driven quantum chip calibration and error correction, provides a high-performance tool chain for practical quantum computing and reshapes the quantum industry ecosystem.

02000

Large Model Open Source Project # Quantum Computing

HappyHorse

The 2026 open source AI video generation benchmark, with a single-stream Transformer architecture to achieve text/image to 1080p HD video generation at breakneck speeds, and native support for multi-language lip-synchronization and sound generation, topped the global performance list.

03380

AI Video Creation Open Source Project # Video Generation

pyVideoTrans

Open source and free AI video translation and dubbing tool, supporting multi-language speech recognition, subtitle translation and natural dubbing, helping content creators and enterprises to easily realize the globalization of video distribution.

03560

AI video processing Open Source Project # Video Translation

BettaFish

Open source AI public opinion tool, multi-agent collaboration to analyze the whole network data, can accurately insight into the trend, predict the direction, applicable to brand public relations, market research and other scenarios.

03210

AI assistant Open Source Project # Intelligent Body # Opinion Analysis

Voxtral TTS

Mistral AI introduces an open source, low-latency text-to-speech model that supports cross-language timbre cloning with latency as low as 70ms and can be deployed at the edge.

03580

AI speech generation Open Source Project # Open Source # Text-to-speech

OpenClaw

An open source AI intelligence framework for local file management, cross-tool automation, and lightweight development assistance via natural language commands, balancing privacy protection with low-code ease of use.

06180

AI assistant AI efficiency tools # Intelligent Body

TranslateGemma

Google's open source lightweight multimodal translation model supports 55 languages and image translations, with performance that exceeds larger models, taking into account both mobile and cloud deployments, and facilitating efficient globalized communication.

06920

AI translation Large Model # Digital Split

SAM Audio

Meta introduces the world's first unified multimodal audio separation model that supports text, visual, and time cues to accurately separate target sounds from complex audio and video.

05420

AI Sound Separation Open Source Project # Audio Separation

Paper2Any

An AI tool developed by Peking University can automatically convert papers and text into editable PowerPoint presentations and structural diagrams. Supporting multimodal input, it efficiently addresses the challenges of scientific diagramming and converting lengthy documents into reports.

01,3570

AI efficiency tools AI document assistant # PPT generation # Document Generation

Voquill

Open-source voice input tool supporting multiple languages and intelligent text optimization, boosting input efficiency by several times. It balances local privacy with cloud convenience, serving as a powerful assistant for productive professionals.

07400

AI Audio Processing Open Source Project # Voice Input

Zen Browser

An open-source desktop browser based on the Firefox engine, featuring vertical tabs, workspaces, and split-screen views, emphasizing privacy protection and a modern browsing experience focused on efficiency and concentration.

08740

AI efficiency tools Open Source Project # Browser

Infographic

Alibaba's open-source AI infographic engine uses declarative syntax + 197+ templates to generate professional charts with just one line of code, suitable for all scenarios including data visualization and news illustrations.

01,2960

AI efficiency tools Open Source Project # Infographic

Qwen-Image-Layered

Qwen-Image-Layered

Alibaba's open-source AI image layering editor—automatically separates layers, precisely modifies content, no need for tedious masking, delivering efficient and professional results!

01,0490

AI image processing Open Source Project # Image Layering

Nemotron 3

NVIDIA's open-source AI model series, featuring Nano, Super, and Ultra variants, is specifically designed for intelligent agent applications, delivering high efficiency and precision.

06990

Large Model Open Source Project

DeepSeek-Math-V2

DeepSeek-Math-V2

The world's first large model of mathematical reasoning in open source form to reach the gold medal level of the International Mathematical Olympiad (IMO), realizing the rigor of reasoning and the ability to solve difficult mathematical problems through a self-verification framework.

08040

Large Model Open Source Project # Mathematical Reasoning

SeekDB

OceanBase is the world's first open source AI-native database, which focuses on multimodal hybrid search, minimal development and extreme security, redefining the way data and AI converge and helping developers build high-performance intelligent applications with a single click.

01,0630

AI data processing Open Source Project # Database

SAM 3D

Meta open source revolutionary single-image 3D generation model, support one-click from 2D photos to generate high-fidelity, interactive 3D models, covering the object/human body scene, empowering e-commerce, AR/VR, film and television, and other multi-industry cost reduction and efficiency.

01,2900

Open Source Project # 3D model # 3D generation

SmartResume

Ali open source SmartResume is a high-precision resume parsing system based on OCR and lightweight large models, which can convert 12 formats of resumes such as PDF/pictures into structured data in seconds, with an accuracy rate of 93.1%.

09790

Open Source Project # Resume Analysis

PaddleOCR-VL

Baidu's lightweight multimodal document parsing model, with 0.9B parameters, achieves accurate recognition and structured output of complex documents in 109 languages, with world-leading performance.

01,7850

AI document assistant Open Source Project # Document Analysis

SongBloom

Tencent AI Lab and other joint research and development of open source song generation model, 10 seconds of audio + lyrics into 2 minutes 30 seconds of high-quality music, comparable to commercial standards.

01,3740

AI music composition Open Source Project # Song Generation

PromptEnhancer

Tencent's open source Chinese text-to-image prompt word enhancement framework that optimizes user-input prompts and improves the image quality and semantic accuracy of the generated model.

02,2480

AI assistant Open Source Project # Cue word enhancement

HunyuanImage2.1

HunyuanImage2.1

Tencent launched the open source raw image model, which natively supports 2K HD raw images, accurately parses complex semantics, and can efficiently generate high-quality images with Chinese and English fusion.

01,1440

AI Video Creation Large Model # graphical model

HunyuanWorld-Voyager

HunyuanWorld-Voyager

Tencent introduced the industry's first open source world model that supports native 3D reconstruction and ultra-long roaming, allowing for rapid generation of interactive and immersive 3D scenes based on a single image or text.

01,6100

Open Source Project # Virtual Worlds

Waver 1.0

Waver 1.0 is an open source full-featured video generation model that makes it easy to create text/images to HD video with efficiency, convenience and outstanding quality.

02,2490

AI Video Creation Open Source Project # Video Generation

Seed-OSS

ByteDance's open-source 36 billion parameter-long contextual big language model supports 512K tokens, a controlled mind budget, excels in inference, code and agent tasks, and is freely commercially available under the Apache-2.0 license.

01,3150

Large Model Open Source Project # Large model

KittenTTS

An open source lightweight text-to-speech model that is less than 25 MB and can run in real time on ordinary CPUs, supports a variety of natural tones and can be used offline.

02,9060

AI speech generation Open Source Project # TTS # Video Generation

Qwen-Image

Ali Tongyi Thousand Questions open source 20 billion parameter image generation model , specializing in Chinese and English high fidelity text rendering and complex scene detail processing , support for multi-style image generation .

01,8270

AI image generation Open Source Project # Image Generation

Qwen3-Coder

Ali open source code big model, support full-flow programming and complex task planning, performance over GPT-4.1, lower cost.

02,4140

AI programming Open Source Project # AI Programming