LangChain An open source framework for building applications powered by large language models, providing modular components and toolchains to support the entire application lifecycle from development to production. 01,6750 Open Source Project# Large model
OmniGen A unified diffusion model for image generation that natively supports multiple image generation tasks with high flexibility and scalability. 01,6510 Open Source Project# Image Generation
MindSpore Huawei's full-scenario deep learning framework is designed to provide full-stack AI capabilities that are easy to develop and efficient to execute, supporting the complete process from data loading and model building to training, evaluation and deployment. 01,6430 Open Source Project# Deep Learning Framework
GraphRAG Microsoft's open-source retrieval-augmented generation (RAG) system, based on knowledge graphs and graph machine learning techniques, designed to improve the understanding and reasoning of large language models when working with private data. 01,6210 Open Source Project# Large model
HunyuanWorld-Voyager Tencent introduced the industry's first open source world model that supports native 3D reconstruction and ultra-long roaming, allowing for rapid generation of interactive and immersive 3D scenes based on a single image or text. 01,6100 Open Source Project# Virtual Worlds
BLOOM A large open-source multilingual language model developed by over 1,000 researchers from more than 60 countries and 250 institutions. It has 176B parameters, was trained on the ROOTS corpus, and supports 46 natural languages and 13 programming languages, with the aim of advancing the research and use of large language models by academics and small companies. 01,6070 Open Source Project
kotaemon An open-source RAG chat application that lets users query and retrieve relevant information from their documents by chatting. 01,6000 Open Source Project# Chat application
Phi-3 A high-performance large language model from Microsoft, instruction-tuned and able to run across platforms, with excellent language comprehension and reasoning capabilities, especially suitable for multimodal application scenarios. 01,5780 Open Source Project
BERT A pre-trained language model developed by Google and based on the Transformer architecture. By learning bidirectional contextual information from large-scale text data, it provides a powerful foundation for a wide range of NLP tasks, with up to 340 million parameters (BERT-Large), and has achieved significant performance gains across multiple tasks. 01,5460 Open Source Project
Shortest An end-to-end testing framework based on natural language processing and AI technologies, which streamlines the testing process, increases testing efficiency, and lowers the barrier to writing tests. 01,4870 Open Source Project# AI testing framework
Tongyi Qianwen Qwen1.5 A large language model series launched by Alibaba, with parameter scales from 0.5B to 72B, supporting multilingual processing and long-text comprehension, and performing strongly on several benchmarks. 01,3990 Open Source Project
Paper2Any An AI tool developed by Peking University that automatically converts papers and text into editable PowerPoint presentations and structural diagrams. Supporting multimodal input, it addresses the challenges of scientific diagramming and of converting lengthy documents into reports. 01,3790 AI efficiency toolsAI document assistant# PPT generation# Document Generation
SongBloom An open-source song generation model jointly developed by Tencent AI Lab and other institutions, which turns 10 seconds of audio plus lyrics into 2 minutes 30 seconds of high-quality music, described as comparable to commercial standards. 01,3750 AI music compositionOpen Source Project# Song Generation
Tülu 3 405B Allen AI introduces a large open source AI model with 405 billion parameters that combines multiple LLM training methods to deliver superior performance and a wide range of application scenarios. 01,3300 Open Source Project# Ai2# Open Source Model
Seed-OSS ByteDance's open-source 36-billion-parameter long-context large language model, which supports a 512K-token context and a controllable thinking budget, excels in reasoning, code, and agent tasks, and is free for commercial use under the Apache-2.0 license. 01,3150 Large ModelOpen Source Project# Large model
Infographic Alibaba's open-source AI infographic engine uses declarative syntax + 197+ templates to generate professional charts with just one line of code, suitable for all scenarios including data visualization and news illustrations. 01,2960 AI efficiency toolsOpen Source Project# Infographic
SAM 3D Meta's open-source single-image 3D generation model, which generates high-fidelity, interactive 3D models from 2D photos in one click, covers objects, human bodies, and scenes, and helps industries such as e-commerce, AR/VR, and film and television reduce costs and improve efficiency. 01,2900 Open Source Project# 3D model# 3D generation
Gemma 3n A lightweight open-source large language model from Google that combines high performance with easy deployment, suitable for local development and multi-scenario applications. 01,2800 Large ModelOpen Source Project# Large Language Model
HunyuanImage2.1 Tencent's open-source text-to-image model, which natively generates 2K high-definition images, accurately parses complex semantics, and efficiently produces high-quality images with support for both Chinese and English. 01,1470 AI Video CreationLarge Model# graphical model
SeekDB OceanBase's open-source AI-native database, billed as the world's first, which focuses on multimodal hybrid search, minimal development effort, and strong security, redefining the way data and AI converge and helping developers build high-performance intelligent applications with a single click. 01,0640 AI data processingOpen Source Project# Database
Qwen-Image-Layered Alibaba's open-source AI image layering editor, which automatically separates an image into layers so content can be modified precisely without tedious manual masking. 01,0500 AI image processingOpen Source Project# Image Layering
SmartResume Alibaba's open-source high-precision resume parsing system, based on OCR and lightweight large models, which converts resumes in 12 formats such as PDF and images into structured data in seconds, with an accuracy rate of 93.1%. 09800 Open Source Project# Resume Analysis
Zen Browser An open-source desktop browser based on the Firefox engine, featuring vertical tabs, workspaces, and split-screen views, emphasizing privacy protection and a modern browsing experience focused on efficiency and concentration. 08760 AI efficiency toolsOpen Source Project# Browser
DeepSeek-Math-V2 Described as the world's first open-source mathematical reasoning model to reach gold-medal level at the International Mathematical Olympiad (IMO), it achieves rigorous reasoning and the ability to solve difficult mathematical problems through a self-verification framework. 08060 Large ModelOpen Source Project# Mathematical Reasoning
Voquill Open-source voice input tool supporting multiple languages and intelligent text optimization, boosting input efficiency by several times. It balances local privacy with cloud convenience, serving as a powerful assistant for productive professionals. 07440 AI Audio ProcessingOpen Source Project# Voice Input
Nemotron 3 NVIDIA's open-source AI model series, featuring Nano, Super, and Ultra variants, is specifically designed for intelligent agent applications, delivering high efficiency and precision. 07000 Large ModelOpen Source Project
TranslateGemma Google's open source lightweight multimodal translation model supports 55 languages and image translations, with performance that exceeds larger models, taking into account both mobile and cloud deployments, and facilitating efficient globalized communication. 06920 AI translationLarge Model# Digital Split
OpenClaw An open source AI agent framework for local file management, cross-tool automation, and lightweight development assistance via natural language commands, balancing privacy protection with low-code ease of use. 06210 AI assistantAI efficiency tools# AI Agent
SAM Audio Meta introduces the world's first unified multimodal audio separation model that supports text, visual, and time cues to accurately separate target sounds from complex audio and video. 05420 AI Sound SeparationOpen Source Project# Audio Separation
Voxtral TTS Mistral AI's open source, low-latency text-to-speech model, which supports cross-lingual voice cloning with latency as low as 70ms and can be deployed at the edge. 03580 AI speech generationOpen Source Project# Open Source# Text-to-speech