GraphRAG Microsoft's open-source retrieval-enhanced generative model based on knowledge graph and graph machine learning techniques is designed to improve the understanding and reasoning of large language models when working with private data. 01,1610 Open Source Project# Large model
BERT Developed by Google, the pre-trained language model based on the Transformer architecture provides a powerful foundation for a wide range of NLP tasks by learning bi-directional contextual information on large-scale textual data with up to tens of billions of parameters, and has achieved significant performance gains across multiple tasks. 01,1380 Open Source Project
BLOOM A large open-source multilingual language model developed by over 1,000 researchers from more than 60 countries and 250 institutions, with 176B parameters and trained on the ROOTS corpus, supporting 46 natural languages and 13 programming languages, aims to advance the research and use of large-scale language models by academics and small companies. 01,1290 Open Source Project
Phi-3 A high-performance large-scale language model from Microsoft, tuned with instructions to support cross-platform operation, with excellent language comprehension and reasoning capabilities, especially suitable for multimodal application scenarios. 01,1280 Open Source Project
kotaemon RAG Open source chat application tool that allows users to query and access relevant information in documents by chatting. 01,1150 Open Source Project# Chat application
Shortest An end-to-end testing framework based on natural language processing and AI technologies which streamlines the testing process, increases testing efficiency, and lowers the testing threshold. 01,0470 Open Source Project# AI testing framework
Tülu 3 405B Allen AI introduces a large open source AI model with 405 billion parameters that combines multiple LLM training methods to deliver superior performance and a wide range of application scenarios. 01,0160 Open Source Project# Ai2# Open Source Model
HunyuanWorld-Voyager Tencent introduced the industry's first open source world model that supports native 3D reconstruction and ultra-long roaming, allowing for rapid generation of interactive and immersive 3D scenes based on a single image or text. 01,0120 Open Source Project# Virtual Worlds
Qwen-Image Ali Tongyi Thousand Questions open source 20 billion parameter image generation model , specializing in Chinese and English high fidelity text rendering and complex scene detail processing , support for multi-style image generation . 09960 AI image generationOpen Source Project# Image Generation
Tongyi Qianqian Qwen1.5 Alibaba launched a large-scale language model with multiple parameter scales from 0.5B to 72B, supporting multilingual processing, long text comprehension, and excelling in several benchmark tests. 09930 Open Source Project
PromptEnhancer Tencent's open source Chinese text-to-image prompt word enhancement framework that optimizes user-input prompts and improves the image quality and semantic accuracy of the generated model. 08950 AI assistantOpen Source Project# Cue word enhancement
Gemma 3n Google introduced a lightweight open source large language model , both high performance and easy to deploy , suitable for local development and multi-scenario applications . 07690 Large ModelOpen Source Project# Large Language Model
PaddleOCR-VL Baidu's lightweight multimodal document parsing model, with 0.9B parameters, achieves accurate recognition and structured output of complex documents in 109 languages, with world-leading performance. 06990 AI document assistantOpen Source Project# Document Analysis
Seed-OSS ByteDance's open-source 36 billion parameter-long contextual big language model supports 512K tokens, a controlled mind budget, excels in inference, code and agent tasks, and is freely commercially available under the Apache-2.0 license. 06740 Large ModelOpen Source Project# Large model
SongBloom Tencent AI Lab and other joint research and development of open source song generation model, 10 seconds of audio + lyrics into 2 minutes 30 seconds of high-quality music, comparable to commercial standards. 05980 AI music compositionOpen Source Project# Song Generation
HunyuanImage2.1 Tencent launched the open source raw image model, which natively supports 2K HD raw images, accurately parses complex semantics, and can efficiently generate high-quality images with Chinese and English fusion. 05690 AI Video CreationLarge Model# graphical model
SAM 3D Meta open source revolutionary single-image 3D generation model, support one-click from 2D photos to generate high-fidelity, interactive 3D models, covering the object/human body scene, empowering e-commerce, AR/VR, film and television, and other multi-industry cost reduction and efficiency. 04970 Open Source Project# 3D model# 3D generation
SeekDB OceanBase is the world's first open source AI-native database, which focuses on multimodal hybrid search, minimal development and extreme security, redefining the way data and AI converge and helping developers build high-performance intelligent applications with a single click. 03300 AI data processingOpen Source Project# Database
SmartResume Ali open source SmartResume is a high-precision resume parsing system based on OCR and lightweight large models, which can convert 12 formats of resumes such as PDF/pictures into structured data in seconds, with an accuracy rate of 93.1%. 02550 Open Source Project# Resume Analysis
DeepSeek-Math-V2 The world's first large model of mathematical reasoning in open source form to reach the gold medal level of the International Mathematical Olympiad (IMO), realizing the rigor of reasoning and the ability to solve difficult mathematical problems through a self-verification framework. 02200 Large ModelOpen Source Project# Mathematical Reasoning