Kolors Racer has open-sourced a text-to-image generation model called Kolors (Kotu), which has a deep understanding of English and Chinese and is capable of generating high-quality, photorealistic images. 05250 Open Source Project
GPT-SoVITS Open source sound cloning tool focused on enabling high quality, cross-language sound (especially singing) conversion. 05210 Open Source Project# sound clone
Dify AI A next-generation large-scale language modeling application development framework for easily building and operating generative AI native applications. 04450 Open Source Project# Application Development Framework
InternLM Shanghai AI Lab leads the launch of a comprehensive big model research and development platform, providing an efficient tool chain and rich application scenarios to support multimodal data processing and analysis. 03760 Open Source Project# InternLM# Scholar# Large model
OpenHands Open source software development agent platform designed to improve developer efficiency and productivity through features such as intelligent task execution and code optimization. 03620 Open Source Project# Development Agent
ChatGLM-6B An open source generative language model developed by Tsinghua University, designed for Chinese chat and dialog tasks, demonstrating powerful Chinese natural language processing capabilities. 03590 Open Source Project# ChatGLM-6B# Large model# Open Source
Mistral Small 3 Open source AI model with 24 billion parameters featuring low-latency optimization and imperative task fine-tuning for conversational AI, low-latency automation, and domain-specific expertise applications. 03580 Open Source Project# Mistral.AI# Small 3
Tülu 3 405B Allen AI introduces a large open source AI model with 405 billion parameters that combines multiple LLM training methods to deliver superior performance and a wide range of application scenarios. 03470 Open Source Project# Ai2# Open Source Model
MetaGPT Multi-intelligent body collaboration open source framework, through the simulation of software company operation process, to achieve efficient collaboration and automation of GPT model in complex tasks. 03460 Open Source Project# Multi-intelligence
Deep-Live-Cam Python-based open source AI real-time face replacement tool that supports millisecond face replacement effects and can be used in a variety of fields such as entertainment, art creation and education. 03300 Open Source Project# Real-time face change
OmAgent Device-oriented open-source smart body framework designed to simplify the development of multimodal smart bodies and provide enhancements for various types of hardware devices. 03290 Open Source Project# Smart Body Frame
Gemma Google's lightweight, state-of-the-art open-source models, including Gemma 2B and Gemma 7B scales, each available in pre-trained and instruction-fine-tuned versions, are designed to support developer innovation, foster collaboration, and lead to responsible use of the models through their powerful language understanding and generation capabilities. 03280 Open Source Project# Gemma# Open Source
Skywork-13B Developed by Kunlun World Wide Web, the open source big model, with 13 billion parameters and 3.2 trillion high-quality multi-language training data, has demonstrated excellent natural language processing capabilities in Chinese and other languages, especially in the Chinese environment, and is applicable to a number of domains. 03250 Open Source Project# Skywork# Skywork-13B# Open Source
Grok-1 xAI released an open source large language model based on hybrid expert system technology with 314 billion parameters designed to provide powerful language understanding and generation capabilities to help humans acquire knowledge and information. 03230 Open Source Project# Open Source
BERT Developed by Google, the pre-trained language model based on the Transformer architecture provides a powerful foundation for a wide range of NLP tasks by learning bi-directional contextual information on large-scale textual data with up to tens of billions of parameters, and has achieved significant performance gains across multiple tasks. 03190 Open Source Project
Meta Llama 3 Meta's high-performance open-source large language model, with powerful multilingual processing capabilities and a wide range of application prospects, especially in the conversation class of applications excel. 03070 Open Source Project# Open Source Large Model
GraphRAG Microsoft's open-source retrieval-enhanced generative model based on knowledge graph and graph machine learning techniques is designed to improve the understanding and reasoning of large language models when working with private data. 03050 Open Source Project# Large model
ChatTTS An open source text-to-speech model optimized for conversational scenarios, capable of generating high-quality, natural and smooth conversational speech. 03000 AI audioOpen Source Project# Conversational TTS
OmniGen Unified image generation diffusion model, which naturally supports multiple image generation tasks with high flexibility and scalability. 02990 Open Source Project# Image Generation
Mistral 7B A powerful large-scale language model with about 7.3 billion parameters, developed by Mistral.AI, demonstrates excellent multilingual processing power and reasoning performance. 02980 Open Source Project# Mistral 7B# Mistral.AI# Open Source
BLOOM A large open-source multilingual language model developed by over 1,000 researchers from more than 60 countries and 250 institutions, with 176B parameters and trained on the ROOTS corpus, supporting 46 natural languages and 13 programming languages, aims to advance the research and use of large-scale language models by academics and small companies. 02880 Open Source Project
TeleChat The 7 billion parameter semantic grand model based on the Transformer architecture launched by China Telecom has powerful natural language understanding and generation capabilities, and is applicable to multiple AI application scenarios such as intelligent dialog and text generation. 02790 Open Source Project# TeleChat# Open Source
Emu3 Beijing Zhiyuan Artificial Intelligence Research Institute launched a large model containing several series with large-scale, high-precision, emergent and universal characteristics, and has been fully open-sourced. 02750 Open Source Project# Open Source Large Model
LangChain An open source framework for building large-scale language modeling application designs, providing modular components and toolchains to support the entire application lifecycle from development to production. 02690 Open Source Project# Large model
MindSpore Huawei's full-scenario deep learning framework is designed to provide full-stack AI capabilities that are easy to develop and efficient to execute, supporting the complete process from data loading and model building to training, evaluation and deployment. 02690 Open Source Project# Deep Learning Framework
Shortest An end-to-end testing framework based on natural language processing and AI technologies which streamlines the testing process, increases testing efficiency, and lowers the testing threshold. 02640 Open Source Project# AI testing framework
Tongyi Qianqian Qwen1.5 Alibaba launched a large-scale language model with multiple parameter scales from 0.5B to 72B, supporting multilingual processing, long text comprehension, and excelling in several benchmark tests. 02580 Open Source Project
Laminar An open source AI engineering optimization platform focused on AI engineering from first principles. It helps users collect, understand and use data to improve the quality of LLM (Large Language Model) applications. 02570 Open Source Project# Analysis Tool
Phi-3 A high-performance large-scale language model from Microsoft, tuned with instructions to support cross-platform operation, with excellent language comprehension and reasoning capabilities, especially suitable for multimodal application scenarios. 02570 Open Source Project
AutoGPT Based on the GPT-4 open-source project, integrating Internet search, memory management, text generation and file storage, etc., it aims to provide a powerful digital assistant to simplify the process of user interaction with the language model. 02290 Open Source Project# Digital Assistant