AI Resource Central

Gathering excellent artificial intelligence open source projects around the world!

Welcome to AI Resource Central! This repository brings together excellent artificial intelligence (AI) open source projects from around the world. Whether you are looking for inspiration to start your own project or want to learn how to use the latest AI technologies, this is a great starting point. We are committed to giving AI developers, researchers, and enthusiasts a platform to explore, discuss, and share the code and implementations of various AI projects.

AI project summary

AI Resource Central

Agents
Prompt engineering
AI development interface
Agile development framework
Model
AI List
Reasoning Optimization
Information aggregation
Code Assistant
AI Tutorial
Workflow automation
AI robot
Multimodal model
Multilingual Model
Data organization
AI Services
Vector database
AI Education
AI development tools
Model training
AI image generation
Dataset
Model evaluation
Computational Management
AI Writing
Intelligent monitoring
Video generation
Data Management
AI Applications
AI Engineering
Target
Project Catalog
How to contribute
License

Target

Our main goals are:

Establish a comprehensive AI project library: cover a wide range of fields such as machine learning, deep learning, and natural language processing.
Promote the spirit of open source: showcase high-quality open source projects to encourage more people to participate in the open source community.
Promote technological innovation: provide developers with practical cases and solutions to accelerate the application and development of AI technology.
Support learning and development: provide hands-on opportunities for learners of all levels to help them master the latest AI tools and technologies.

Project Catalog

Below is a selection of AI projects, grouped by field:

Agents

Significant-Gravitas/AutoGPT - AutoGPT aims to make AI accessible for everyone to use and build on. Its mission is to provide the tools so that people can focus on what matters.
geekan/MetaGPT - A multi-agent framework, billed as the first AI software company, aimed at natural language programming.
microsoft/autogen - A programming framework for agentic AI, with related resources on PyPI, Discord, and office hours.
reworkd/AgentGPT - Assemble, configure, and deploy autonomous AI agents in the browser.
joaomdmoura/crewAI - A framework for orchestrating role-playing, autonomous AI agents so that they can collaborate on complex tasks.
microsoft/JARVIS - JARVIS is a system used to connect large language models (LLMs) with the machine learning (ML) community. (Paper: https://arxiv.org/pdf/2303.17580.pdf)
mem0ai/mem0 - The memory layer for AI applications.
microsoft/semantic-kernel - Quickly and easily integrate state-of-the-art large language model (LLM) technology into your application.
yoheinakajima/babyagi -
openai/swarm - An educational framework exploring ergonomic, lightweight multi-agent orchestration, managed by the OpenAI Solutions team.
phidatahq/phidata - Build multimodal agents with memory, knowledge, tools, and reasoning, and chat with them through a polished agent UI.
TransformerOptimus/SuperAGI - SuperAGI is a developer-first open source autonomous AI agent framework that lets developers build, manage, and run useful autonomous agents quickly and reliably.
composiohq/composio - Composio equips AI agents and large language models (LLMs) with more than 100 high-quality integrations via function calling.
cpacker/MemGPT - Letta (formerly known as MemGPT), a framework for creating large language model (LLM) services with memory capabilities.
google-deepmind/deepmind-research - This repository contains implementation code and sample code for DeepMind publications.
botpress/botpress - An open source hub for building and deploying GPT/LLM agents.
OpenMOSS/MOSS - An open source, tool-augmented conversational language model from Fudan University.
smol-ai/developer - The first library that allows you to embed developer agents in your own applications.
OpenBMB/XAgent - An autonomous language model agent for solving complex tasks.
langchain-ai/langgraph - Build resilient language agents as graphs.
e2b-dev/e2b - A secure open source cloud runtime for AI applications and AI agents.
modelscope/agentscope - Build multi-agent applications powered by large language models (LLMs) more easily.
homanp/superagent - Run AI agents through an API.
aiwaves-cn/agents -
frdel/agent-zero - Agent Zero artificial intelligence framework.
microsoft/TinyTroupe - LLM-powered multi-agent persona simulation for imagination enhancement and business insights.
QwenLM/Qwen-Agent - An agent framework and applications built on Qwen≥2.0, featuring function calling, a code interpreter, retrieval-augmented generation (RAG), and a Chrome extension.
OpenBMB/AgentVerse - AgentVerse is designed to ease the deployment of multiple LLM-based agents in applications, and mainly provides two frameworks: task solving and simulation.
Significant-Gravitas/Auto-GPT-Plugins - Auto-GPT plugin.
huggingface/smolagents - smolagents is a barebones library for agents. Its agents write Python code to call tools and orchestrate other agents.
Ironclad/rivet - An open source visual artificial intelligence programming environment and TypeScript library.
gmpetrov/databerry - A no-code platform for creating custom large language model (LLM) agents.
OpenBMB/BMTools - Tool learning for big models, with open source solutions for ChatGPT plugins.
langroid/langroid - Harness large language models with multi-agent programming.
muellerberndt/mini-agi - MiniAGI is a simple general-purpose autonomous agent based on the OpenAI API.
Farama-Foundation/PettingZoo - An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities.
Josh-XT/AGiXT - AGiXT is a dynamic AI platform that uses adaptive memory, smart features, and a plugin system to manage instructions and execute tasks across multiple AI providers, delivering efficient AI solutions.
Togethercomputer/moa - Together Mixture-of-Agents (MoA) achieved 65.1% on AlpacaEval using open source models.
AgentOps-AI/agentops - A Python SDK for AI agent monitoring, LLM cost tracking, and benchmarking. It integrates with a variety of large language models and agent frameworks.
noahshinn/reflexion - [NeurIPS 2023] "Reflexion: Language Agents with Verbal Reinforcement Learning"
SciSharp/BotSharp - An AI multi-agent framework in .NET.
dot-agent/nextpy -
iterative/datachain - Extraction, transformation, loading (ETL), analysis and versioning of unstructured data.
agiresearch/OpenAGI - OpenAGI: when large language models (LLMs) meet domain experts.
InternLM/lagent - A lightweight framework for building agents based on large language models.
MineDojo/MineDojo - Building open-ended embodied agents with Internet-scale knowledge.
Forethought-Technologies/AutoChain - AutoChain is used to build lightweight, extensible, and testable large language model (LLM) agents.
landing-ai/vision-agent - Visual agent.
BCG-X-Official/agentkit - A starter kit for building constrained agents using Nextjs, FastAPI, and Langchain.
jina-ai/thinkgpt - Agent techniques to augment large language models (LLMs) and push them beyond their limits.
farizrahman4u/loopgpt - A modular Auto-GPT framework.
Farama-Foundation/chatarena - ChatArena is a multi-agent language game environment for large language models (LLMs), designed to develop the communication and collaboration capabilities of AI.
THUDM/AgentTuning - AgentTuning: enabling generalized agent abilities for large language models.
Yifan-Song793/RestGPT - An autonomous agent based on large language models that controls real-world applications through RESTful (Representational State Transfer) APIs.
Link-AGI/AutoAgents - [IJCAI 2024] Generates different roles for GPTs to form a collaborative entity for complex tasks.
AI-Engineer-Foundation/agent-protocol - A common interface for interacting with AI agents. It is independent of the technology stack and can be used with any agent-building framework.
kreneskyp/ix - An autonomous GPT-4 agent platform.

Prompt engineering

f/awesome-chatgpt-prompts - A curated collection of ChatGPT prompts for making better use of ChatGPT and other large language model (LLM) tools.
PlexPt/awesome-chatgpt-prompts-zh - A Chinese-language guide to prompting ChatGPT, with usage guides for various scenarios. Learn how to make it follow your instructions.
dair-ai/Prompt-Engineering-Guide - Guides, papers, lectures, notebooks, and resources for prompt engineering.
stanfordnlp/dspy - DSPy: a framework for programming, rather than prompting, language models.
guidance-ai/guidance - A guidance language for controlling large language models.
outlines-dev/outlines - Structured text generation
mshumer/gpt-prompt-engineer -
jxnl/instructor - Structured output of large language models (LLMs).
brexhq/prompt-engineering - Tips and tricks for using OpenAI's GPT-4 and other large language models.
LouisShark/chatgpt_system_prompt - A collection of GPT system prompts and knowledge about prompt injection and prompt leaking.
microsoft/TypeChat - TypeChat is a library for building natural language interfaces with types.
sgl-project/sglang - SGLang is a fast serving framework for large language models and vision-language models.
mit-han-lab/streaming-llm - [ICLR 2024] "Efficient Streaming Language Models with Attention Sinks"
spdustin/ChatGPT-AutoExpert - Supercharged custom instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
civitai/civitai - A repository of models, textual inversions, and more.
Moonvy/OpenPromptStudio - A visual prompt editor for AIGC | OPS | Open Prompt Studio
rockbenben/ChatGPT-Shortcut - Maximize efficiency and productivity with AI shortcuts. Customize, save, and share prompts, and find prompts for different scenarios in the shared community.
microsoft/promptbase - All things prompt engineering.
PrefectHQ/marvin - Build AI interfaces that spark joy.
promptfoo/promptfoo - Test prompts, agents, and retrieval-augmented generation (RAG) pipelines. Run red teaming, penetration testing, and vulnerability scanning for large language models (LLMs), compare model performance, and use simple configuration with command line and CI/CD integration.
princeton-nlp/tree-of-thought-llm - [NeurIPS 2023] "Tree of Thoughts: Deliberate Problem Solving with Large Language Models"
pydantic/pydantic-ai - An agent framework, or shim, for using Pydantic with large language models (LLMs).
1rgs/jsonformer - A reliable way to generate structured JSON from a language model.
thunlp/OpenPrompt - An open source framework for prompt learning.
guardrails-ai/guardrails - Adds guardrails to large language models.
eth-sri/lmql - A language for constraint-guided, efficient programming of large language models (LLMs).
promptslab/Promptify - Prompt engineering and prompt versioning. Use GPT or other prompt-based models to get structured output. Join the project's Discord for related research.
shreyashankar/gpt3-sandbox - This project is designed to allow users to create excellent web demonstrations with a few lines of Python code by using the new OpenAI GPT-3 API.
hegelai/prompttools - Open source tools for prompt testing and experimentation, supporting large language models (such as OpenAI, LLaMA) and vector databases (such as Chroma, Weaviate, LanceDB).
bigscience-workshop/promptsource - A toolkit for natural language prompts, including creation, sharing, and use.
YiVal/YiVal - An automatic prompt engineering assistant for generative AI applications.
microsoft/prompt-engine - A library that helps developers create prompts for large language models.
ianarawjo/ChainForge - An open source visual programming environment for battle-testing prompts for large language models (LLMs).
spcl/graph-of-thoughts - The official implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models".
ysymyth/ReAct - [ICLR 2023] "ReAct: Synergizing Reasoning and Acting in Language Models"
Microsoft/genaiscript - Automatable generative AI (GenAI) scripting.
jackmpcollins/magentic - Seamlessly integrate large language models as Python functions.
adieyal/sd-dynamic-prompts - A custom script for AUTOMATIC1111/stable-diffusion-webui that implements a small template language for random prompt generation.
zjunlp/EasyEdit - [ACL 2024] An easy-to-use knowledge editing framework for large language models (LLMs).
microsoft/aici - AICI: prompts as WebAssembly (Wasm) programs.
zou-group/textgrad - TextGrad: automatic "differentiation" via text, using large language models to backpropagate textual gradients.
microsoft/PromptCraft-Robotics - A community for applying large language models (LLMs) to robotics, plus a robot simulator with ChatGPT integration.
greshake/llm-security - New ways of breaking application-integrated large language models.
noamgat/lm-format-enforcer - Enforces the output format of a language model (such as JSON Schema or regular expressions).
Ber666/llm-reasoners - A library for complex reasoning in large language models.
jujumilk3/leaked-system-prompts - A collection of leaked system prompts.
laiyer-ai/llm-guard - A security toolkit for large language model interactions.
hiyouga/FastEdit - Quickly edit large language models in 10 seconds.
timqian/openprompt.co - Create, use, and share ChatGPT prompts.
explosion/spacy-llm - Integrates large language models (LLMs) into structured natural language processing (NLP) pipelines.
protectai/rebuff - A prompt injection detector for large language models (LLMs).
getmetal/motorhead - Motorhead is a memory and information retrieval server for large language models (LLMs).
Mirascope/mirascope - Large language model (LLM) abstractions that aren't obstructions.
cocacola-lab/ChatIE -

AI development interface

jmorganca/ollama - Get started quickly with Llama 3.3, Mistral, Gemma 2 and other large language models.
ChatGPTNextWeb/ChatGPT-Next-Web - A cross-platform user interface (UI) for ChatGPT, Gemini, and other models that gives you your own large language model (LLM) application in one click.
xtekky/gpt4free - The official gpt4free repository contains a variety of powerful language models.
oobabooga/text-generation-webui - A Gradio web user interface for large language models, with support for multiple inference backends.
RVC-Boss/GPT-SoVITS - One minute of voice data is enough to train a good text-to-speech (TTS) model (few-shot voice cloning).
gradio-app/gradio - Build and share excellent machine learning applications in Python. Star the repository to support the project.
mckaywrigley/chatbot-ui - AI chat for every model.
openai/openai-python - The official Python library for OpenAI API.
danny-avila/LibreChat - An enhanced version of ChatGPT with various features such as different APIs, AI models and features, and it is an active self-hosted open source project.
sunner/ChatALL - Chat with multiple chatbots (such as ChatGPT, Bing Chat, etc.) simultaneously to find the best answer.
GaiZhenbiao/ChuanhuChatGPT - Graphical user interface (GUI) for ChatGPT API and many large language models (LLMs). It has various functions, such as agents and file-based QA, and has a beautiful user interface.
CopilotKit/CopilotKit - React UI and elegant infrastructure for AI copilots, in-app AI agents, AI chatbots, and AI-powered text areas.
mlc-ai/web-llm - High-performance in-browser large language model (LLM) inference engine.
jina-ai/clip-as-service - Scalable embedding, reasoning, and ranking of images and sentences with CLIP.
chathub-dev/chathub - One-stop chatbot client.
TheRamU/Fay - Fay is an open source digital human framework. It has different versions for various applications.
sashabaranov/go-openai - Go wrapper for OpenAI ChatGPT, GPT-3, GPT-4, DALL·E and Whisper APIs.
SillyTavern/SillyTavern - A large language model (LLM) front end for power users.
openai/openai-node - The official JavaScript/TypeScript library for OpenAI API.
sebastianstarke/AI4Animation - Use computer intelligence to bring characters to life in Unity.
xiangsx/gpt4free-ts - Provides a free OpenAI GPT-4 API. This project is a TypeScript replication of xtekky/gpt4free.
wzpan/wukong-robot - Wukong Robot is a simple, flexible and elegant Chinese voice conversation robot/smart speaker project. It supports ChatGPT multi-round dialogue and may be the first open source smart speaker project to support brain-computer interaction.
yihong0618/xiaogpt - Play ChatGPT and other large language models (LLM) with Xiaomi smart speakers.
nat/openplayground - A large language model (LLM) playground that can run on a laptop.
postgresml/postgresml - Postgres (a database management system) with GPU support for machine learning and AI applications.
Shaunwei/RealChar - Create, customize and have real-time conversations with AI characters/partners. Use various technologies to achieve seamless AI dialogue anytime, anywhere.
ParisNeo/lollms-webui - The web user interface for Lord of Large Language Models (LoLLMs).
zhayujie/bot-on-anything - A chatbot builder based on large models that can quickly integrate AI models such as ChatGPT, Claude, and Gemini into applications and websites such as Telegram, Gmail, and Slack.
deanxv/coze-discord-proxy - Proxies Discord conversations with a Coze bot so that the GPT-4 model can be requested through an API, providing chat, text-to-image, image-to-text, and knowledge base retrieval.
vocodedev/vocode-python - Build voice-based large language model (LLM) agents. Modular and open source.
alexrudall/ruby-openai - The OpenAI API for Ruby.
ahmadbilaldev/langui - Artificial Intelligence User Interface. Open source Tailwind components for GPT, Generative Artificial Intelligence, and Large Language Model (LLM) projects.
ollama/ollama-js - Ollama JavaScript library.
xusenlinzy/api-for-open-llm - OpenAI-style API for open large language models. Supports various models, such as LLaMA, ChatGLM, etc.
Anse-app/anse - A supercharged experience for ChatGPT, DALL-E, and Stable Diffusion models.
mylxsw/aidea-server - AIdea is an all-in-one app that supports GPT and Chinese large language models (such as Tongyi Qianwen and Wenxin Yiyan), as well as Stable Diffusion text-to-image, image-to-image, SDXL 1.0, super-resolution, and image colorization.
aallam/openai-kotlin - Kotlin OpenAI API client that supports multi-platform and coroutines.
guinmoon/LLMFarm - Run Llama and other large language models offline on iOS and macOS using the GGML library.
uezo/ChatdollKit - ChatdollKit allows you to convert your own 3D models into chatbots.

Agile development framework

langchain-ai/langchain - Build context-aware reasoning applications.
nomic-ai/gpt4all - GPT4All is able to run local large language models (LLMs) on any device. It is open source and can be used for commercial purposes.
comfyanonymous/ComfyUI - The most powerful and modular diffusion model GUI, API, and backend, with a graph/nodes interface.
langgenius/dify - Dify is an open source large language model (LLM) application development platform with an intuitive interface and a range of features for going from prototype to production.
lobehub/lobe-chat - Lobe Chat is an open source AI chat framework with a modern design. It supports multiple AI providers, knowledge bases, and multimodality, and lets you deploy a private chat application for free with one click.
logspace-ai/langflow - Langflow is a Python-based, model-agnostic low-code application builder for retrieval-augmented generation (RAG) and multi-agent AI applications. It can work with any API or database.
run-llama/llama_index - LlamaIndex is a data framework for large language model (LLM) applications.
FlowiseAI/Flowise - Build your customized large language model (LLM) flow using a drag-and-drop user interface.
chatchat-space/Langchain-Chatchat - Langchain-Chatchat (formerly Langchain-ChatGLM) is a local knowledge-based retrieval-augmented generation (RAG) and agent application built on Langchain and large language models (LLMs) such as ChatGLM, Qwen, and Llama.
go-skynet/LocalAI - An open source project that can replace services such as OpenAI and Claude. It can run on consumer hardware and perform various tasks such as generating different media types.
infiniflow/ragflow - RAGFlow is an open source retrieval-augmented generation (RAG) engine based on deep document understanding.
mindsdb/mindsdb - AGI's query engine is a platform for building artificial intelligence that can learn and answer questions on federated data.
embedchain/embedchain - The memory layer for your AI applications.
songquanpeng/one-api - An OpenAI key management and redistribution system. It supports multiple large language models (LLMs), has an English user interface (UI), ships as a single executable, and provides a Docker image for easy deployment.
Cinnamon/kotaemon - An open source tool based on retrieval-augmented generation (RAG) for chatting with your documents.
labring/FastGPT - FastGPT is a knowledge platform based on large language models (LLMs) that provides multiple functions to easily develop and deploy Q&A systems.
deepset-ai/haystack - An AI orchestration framework for building large language model (LLM) applications, such as retrieval-augmented generation (RAG) and chatbots, with advanced retrieval methods.
BerriAI/litellm - A Python SDK and proxy server (LLM gateway) for calling more than 100 large language model (LLM) APIs, including Bedrock and Azure, in the OpenAI format.
flairNLP/flair - A very simple framework for state-of-the-art natural language processing.
langchain-ai/langchainjs - Build context-aware reasoning applications.
xenova/transformers.js - State-of-the-art machine learning for the web. Run Transformers models directly in the browser, with no need for a server.
netease-youdao/QAnything - Q&A based on anything.
h2oai/h2ogpt - Private chat with a local GPT, supporting documents, images, video, and other content. It is 100% private, licensed under Apache 2.0, and supports Ollama, Mixtral, llama.cpp, and more.
pathwaycom/llm-app - Ready-to-run cloud templates for retrieval-augmented generation (RAG), AI pipelines, and enterprise search with live data. They are Docker-friendly and stay in sync with various data sources.
ludwig-ai/ludwig - A low-code framework for building custom AI models such as large language models (LLMs) and other neural networks.
vercel/ai - Build AI-powered applications using React, Svelte, Vue and Solid.
microsoft/promptflow - Build high-quality large language model (LLM) applications, from prototyping and testing to production deployment and monitoring.
Unstructured-IO/unstructured - Open source libraries and APIs for building custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
dataelement/bisheng - BISHENG is an open large language model (LLM) DevOps platform for enterprise AI applications. Its features include generative AI (GenAI) workflows and retrieval-augmented generation (RAG).
Togethercomputer/OpenChatKit -
llmware-ai/llmware - A unified framework for building enterprise retrieval-augmented generation (RAG) pipelines using small, specialized models.
leptonai/search_with_lepton - Quickly create a conversation-based search demo using Lepton AI.
Deeptrain-Community/chatnio - A next-generation, one-stop AI solution for both business (B2B) and consumer (B2C) use, supporting a variety of models and features.
Chainlit/chainlit - Quickly build conversational AI in minutes.
modelscope/modelscope - ModelScope turns the concept of model as a service into reality.
deeppavlov/DeepPavlov - An open source library for deep learning end-to-end conversation systems and chatbots.
langchain-ai/opengpts -
TaskingAI/TaskingAI - An open source platform for developing native artificial intelligence applications.
wenda-LLM/wenda - Wenda is a large language model (LLM) call platform designed to efficiently generate content in a specific environment, taking into account the limitations of personal and small and medium-sized enterprise computing resources as well as knowledge security and privacy issues.
rustformers/llm - An unmaintained Rust library ecosystem for handling large language models. See the readme file for details.
josStorer/RWKV-Runner - A fully automated RWKV management and startup tool of only 8 MB, with an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
langchain4j/langchain4j - Java version of LangChain.
OpenBMB/ToolBench - [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
microsoft/FLAML - A fast library for automated machine learning (AutoML) and tuning. The project also has a Discord community.
microsoft/lmops - General technology for enabling AI capabilities with large language models (LLMs) and multimodal large language models (MLLMs).
llm-workflow-engine/llm-workflow-engine - A power CLI and workflow manager for large language models (core package).
timescale/pgai - A suite of tools for developing retrieval-augmented generation (RAG), semantic search, and other AI applications more easily with PostgreSQL.
FreedomIntelligence/LLMZoo - LLM Zoo is a project that provides data, models and evaluation benchmarks for large language models.
casibase/casibase - AI Cloud is an open source retrieval-augmented generation (RAG) knowledge base similar to LangChain. It supports multiple models and includes chatbot and admin user interface (UI) demos.
getzep/zep - Zep: the memory foundation for your AI stack.
leptonai/leptonai - A Python framework for simplifying the construction of artificial intelligence services.
pezzolabs/pezzo - An open source, developer-first LLMOps platform that streamlines prompt design, version management, and more.
cheshire-cat-ai/core - An AI agent microservice.
aurelio-labs/semantic-router - Superfast AI decision-making and intelligent processing of multimodal data.
instill-ai/vdp - Instill Core is a full-stack AI infrastructure tool for data, model, and pipeline orchestration that simplifies building AI-first applications.
intel/intel-extension-for-transformers - Build a chatbot on your device within minutes, using state-of-the-art compression techniques to run large language models efficiently on Intel platforms.
griptape-ai/griptape - A modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
run-llama/LlamaIndexTS - Data framework for large language model (LLM) applications with a focus on server-side solutions.
Agenta-AI/agenta - An open source LLMOps platform that integrates prompt word playground, prompt word management, large language model (LLM) evaluation and large language model observability.
marella/ctransformers - Python bindings for Transformer models implemented in C/C++ using the GGML library.
devflowinc/trieve - All-in-one infrastructure for search, recommendations, retrieval-augmented generation (RAG), and analytics, offered via API.
YangLing0818/RPG-DiffusionMaster - [ICML 2024] "Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs" (RPG).
trypromptly/LLMStack - A no-code multi-agent framework for building large language model (LLM) agents, workflows, and applications using your data.
getzep/graphiti - Build and query dynamic, temporally aware knowledge graphs.
KimMeen/Time-LLM - [ICLR 2024] Official implementation of "Time-LLM: Time Series Forecasting by Reprogramming Large Language Models".
floneum/floneum - Instant, controllable, local pre-trained AI models in Rust.
jina-ai/langchain-serve - Langchain applications in production using Jina and FastAPI.
SqueezeAILab/LLMCompiler - [ICML 2024] LLMCompiler: a large language model (LLM) compiler for parallel function calling.
andreibondarev/langchainrb - Build an application powered by large language model (LLM) using Ruby.
psychic-api/rag-stack - Deploy a private ChatGPT alternative hosted in your virtual private cloud (VPC). It connects to your organization's knowledge base and supports open source large language models (LLMs).
DAGWorks-Inc/burr - Build applications for decision-making, such as chatbots, and manage them on your own infrastructure.
IntelLabs/fastRAG - An efficient retrieval augmentation and generation framework.
sobelio/llm-chain - llm-chain is a powerful Rust crate for building chains in large language models, allowing you to summarize text and complete complex tasks.
microsoft/windows-ai-studio -
vercel/modelfusion - A TypeScript library for creating artificial intelligence applications.
axflow/axflow - A TypeScript framework for artificial intelligence development.
gabrielchua/RAGxplorer - An open source tool for visualizing your retrieval-augmented generation (RAG) pipeline.
parthsarthi03/raptor - The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval.
google/generative-ai-swift - The official Swift library for Google Gemini API.
pinecone-io/canopy - A retrieval-augmented generation (RAG) framework and context engine powered by Pinecone.
safevideo/autollm - Ship large language model (LLM) web applications based on retrieval-augmented generation (RAG) in seconds.

Model

openai/whisper - Robust speech recognition via large-scale weak supervision.
CompVis/stable-diffusion - A latent text-to-image diffusion model.
facebookresearch/llama - Inference code for Llama models.
xai-org/grok-1 - The open release of Grok.
Stability-AI/stablediffusion - High-resolution image synthesis with latent diffusion models.
karpathy/nanoGPT - The simplest, fastest repository for training/fine-tuning medium-sized GPTs.
TencentARC/GFPGAN - GFPGAN aims to develop practical algorithms for real-world face restoration.
lllyasviel/ControlNet -
tatsu-lab/stanford_alpaca - Code and documentation for training Stanford Alpaca models and generating data.
meta-llama/llama3 - Official Meta Llama 3 GitHub website.
Stability-AI/generative-models - Generative Models of Stability AI
lucidrains/vit-pytorch - Implementation of Vision Transformer in PyTorch, a simple way to reach state-of-the-art (SOTA) results in vision classification with only a single transformer encoder.
apple/ml-stable-diffusion - Stable Diffusion using Core ML on Apple Silicon.
facebookresearch/codellama - Inference code for CodeLlama model.
QwenLM/Qwen - The official code base of Qwen (Tongyi Qianwen), a large language model proposed by Alibaba Cloud.
AI4Finance-Foundation/FinGPT - FinGPT: open source financial large language models. Trained models are released on HuggingFace.
state-spaces/mamba - Mamba SSM architecture
BlinkDL/RWKV-LM - RWKV is a recurrent neural network (RNN) with strong large language model (LLM) performance that can be trained like a GPT transformer. It combines excellent performance with linear time complexity.
CompVis/latent-diffusion - High-resolution image synthesis with latent diffusion models.
QwenLM/Qwen1.5 - Qwen (Tongyi Qianwen) 2.5 is a large language model series developed by the Qwen team at Alibaba Cloud.
lucidrains/DALLE2-pytorch - Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch.
NVIDIA/Megatron-LM - Ongoing research on training Transformer models at scale.
guoyww/AnimateDiff - The official implementation of AnimateDiff.
databrickslabs/dolly - Databricks' Dolly is a large language model trained on the Databricks machine learning platform.
mlfoundations/open_clip - An open source implementation of CLIP (Contrastive Language-Image Pre-training).
THUDM/CogVideo - Text and image to video generation: CogVideoX (2024) and CogVideo (International Conference on Learning Representations, 2023).
AIGC-Audio/AudioGPT - AudioGPT: understanding and generating speech, music, sound, and talking heads.
nlpxucan/WizardLM - Large language models (LLMs) built on Evol-Instruct: WizardLM, WizardCoder, and WizardMath.
lucidrains/denoising-diffusion-pytorch - Implementation of a denoising diffusion probabilistic model in PyTorch.
THUDM/CodeGeeX - CodeGeeX is an open source multilingual code generation model (KDD 2023).
Vaibhavs10/insanely-fast-whisper -
01-ai/Yi - A series of large language models developed from the ground up by 01-ai developers.
lucidrains/PaLM-rlhf-pytorch - Implementation of reinforcement learning from human feedback (RLHF), similar to ChatGPT, on top of the PaLM architecture.
HumanAIGC/EMO - Emote Portrait Alive: generate expressive portrait videos with an Audio2Video diffusion model.
alembics/disco-diffusion -
openlm-research/open_llama - OpenLLaMA is a permissively licensed open source reproduction of Meta AI's LLaMA 7B, trained on the RedPajama dataset.
OpenBMB/MiniCPM - MiniCPM3-4B, an edge-side large language model (LLM) that outperforms GPT-3.5-Turbo.
LargeWorldModel/LWM - Large World Model: modeling text and video with million-length context.
LiheYoung/Depth-Anything - "Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data", a foundation model for monocular depth estimation, presented at the 2024 Conference on Computer Vision and Pattern Recognition (CVPR).
openai/point-e - Point cloud diffusion is used for 3D model synthesis.
google-research/text-to-text-transfer-transformer - Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer".
Lightning-AI/lit-llama - An implementation of the LLaMA language model based on nanoGPT. It supports flash attention, quantization, fine-tuning, and pre-training, and is released under the Apache 2.0 license.
OpenGVLab/LLaMA-Adapter - Fine-tuning LLaMA to follow instructions within 1 hour and with 1.2 million parameters (International Conference on Learning Representations, ICLR 2024).
NVIDIA/DALI - A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing, used to accelerate deep learning training and inference applications.
allenai/OLMo - Modeling, training, evaluation and inference code for OLMo.
salesforce/CodeGen - CodeGen is an open source model series for program synthesis, trained on TPU-v4, comparable to OpenAI Codex.
lucidrains/x-transformers - A concise but complete full-attention transformer with a set of experimental features from multiple papers.
SCIR-HI/Huatuo-Llama-Med-Chinese - BenTsao (formerly known as HuaTuo), a repository for instruction-tuning large language models with Chinese medical knowledge.
luosiallen/latent-consistency-model - Latent Consistency Models: synthesizing high-resolution images with few-step inference.
microsoft/BioGPT -
google-research/simclr - SimCLRv2: Large self-supervised models are powerful semi-supervised learners.
llSourcell/Doctor-Dignity - Doctor Dignity is a large language model (LLM) that can pass the US Medical Licensing Examination (USMLE), runs offline, is cross-platform, and keeps health data private.
google-research/multinerf - Code release for Mip-NeRF 360, Ref-NeRF and RawNeRF.
jaymody/picoGPT - A very small implementation of GPT-2 in NumPy.
google-research/albert - ALBERT is a lite version of BERT (Bidirectional Encoder Representations from Transformers) for self-supervised learning of language representations.
project-baize/baize-chatbot - Let ChatGPT teach your own chatbot in hours with a single GPU.
salesforce/CodeT5 - CodeT5: open code large language models (LLMs) for code understanding and generation.
facebookresearch/jepa - PyTorch code and models for V-JEPA (Video Joint Embedding Predictive Architecture), self-supervised learning from video.
paperswithcode/galai - Model API for GALACTICA.
dvlab-research/LongLoRA - Code and documentation for LongLoRA and LongAlpaca (ICLR 2024 oral presentation).
baaivision/Painter - Painter and SegGPT series: vision foundation models from the Beijing Academy of Artificial Intelligence (BAAI).
databricks/dbrx - Code examples and resources for DBRX, a large language model developed by Databricks.
state-spaces/s4 - Structured state space sequence model.
google-research/electra - ELECTRA pre-trains text encoder as a discriminator rather than a generator.
EleutherAI/pythia - The hub for EleutherAI's work on interpretability and learning dynamics.
ise-uiuc/magicoder - Magicoder (ICML'24): empowering code generation with OSS-Instruct.
epfLLM/meditron - Meditron is an open source medical large language model.
MetaGLM/FinGLM - FinGLM aims to build an open, public welfare and lasting financial model project to promote the development of "AI+finance" through open source.
deepseek-ai/DeepSeek-LLM - DeepSeek Large Language Model (LLM): let there be answers.
allenai/scispacy - A full spaCy pipeline and models for scientific/biomedical documents.
apple/ml-4m - 4M: Massively Multimodal Masked Modeling.
google-research/language - A shared repository for open source projects from the Google AI Language team.
google/maxtext - A simple, high-performance, and scalable Jax large language model (LLM).
netease-youdao/BCEmbedding - Open source embedding and reranking models for retrieval-augmented generation (RAG) products.
SHI-Labs/OneFormer - OneFormer in CVPR 2023 is a Transformer for universal image segmentation.
google-research/FLAN -
lxtGH/OMG-Seg - Code base for OMG-LLaVA and OMG-Seg (CVPR 2024 and NeurIPS 2024).
SHI-Labs/Versatile-Diffusion - Versatile Diffusion: text, images and variations all in one diffusion model. Published on arXiv in 2022 and presented at the International Conference on Computer Vision (ICCV) in 2023.
time-series-foundation-models/lag-llama - Lag-Llama: towards foundation models for probabilistic time series forecasting.
openai/lm-human-preferences - Code for the paper on fine-tuning language models from human preferences.
IBM/Dromedary - Dromedary is designed to be a useful, ethical and reliable large-scale language model.
dauparas/ProteinMPNN - Code for the ProteinMPNN paper.
SHI-Labs/Neighborhood-Attention-Transformer - Neighborhood Attention Transformer (arXiv 2022, CVPR 2023) and Dilated Neighborhood Attention Transformer (arXiv 2022).
THUDM/SwissArmyTransformer - SwissArmyTransformer is a flexible and powerful library for developing Transformer variants.
ctllllll/LLM-ToolMaker -
Xwin-LM/Xwin-LM - Xwin-LM: A powerful, stable and reproducible large language model alignment.
microsoft/ToRA - ToRA (ICLR'24) is a series of tool-integrated reasoning agents built on large language models for solving challenging mathematical reasoning problems.
SalesforceAIResearch/uni2ts - Unified training of universal time series forecasting transformers.
replit/ReplitLM - ReplitLM model series inference code and configuration.
HazyResearch/safari - Convolutions for sequence modeling.

AI List

fighting41love/funNLP -
linexjlin/GPTs - Leaked prompts of GPTs.
e2b-dev/awesome-ai-agents - A list of autonomous AI agents.
eugeneyan/open-llms - A list of open large language models (LLMs) available for commercial use.
Shubhamsaboo/awesome-llm-apps - A collection of large language model (LLM) applications with retrieval-augmented generation (RAG) that use OpenAI, Anthropic, Gemini and open source models.
RUCAIBox/LLMSurvey - The official GitHub page for the survey paper "A Survey of Large Language Models".
WooooDyy/LLM-Agent-Paper-List - The paper list for the survey "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
steven2358/awesome-generative-ai - List of contemporary generative artificial intelligence projects and services.
wgwang/LLMs-In-China - A list of large models in China.
lonePatient/awesome-pretrained-chinese-nlp-models - A series of high-quality Chinese pretrained models, large models, multimodal models and large language models.
tensorchord/Awesome-LLMOps - A curated list of the best LLMOps tools for developers.
opendilab/awesome-RLHF - A continuously updated list of resources on reinforcement learning from human feedback (RLHF).
DSXiangLi/DecryptPrompt - Summary of the application of Prompt and Large Language Models (LLM), open source data and models, and artificial intelligence generated content (AIGC).
FreedomIntelligence/Medical_NLP - Medical Natural Language Processing Competition, Datasets, Large Models and Papers.
archinetai/audio-ai-timeline - A timeline of the latest AI models for audio generation, starting in 2023.
chiphuyen/aie-book - Resources for AI engineers and supporting materials for the book AI Engineering (Chip Huyen, 2025).
EgoAlpha/prompt-in-context-learning - A high-quality resource for in-context learning and prompt engineering. Master large language models such as ChatGPT, GPT-3 and FlanT5 with the latest updates.
taranjeet/awesome-gpts - A collection of all GPTs created by the community.
cfahlgren1/natural-sql - A series of high-performance text-to-SQL large language models.
yokoffing/ChatGPT-Prompts - Curated prompts for ChatGPT and Bing AI.

Reasoning Optimization

ggerganov/llama.cpp - Large Language Model (LLM) Inference in C/C++.
ggerganov/whisper.cpp - A port of OpenAI's Whisper model in C/C++.
karpathy/llm.c - Train a large language model (LLM) using simple raw C/CUDA.
Mozilla-Ocho/llamafile - Use only one file to distribute and run large language models (LLMs).
unslothai/unsloth - Increase fine-tuning speeds of Llama 3.3, Mistral, Phi, Qwen 2.5 and Gemma large language models (LLMs) by 2-5 times while using 70% less memory.
mlc-ai/mlc-llm - A universal large language model (LLM) deployment engine with machine learning (ML) compilation capabilities.
karpathy/llama2.c - Run Llama 2 inference in a single file of pure C.
Dao-AILab/flash-attention - Fast and memory-efficient exact attention.
openai/triton - Development repository for the Triton language and compiler.
microsoft/BitNet - Official inference framework for 1-bit large language models (LLMs).
ggerganov/ggml - A tensor library for machine learning.
NVIDIA/TensorRT - NVIDIA TensorRT is a software development kit (SDK) for high-performance deep learning inference on NVIDIA GPUs. This repository contains its open source components.
bigscience-workshop/petals - Run large language models (LLMs) at home, BitTorrent-style. Fine-tuning and inference are up to 10 times faster than offloading.
NVIDIA/TensorRT-LLM - TensorRT-LLM provides an easy-to-use Python API for defining large language models (LLMs) and building optimized TensorRT engines for efficient GPU inference. It also contains components for creating Python and C++ runtimes that execute those engines.
intel-analytics/BigDL - Accelerate local large language model (LLM) inference and fine-tuning on Intel XPUs and integrate with various related frameworks.
intel-analytics/ipex-llm - Accelerate local large language model (LLM) inference and fine-tuning on Intel XPUs and integrate with a variety of tools.
TimDettmers/bitsandbytes - Accessible large language models via k-bit quantization for PyTorch.
google/gemma.cpp - A lightweight standalone C++ inference engine for Google Gemma models.
NVIDIA/cutlass - CUDA templates for linear algebra subroutines.
pytorch-labs/gpt-fast - Use less than 1000 lines of Python code to implement a simple and efficient native PyTorch transformer for text generation.
PanQiWei/AutoGPTQ - An easy-to-use large language model quantization package with a user-friendly interface, based on the GPTQ algorithm.
turboderp/exllamav2 - A fast inference library for running large language models (LLMs) locally on common consumer-grade GPUs.
OpenNMT/CTranslate2 - Fast transformer model inference engine.
ztxz16/fastllm - A pure C++ cross-platform large language model (LLM) acceleration library that can be called from Python. ChatGLM-6B-class models can exceed 10,000 tokens per second on a single card. It supports the GLM, Llama, and MOSS base models and runs smoothly on mobile devices.
qwopqwop200/GPTQ-for-LLaMa - Quantize LLaMA into 4 bits using GPTQ.
VainF/Torch-Pruning - [CVPR 2023] DepGraph: Pruning for arbitrary structures.
turboderp/exllama - A more memory-efficient rewrite of the HF Transformers implementation of Llama for use with quantized weights.
lucidrains/vector-quantize-pytorch - Vector (and scalar) quantization in PyTorch
mit-han-lab/llm-awq - AWQ: activation-aware weight quantization for compression and acceleration of large language models. Winner of the MLSys 2024 Best Paper Award.
Jittor/JittorLLMs - The Jittor model inference library has the characteristics of high performance, low configuration requirements, good Chinese support and portability.
FasterDecoding/Medusa - Medusa: a simple framework that accelerates large language model (LLM) generation with multiple decoding heads.
intel/neural-compressor - SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) and sparsity, with leading model compression techniques for TensorFlow, PyTorch and ONNX Runtime.
neuralmagic/sparseml - A library for easily applying sparsity to neural networks, resulting in faster and smaller models.
IST-DASLab/gptq - Code for "GPTQ", the paper on accurate post-training quantization of generative pre-trained transformers (GPT), presented at the 2023 International Conference on Learning Representations (ICLR).
HazyResearch/ThunderKittens - Tile primitives for fast kernels.
uTensor/uTensor - A TinyML artificial intelligence inference library.
pytorch-labs/ao - PyTorch native quantization and sparsity for training and inference.
saharNooby/rwkv.cpp - INT4/INT5/INT8 and FP16 inference on CPUs for RWKV language model.
mit-han-lab/smoothquant - SmoothQuant: Accurate and efficient post-training quantization of large language models.
Lightning-AI/lightning-thunder - Thunder is a source-to-source compiler for PyTorch that can make models up to 40% faster and can use different hardware executors across multiple GPUs.
pytorch-labs/segment-anything-fast - A version of segment-anything oriented to batched offline inference.
Vahe1994/AQLM - Official PyTorch repository for two papers on extreme compression of large language models: one on additive quantization (https://arxiv.org/pdf/2401.06118.pdf) and the other on PV-Tuning (https://arxiv.org/abs/2405.14852).
hao-ai-lab/LookaheadDecoding - Use lookahead decoding to break the sequential dependencies in large language model (LLM) inference (ICML 2024).
horseee/LLM-Pruner - [NeurIPS 2023] LLM-Pruner: structural pruning of large language models (such as Llama-3/3.1, Llama-2, LLaMA, etc.).
kuleshov/minillm - MiniLLM is a system that runs modern large language models (LLM) on consumer-grade GPUs in a minimal way.

Information aggregation

binary-husky/gpt_academic - Provides practical interactive interface for large language models (LLMs) such as GPT/GLM, especially optimizes paper reading, polishing and writing experience. Supports multiple functions and integrates multiple models.
imartinez/privateGPT - Use GPT to interact with your documents privately, with no data leaks.
Mintplex-Labs/anything-llm - An all-in-one desktop and Docker AI application with built-in retrieval-augmented generation (RAG) and AI agents.
khoj-ai/khoj - A self-hostable AI second brain. Get answers from various sources, build custom agents, schedule automations, and conduct research. Turn any large language model into your personal AI, for free.
PromtEngineer/localGPT - Chat privately with local documents through GPT models. Your data does not leave the device.
kaixindelele/ChatPaper - Use ChatGPT to accelerate scientific research, including summary of arXiv papers, professional translation, polishing, peer review and response to peer review.
assafelovic/gpt-researcher - An autonomous agent based on large language models (LLMs) that conducts local and web research on any topic and creates a comprehensive report with citations.
arc53/DocsGPT - A documentation chatbot that lets you chat with your data, deploy privately, and share knowledge by integrating it into AI workflows.
mayooear/gpt4-pdf-chatbot-langchain - GPT4 and LangChain chatbot for large PDF documents.
danswer-ai/danswer - Gen-AI chat for teams: like ChatGPT, but with access to your team's unique knowledge.
josStorer/chatGPTBox - Deeply integrate ChatGPT into your browser. Everything you need is here.
facebookresearch/nougat - Implementation of Nougat: Neural Optical Understanding for Academic Documents.
bhaskatripathi/pdfGPT - PDF GPT is able to interact with PDF content through GPT capabilities and is an effective open source solution for turning PDF into a chatbot.
whitead/paper-qa - High-accuracy retrieval-augmented generation (RAG) for answering questions from scientific documents with citations.
weaviate/Verba - A retrieval-augmented generation (RAG) chatbot powered by Weaviate.
run-llama/rags - Build ChatGPT over your data, all with natural language.
MuiseDestiny/zotero-gpt - GPT meets Zotero.
madawei2699/myGPTReader - A community-driven way to read and chat with AI bots, powered by ChatGPT.
swirlai/swirl-search - AI Search & RAG that gets instant answers from company knowledge across numerous applications while keeping data secure and deploying quickly.
dvorka/mindforger - A thinking notebook and Markdown editor.
kha-white/manga-ocr - Optical character recognition for Japanese text, focused mainly on Japanese manga.
nlmatics/llmsherpa - Developer application program interface (API) for accelerating large language model (LLM) projects.
ucbepic/docetl - A system for agentic, large language model (LLM) powered data processing and ETL (extract, transform, load).
KnowledgeCanvas/knowledge - Knowledge is a tool used for various operations (such as saving, search, etc.) of websites, documents, and files.
rotemweiss57/gpt-newspaper - An autonomous GPT-based agent that can create personalized newspapers based on user preferences.
nlmatics/nlm-ingestor - This repository provides server-side code for llmsherpa API connections and parsers for different file formats.
kha-white/mokuro - Read Japanese comics with selectable text in your browser.
BruceMacD/chatd - Chat with your documents via local AI.
akshata29/entaoai - Use your own data to chat and ask questions. Quickly upload enterprise data to use OpenAI services to chat and ask questions about uploaded data.

Code Assistant

abi/screenshot-to-code - Drop in a screenshot and convert it into clean code (HTML/Tailwind/React/Vue).
gpt-engineer-org/gpt-engineer - A terminal-based platform for experimenting with AI software engineers. Note: very different from https://gptengineer.app.
OpenDevin/OpenDevin - OpenHands: code less, make more.
Pythagora-io/gpt-pilot - The first real AI developer.
getcursor/cursor - Artificial Intelligence Code Editor.
OpenBMB/ChatDev - Create customized software based on natural language ideas through multi-agent collaboration with large language models (LLM).
paul-gauthier/aider - Aider is AI pair programming in your terminal.
TabbyML/tabby - A self-hosted AI coding assistant.
continuedev/continue - Continue is an open source artificial intelligence code assistant. It can be connected to models and contexts for custom autocomplete and chat in VS Code and JetBrains.
stitionai/devika - Devika is an agentic AI software engineer that can understand human instructions, break them down into steps, conduct research and write code. Its goal is to be an open source alternative to Devin by Cognition AI. Devika has no official website.
emilwallner/Screenshot-to-code - A neural network for converting design mock-ups into static websites.
fauxpilot/fauxpilot - FauxPilot is an open source alternative to GitHub Copilot servers.
eosphoros-ai/DB-GPT - An AI-native data application development framework with AWEL (Agentic Workflow Expression Language) and agents.
princeton-nlp/SWE-agent - SWE-agent automatically fixes GitHub issues using GPT-4 or other language models, and can also be used for offensive cybersecurity or competitive programming challenges. [NeurIPS 2024]
Sinaptik-AI/pandas-ai - Chat with your data sources (SQL, CSV, etc.). PandasAI makes data analysis conversational using large language models (LLMs) and retrieval-augmented generation (RAG).
vanna-ai/vanna - Chat with your SQL database. Accurate text-to-SQL generation through large language models (LLMs) using retrieval-augmented generation (RAG).
ShishirPatil/gorilla - Gorilla: Training and evaluation of large language models for function calls (tool calls).
codota/TabNine - AI code completions.
TheR1D/shell_gpt - A command line productivity tool powered by AI large language models such as GPT-4, helps to complete tasks faster and more efficiently.
Nutlope/aicommits - A command line interface (CLI) that uses artificial intelligence to write git commit messages for you.
GreyDGL/PentestGPT - A penetration testing tool empowered by GPT.
joshpxyne/gpt-migrate - Easily migrate your codebase between frameworks or languages.
kuafuai/DevOpsGPT - An artificial intelligence-powered software development multi-agent system combines large language models (LLMs) with DevOps tools to transform natural language requirements into runnable software, supporting any development language and extending existing code.
di-sukharev/opencommit - The GPT wrapper for Git is able to generate commit messages in 1 second using a large language model (LLM), works well with Claude 3.5 and supports local models.
sqlchat/sqlchat - A SQL client and editor based on chat functions for the next decade.
Exafunction/codeium.vim - A free and super fast Copilot alternative for Vim and Neovim.
varunshenoy/GraphGPT - Inferring knowledge graphs from unstructured text using GPT-3.
Nutlope/llamacoder - An open source version of Claude Artifacts, built using Llama 3.1 405B.
mckaywrigley/ai-code-translator - Leverages artificial intelligence to translate code between different languages.
shobrook/adrenaline - Interact with and visualize the code base.
QwenLM/Qwen2.5-Coder - Qwen2.5-Coder is the code version of Qwen (Tongyi Qianwen) 2.5, a large language model series developed by the Qwen team at Alibaba Cloud.
ricklamers/gpt-code-ui - An open source implementation of OpenAI's ChatGPT code interpreter.
gofireflyio/aiac - Artificial Intelligence Infrastructure-as-Code Generator.
defog-ai/sqlcoder - A state-of-the-art language model for converting natural language questions into SQL queries.
gptscript-ai/gptscript - Build an artificial intelligence assistant for interacting with your system.
RootbeerComputer/backend-GPT -
mpoon/gpt-repository-loader - Converts a code repository into a format friendly to large language model (LLM) prompts. Built mostly by GPT-4.
Canner/WrenAI - An open source artificial intelligence agent that lets data and product teams chat with their data through text-to-SQL to create charts, spreadsheets, reports and business intelligence (BI).
nus-apr/auto-code-rover - A project-structure-aware autonomous software engineer aiming for autonomous program improvement. It resolved a percentage of tasks in two benchmarks at a cost of less than $0.7 per task.
fern-api/fern - Takes OpenAPI as input and outputs SDKs (software development kits) and documentation.
georgia-tech-db/evadb - An application database system powered by artificial intelligence.
AbanteAI/mentat - Mentat - Artificial Intelligence Coding Assistant
emcf/engshell - An English-language shell powered by large language models (LLMs) that can be used on any operating system (OS).
AI-Citizen/SolidGPT - Developer persona AI search.
context-labs/autodoc - An experimental toolkit that automatically generates codebase documents using large language models (LLMs).
knuckleswtf/scribe - Generate API documentation for humans from the Laravel code base.
jina-ai/dev-gpt - Your virtual development team.
Pythagora-io/pythagora - Generate automated tests for Node.js applications using large language models (LLMs) without the need for developers to write any code.
eli64s/readme-ai - A readme generator powered by artificial intelligence.
mattzcarey/code-review-gpt - Code review powered by large language models (GPT4, Sonnet 3.5) and embeddings. Improves code quality and catches bugs before production, and runs in your Github/GitLab/Azure DevOps continuous integration (CI).
smallcloudai/refact - WebUI for fine-tuning and self-hosting open source large language models for coding.
eylonmiz/react-agent - An open source React.js autonomous large language model (LLM) agent.
gorilla-llm/gorilla-cli - Large language models (LLMs) for command line interfaces (CLIs).
huggingface/llm-vscode - Development powered by large language model (LLM) in VSCode.
peterw/Chat-with-Github-Repo - This repository has two Python scripts for creating chatbots through Streamlit, OpenAI GPT-3.5-turbo and Activeloop's Deep Lake.
paralleldrive/sudolang-llm-support - SudoLang Large Language Model (LLM) support in Visual Studio Code.
ricklamers/shell-ai - A command line interface (CLI) powered by LangChain for generating and running shell commands.
google/oss-fuzz-gen - Fuzzing powered by large language models (LLMs) via OSS-Fuzz.
kantord/SeaGOAT - A local-first semantic code search engine.
OpenAutoCoder/Agentless - An agentless method for automatically solving software development problems.
ferrislucas/promptr - Promptr is a command line interface (CLI) tool that modifies your code base based on instructions in plain English.

AI Tutorial

microsoft/generative-ai-for-beginners - 21 lessons to get started building with generative artificial intelligence. Link: https://microsoft.github.io/generative-ai-for-beginners/
openai/openai-cookbook - OpenAI API usage examples and guides.
mlabonne/llm-course - A course for getting into large language models (LLMs), with roadmaps and Colab notebooks.
rasbt/LLMs-from-scratch - Implement a ChatGPT-like large language model (LLM) in PyTorch from scratch, step by step.
lutzroeder/netron - A visualization tool for neural networks, deep learning and machine learning models.
datawhalechina/prompt-engineering-for-developers - An introduction to large language models (LLMs) for developers: the Chinese version of Andrew Ng's large model course series.
liguodongiot/llm-action - This project aims to share technical principles and practical experience related to large models (large model engineering and large-scale model application implementation).
stas00/ml-engineering - Open book on machine learning engineering.
mikeroyal/Self-Hosting-Guide - Self-hosting Guide: Host and manage software applications locally by individuals or organizations, covering cloud, large language models (LLMs) and more.
hua1995116/awesome-ai-painting - A collection of artificial intelligence painting materials, including domestic and foreign platforms, tutorials and news, such as Stable Diffusion, AnimateDiff, Stable Cascade, and SDXL Turbo.
Mooler0410/LLMsPracticalGuide - A list of practical guide resources for large language models (LLMs), including an LLMs tree, examples, and papers.
GoogleCloudPlatform/generative-ai - Sample code and notebooks for Generative AI using Gemini in Vertex AI on Google Cloud.
kyrolabs/awesome-langchain - Excellent list of tools and projects using LangChain framework.
microsoft/DeepSpeedExamples - Example models using DeepSpeed.
huggingface/alignment-handbook - Robust recipes for aligning language models with human and artificial intelligence preferences.
trigaten/Learn_Prompting - A guide to prompt engineering, generative artificial intelligence and large language models (LLMs), provided by Learn Prompting. Join its Discord for the largest prompt engineering learning community.
bbycroft/llm-viz - 3D visualization of GPT-style large language models.
ray-project/llm-numbers - Numbers that every large language model (LLM) developer should know.
luban-agi/Awesome-AIGC-Tutorials - Selected tutorials and resources for large language models, artificial intelligence painting, etc.
georgezouq/awesome-ai-in-finance - A range of carefully selected excellent large language models (LLMs), deep learning strategies and tools in the financial market.
howl-anderson/unlocking-the-power-of-llms - Make ChatGPT a powerful productivity tool using Prompts and Chains. Unlocking the potential of large language models (LLMs).
ashishpatel26/LLM-Finetuning - Fine-tuning large language models (LLMs) using PEFT (parameter-efficient fine-tuning).
ray-project/llm-applications - A comprehensive guide to developing large language model (LLM) applications based on retrieval-augmented generation (RAG).
premAI-io/state-of-open-source-ai - In the chaotic and fast-paced world of open source innovation, you need to have clear ideas.
pionxzh/chatgpt-exporter - Export and share your ChatGPT chat history.
ianand/spreadsheets-are-all-you-need -
majacinka/crewai-experiments - Experiments with local models and models accessible through application programming interfaces (APIs).
thu-vu92/local-llms-analyse-finance -

Workflow automation

KillianLucas/open-interpreter - A natural language interface for computers.
StanGirard/quivr - An opinionated retrieval-augmented generation (RAG) framework for integrating generative AI (GenAI) into your applications, so you can focus on your product rather than on RAG. It integrates easily into existing products, is customizable, and works with any large language model (LLM), vector store, and file type.
danielmiessler/fabric - Fabric is an open source framework for augmenting humans with AI. It provides a modular structure for solving specific problems using a crowdsourced set of AI prompts.
openai-translator/openai-translator - Browser and desktop application that uses the ChatGPT API for translation.
Skyvern-AI/skyvern - Automating browser-based tasks using large language models (LLMs) and computer vision technologies.
activepieces/activepieces - Your friendliest open source AI automation tool: a workflow automation tool with more than 200 integrations, and an enterprise automation alternative to Zapier.
OthersideAI/self-operating-computer - A framework for multimodal models to operate computers.
microsoft/UFO - A UI-focused agent for Windows OS interaction.
yihong0618/bilingual_book_maker - Use artificial intelligence translation methods to create bilingual epub books.
lavague-ai/LaVague - Large action model framework for developing AI web agents.
aisingapore/TagUI - A free robotic process automation (RPA) tool by AI Singapore.
openchatai/OpenCopilot - Language-to-actions engine.
KillianLucas/01 - Top open source voice interface for desktop, mobile and ESP32 chips.
katanaml/sparrow - Data processing using machine learning, large language models (LLMs), and vision LLMs.
xlang-ai/OpenAgents - OpenAgents (2024): an open platform for language agents in the wild.
BAAI-Agents/Cradle -
Cormanz/smartgpt - A program that enables large language models (LLMs) to complete complex tasks with the help of plug-ins.
fiatrete/OpenDAN-Personal-AI-OS - OpenDAN is an open source personal artificial intelligence operating system that integrates various artificial intelligence modules for personal use.
n4ze3m/page-assist - Use locally run AI models to assist in web browsing.
OS-Copilot/FRIDAY -
andrewnguonly/Lumos - A retrieval-augmented generation (RAG) LLM co-pilot for browsing the web, powered by local large language models (LLMs).
Dicklesworthstone/swiss_army_llama - A FastAPI service for semantic text search using precomputed embeddings and similarity measures, with support for various file types through textract.

AI robot

lencx/ChatGPT - ChatGPT desktop application for Mac, Windows and Linux systems.
LAION-AI/Open-Assistant - OpenAssistant is a chat-based assistant that can understand tasks, interact with third-party systems and retrieve information dynamically.
zhayujie/chatgpt-on-wechat - A chatbot built on large models. It supports multiple platforms (WeChat official accounts, Enterprise WeChat applications, Feishu, DingTalk, etc.) and multiple models (GPT-3.5/GPT-4o/GPT-o1/Claude/Wenxin Yiyan/iFlytek/Tongyi Qianwen/GLM-4/Kimi/LinkAI), can process text, voice, and images, can access the operating system and the internet, and supports customized enterprise customer service based on your own knowledge base.
Chanzhaoyu/chatgpt-web - A ChatGPT demo webpage built using Express and Vue3.
janhq/jan - Jan is an open source ChatGPT alternative that runs completely offline on your computer.
Bin-Huang/chatbox - A friendly desktop client application for AI models/large language models (such as GPT, Claude, Gemini, Ollama, etc.).
joonspk-research/generative_agents - Generative Agents: Interactive Simulacra of Human Behavior.
Unity-Technologies/ml-agents - The Unity ML-Agents Toolkit is an open source project for training agents in games and simulations through deep reinforcement learning and imitation learning.
transitive-bullshit/chatgpt-api - A standard library of AI agents compatible with any large language model (LLM) and TypeScript AI Software Development Kit (SDK).
leon-ai/leon - Leon is your open source personal assistant.
xcanwin/KeepChatGPT - This is a plugin that enhances the security and efficiency of ChatGPT data. It offers many free innovative features to provide a better AI experience.
lss233/chatgpt-mirai-qq-bot - One-click deployment! True AI chatbot, supporting multiple platforms and multiple functions.
getumbrel/llama-gpt - A self-hosted, offline, ChatGPT-like chatbot powered by Llama 2. It is private, with no data leaving your device, and now supports Code Llama.
sfyc23/EverydayWechat - WeChat Assistant: 1. Send customized messages to friends (girlfriends) regularly every day. 2. The robot automatically responds to friends. 3. Group assistant functions (such as garbage classification query, weather, calendar, real-time movie box office, express logistics, PM2.5, etc.).
BlinkDL/ChatRWKV - ChatRWKV is an open source language model like ChatGPT, but powered by RWKV, a 100% recurrent neural network.
ztjhz/BetterChatGPT - A great user interface for ChatGPT, available as a website and on Windows, macOS, and Linux.
a16z-infra/ai-town - An MIT-licensed starter kit for building AI towns where AI characters live, chat, and socialize.
memochou1993/gpt-ai-assistant - A GPT AI assistant built by combining OpenAI, LINE, and Vercel.
miurla/morphic - A search engine powered by artificial intelligence and with a generative user interface.
interstellard/chatgpt-advanced - WebChatGPT is a browser extension that augments ChatGPT prompts with web results.
linyiLYi/street-fighter-ai - This is an artificial intelligence agent for Street Fighter II Champion Edition.
vincelwt/chatgpt-mac - ChatGPT for Mac resides in your menu bar.
camel-ai/camel - CAMEL: the first and best multi-agent framework, aimed at finding the scaling laws of agents. (https://www.camel-ai.org)
MineDojo/Voyager - An open-ended embodied agent with large language models.
a16z-infra/companion-app - AI companions with memory: a lightweight stack for creating and hosting your own AI companions.
ConnectAI-E/Feishu-OpenAI - Feishu combined with GPT-4, GPT-4V, DALL·E-3, and Whisper for an enhanced work experience, including voice conversations, role-playing, multi-topic discussions, image creation, table analysis, and document export.
simonw/llm - Access large language models via command line.
sigoden/aichat - An all-in-one large language model (LLM) command line interface (CLI) tool featuring Shell Assistant, Chat-REPL, retrieval-augmented generation (RAG), AI tools, and agents, with access to OpenAI, Claude, and other platforms.
lencx/nofwl - No FireWall (NoFWL) desktop application.
Kent0n-Li/ChatDoctor -
xtekky/chatgpt-clone - ChatGPT interface with improved user interface.
deep-diver/LLM-As-Chatbot - Large Language Model (LLM) as a chatbot service.
gragland/chatgpt-chrome-extension - A ChatGPT Chrome extension that integrates ChatGPT into every text box on the internet.
ohmplatform/FreedomGPT - A React and Electron desktop application with a chat interface that runs the FreedomGPT large language model (LLM) locally on Mac and Windows.
SoraWebui/SoraWebui - SoraWebui is an open source Sora web client that makes it easy to create videos from text using OpenAI's Sora model.
karthink/gptel - A simple Emacs client using large language models.
a16z-infra/llama2-chatbot - LLaMA v2 chatbot.
ItsPi3141/alpaca-electron - The easiest way to run Alpaca and other LLaMA-based local large language models (LLMs) on your own PC.
opendilab/DI-star - An artificial intelligence platform for StarCraft II, with large-scale distributed training and master-level agents.
jncraton/languagemodels - Explore large language models with 512MB of memory.
SamurAIGPT/Camel-AutoGPT - Introduction to CAMEL, a role-playing method for large language models (LLMs) and automatic agents (auto-agents). It enables agents to collaborate and have potential in multiple fields.
Syan-Lin/CyberWaifu - A realistic chatbot composed of large language model (LLM) + voice synthesis (TTS), a QQ robot that supports emojis, QQ emoticons and Internet search.

Multimodal model

PaddlePaddle/PaddleOCR - A multilingual optical character recognition (OCR) toolkit based on PaddlePaddle. It is practical and ultra-lightweight, supports over 80 languages, and can be deployed on a variety of devices.
suno-ai/bark - A generative audio model prompted by text.
openai/CLIP - CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet for a given image.
hpcaitech/Open-Sora - Open-Sora: democratizing efficient video production for all.
haotian-liu/LLaVA - Visual Instruction Tuning (LLaVA), a NeurIPS'23 oral paper, built towards GPT-4V-level capabilities and beyond.
fishaudio/fish-speech - State-of-the-art open source text-to-speech (TTS) technology.
borisdayma/dalle-mini - DALL·E Mini generates images based on text prompts.
google-deepmind/alphafold - Open source code for AlphaFold 2.
OpenBMB/OmniLMM - MiniCPM-V 2.6: a GPT-4V-level multimodal large language model (MLLM) for single images, multiple images, and video on your phone.
PKU-YuanGroup/Open-Sora-Plan - This project aims to reproduce Sora (OpenAI's text-to-video model) and hopes that the open source community will contribute.
openai/shap-e - Generate 3D objects based on text or image.
facebookresearch/seamless_communication - Foundation models for state-of-the-art speech and text translation.
openai/DALL-E - PyTorch package for the discrete variational autoencoder (VAE) used in DALL·E.
google-research/vision_transformer -
magic-research/magic-animate - MagicAnimate (CVPR 2024): temporally consistent human image animation using a diffusion model.
ashawkey/stable-dreamfusion - Text-to-3D, image-to-3D, and mesh export using Neural Radiance Fields (NeRF) and diffusion.
lucidrains/imagen-pytorch - An implementation of Imagen, Google's text-to-image neural network, in PyTorch.
openai/jukebox - Code for the paper "Jukebox: A Generative Model for Music".
deep-floyd/IF -
netease-youdao/EmotiVoice - EmotiVoice: a multi-voice and prompt-controlled text-to-speech (TTS) engine.
IDEA-Research/GroundingDINO - The official implementation of the ECCV 2024 paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection".
FoundationVision/VAR - Official implementation of the NeurIPS 2024 oral paper "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An extremely simple, user-friendly, state-of-the-art code base for autoregressive image generation.
threestudio-project/threestudio - A unified framework for 3D content generation.
openai/guided-diffusion -
THUDM/CogVLM - A state-of-the-art open visual language model, a multimodal pre-trained model.
openai/consistency_models - Official repository for consistency models.
levihsu/OOTDiffusion - OOTDiffusion: outfitting fusion based on latent diffusion for controllable virtual try-on.
clovaai/donut - Official implementation of the OCR-free Document Understanding Transformer (Donut) and the Synthetic Document Generator (SynthDoG), ECCV 2022.
google/gemma_pytorch - The official PyTorch implementation of Google Gemma model.
QwenLM/Qwen-VL - The official repository of Qwen-VL (Tongyi Qianwen-VL), the chat and pre-trained large vision-language model from Alibaba Cloud.
yl4579/StyleTTS2 - StyleTTS 2: towards human-level text-to-speech through style diffusion and adversarial training with large speech language models.
snakers4/silero-models - Silero Models: pre-trained speech-to-text, text-to-speech, and text-enhancement models, made very simple to use.
salesforce/BLIP - PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for unified vision-language understanding and generation.
google-deepmind/alphageometry -
metavoiceio/metavoice-src - A foundation model for human-like, expressive text-to-speech (TTS).
Luodian/Otter - Otter is a multimodal model based on OpenFlamingo, trained on the MIMIC-IT dataset, with improved instruction-following and in-context learning capabilities.
NExT-GPT/NExT-GPT - NExT-GPT: an any-to-any multimodal large language model.
openai/improved-diffusion - Code release for Improved Denoising Diffusion Probabilistic Models.
X-PLUG/MobileAgent - Mobile-Agent: a powerful family of mobile device operation assistants.
dvlab-research/MiniGemini - Official repository for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models".
lucidrains/musiclm-pytorch - An implementation of MusicLM, Google's state-of-the-art music generation model using attention networks, in PyTorch.
hustvl/Vim - Vision Mamba (ICML 2024): efficient visual representation learning with bidirectional state space models.
OpenGVLab/Ask-Anything - CVPR 2024 highlight. VideoChatGPT gives ChatGPT video understanding, and other language models such as miniGPT4, StableLM, and MOSS are also supported.
microsoft/lida - Automatic generation of visualizations and infographics using large language models.
google-research/frame-interpolation - FILM: Frame Interpolation for Large Motion (ECCV 2022).
InternLM/InternLM-XComposer - InternLM-XComposer2.5-OmniLive: a multimodal system for long-term video and audio interaction.
yerfor/GeneFace - GeneFace: generalized and high-fidelity 3D talking face synthesis (ICLR 2023), official code.
OpenGVLab/InternImage - InternImage (CVPR 2023): exploring large-scale vision foundation models with deformable convolutions.
google-deepmind/gemma - Google DeepMind's open weight large language model.
baaivision/EVA - EVA series: visual representation fantasies from the Beijing Academy of Artificial Intelligence (BAAI).
MzeroMiko/VMamba - VMamba: Visual state space model. Its code is based on Mamba.
deepseek-ai/DeepSeek-VL - DeepSeek-VL: towards real-world vision-language understanding.
openai/consistencydecoder - Consistency-distilled Diff VAE.
gligen/GLIGEN - Open-set grounded text-to-image generation.
dvlab-research/LISA - Project page for "LISA: Reasoning Segmentation via Large Language Model".
3DTopia/LGM - LGM (ECCV 2024 oral): a model for high-resolution 3D content creation.
lyuchenyang/Macaw-LLM - Macaw-LLM: multimodal language modeling that integrates images, video, audio, and text.
OpenMotionLab/MotionGPT - MotionGPT (NeurIPS 2023): a unified motion-language generation model using large language models (LLMs).
OpenGVLab/InternVideo - Video foundation models and data for multimodal understanding (ECCV 2024).
openai/Video-Pre-Training - Video Pre-training (VPT) involves learning actions by observing unlabeled online videos.
THUDM/ImageReward - ImageReward (NeurIPS 2023): learning and evaluating human preferences for text-to-image generation.
evo-design/evo - Biological foundation modeling from molecular to genome scale.
google-research/tapas - An end-to-end neural model for understanding tables and texts.
apple/ml-aim - This repository provides code and model checkpoints for AIMv1 and AIMv2 research projects.
showlab/Show-o - Show-o code base, a single Transformer for unified multimodal understanding and generation.
ELLA-Diffusion/ELLA - Equip diffusion models with large language models (LLMs) to enhance semantic alignment.
declare-lab/tango - A family of diffusion models for text-to-audio generation.
OpenBMB/VisCPM - A Chinese-English bilingual multimodal large model series (chat and painting) based on the CPM foundation models.
OpenGVLab/VisionLLM - Visual Large Language Model (VisionLLM) series.
BAAI-DCAI/Bunny - Lightweight multimodal model family.
Ligo-Biosciences/AlphaFold3 - Open source implementation of AlphaFold3.
Vchitect/SEINE - SEINE (ICLR 2024): a short-to-long video diffusion model for generative transition and prediction.
google-deepmind/materials_discovery -
OpenGVLab/SAM-Med2D - Official implementation of SAM-Med2D.
OpenMOSS/AnyGPT - Code for "AnyGPT: Unified Multimodal Large Language Model (LLM) with Discrete Sequence Modeling".

Multilingual Model

THUDM/ChatGLM-6B - ChatGLM-6B is an open bilingual dialogue language model.
ymcui/Chinese-LLaMA-Alpaca - Chinese LLaMA and Alpaca large language models, with local CPU/GPU training and deployment.
UKPLab/sentence-transformers - State-of-the-art text embeddings.
FlagAlpha/Llama2-Chinese - The Llama Chinese community. Llama3 online demos and fine-tuned models are available, the latest Llama3 learning materials are collected, and all code has been updated for Llama3. Fully open source and available for commercial use.
THUDM/ChatGLM3 - ChatGLM3 series: open source bilingual chat large language models.
ymcui/Chinese-LLaMA-Alpaca-2 - Phase II of the Chinese LLaMA-2 and Alpaca-2 large model project, including 64K long-context models.
InternLM/InternLM - Official release of the InternLM2.5 base and chat models, supporting a 1 million (1M) token context.
Facico/Chinese-Vicuna - Chinese-Vicuna: a Chinese instruction-following model based on LLaMA. A low-resource Chinese LLaMA + LoRA solution whose structure follows Alpaca.
LC1332/Luotuo-Chinese-LLM - Luotuo is an open source Chinese language model developed by Chen Qiyuan of Central China Normal University, Li Lulu and Leng Zi'ang of SenseTime.
wenge-research/YAYI2 - YAYI 2 is a new generation of open source large language models developed by Zhongke Wenge, pre-trained on more than 2 trillion tokens of high-quality, multilingual corpus.
wenge-research/YaYi - The YaYi large model, developed by the Zhongke Wenge algorithm team, provides safe and reliable dedicated large models for customers. It is a series of models based on LLaMA 2 and BLOOM, trained on large-scale Chinese and English multi-domain instruction data.
TigerResearch/TigerBot - TigerBot is a large language model that supports multiple languages and tasks.
LinkSoul-AI/Chinese-Llama-2-7b - The first downloadable and operational Chinese LLaMA2 model in the open source community!
MiuLab/Taiwan-LLM - Traditional Mandarin language model for Taiwan.
zjunlp/KnowLM - An open source knowledgeable large language model framework.
google-research/multilingual-t5 -
SkyworkAI/Skywork - The Skywork series model is pre-trained on 3.2TB of multilingual and code data, and related projects are open source.

Data organization

photoprism/photoprism - An AI-powered photos app for the decentralized web.
freedmand/semantra - A multi-functional tool for semantic search.
neo4j/NaLLM - Repository of NaLLM projects.

AI Services

vllm-project/vllm - A high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
guillaumekln/faster-whisper - Faster Whisper transcription using CTranslate2.
bentoml/OpenLLM - Run open source large language models (LLMs) like Llama and Mistral as OpenAI-compatible API endpoints in the cloud.
huggingface/text-generation-inference - Large language model text generation inference.
FMInference/FlexGen - Run large language models on a single GPU in a throughput-focused scenario.
triton-inference-server/server - Triton inference server provides optimized inference solutions for both the cloud and the edge.
dusty-nv/jetson-inference - The Hello AI World guide to deploying deep learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
openvinotoolkit/openvino - OpenVINO™ is an open source AI inference optimization and deployment toolkit.
zilliztech/GPTCache - Semantic cache for large language models (LLMs), fully integrated with LangChain and llama_index.
Portkey-AI/gateway - A very fast AI gateway with integrated guardrails. Route to many large language models (LLMs) and AI guardrails through a single API.
tensorflow/serving - A flexible, high-performance serving system for machine learning models.
xorbitsai/inference - Replace OpenAI GPT with another large language model (LLM) in your application by changing a single line of code. Xinference lets you run inference with a wide range of models anywhere.
allegroai/clearml - ClearML is an MLOps/LLMOps solution for artificial intelligence workloads (including experiments and data management, etc.).
InternLM/lmdeploy - LMDeploy is a toolkit for compressing, deploying, and serving large language models.
argmaxinc/WhisperKit - On-device speech recognition for Apple silicon.
kserve/kserve - A standardized serverless machine learning inference platform based on Kubernetes.
neuralmagic/deepsparse - A sparsity-aware deep learning inference runtime for CPUs.
huggingface/text-embeddings-inference - A very fast inference solution for text embedding models.
open-mmlab/mmdeploy - OpenMMLab's model deployment framework.
ModelTC/lightllm - LightLLM is a large language model (LLM) inference and service framework based on Python. It is lightweight, easy to scale and fast.
predibase/lorax - A multi-LoRA inference server that can be scaled to thousands of fine-tuned large language models (LLMs).
langchain-ai/langserve - LangServe.
S-LoRA/S-LoRA - S-LoRA: serving thousands of concurrent LoRA adapters.
michaelfeil/infinity - Infinity is a serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali, with high throughput and low latency.
roboflow/inference - Turn any computer or edge device into a command center for your computer vision projects.
ray-project/ray-llm - RayLLM: large language models (LLMs) on Ray.
PygmalionAI/aphrodite-engine - A large-scale large language model (LLM) inference engine.
punica-ai/punica - Serving multiple LoRA fine-tuned large language models as one.
msoedov/langcorn - Automatically serve LangChain large language model (LLM) applications and agents using FastAPI, for LLMOps.
mosecorg/mosec - A high-performance machine learning model serving framework with dynamic batching and CPU/GPU pipelines to make full use of your compute hardware.
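
Several of the serving projects above mention dynamic batching, where incoming requests are grouped so the model runs once per batch rather than once per request. As a rough illustration only, the grouping rule can be sketched as an offline simulation in Python. The function name `dynamic_batches` and its parameters are hypothetical and are not taken from any listed project; real servers apply the same kind of rule to a live request queue.

```python
def dynamic_batches(arrivals, max_batch_size=4, max_wait=0.010):
    """Group (arrival_time_seconds, request) pairs into batches.

    A batch is closed when it is full, or when the next request arrives
    more than max_wait seconds after the first request in the batch.
    This is an illustrative simulation, not any library's implementation.
    """
    batches = []
    current = []
    opened_at = None
    for arrival_time, request in sorted(arrivals, key=lambda pair: pair[0]):
        is_full = len(current) >= max_batch_size
        if current and (is_full or arrival_time - opened_at > max_wait):
            batches.append(current)  # close the batch and hand it to the model
            current = []
        if not current:
            opened_at = arrival_time  # this request opens a new batch
        current.append(request)
    if current:
        batches.append(current)  # flush whatever is left at the end
    return batches


# Hypothetical toy trace: three requests close together, then two more later.
TOY_ARRIVALS = [(0.000, "a"), (0.002, "b"), (0.004, "c"), (0.030, "d"), (0.031, "e")]
print(dynamic_batches(TOY_ARRIVALS, max_batch_size=2, max_wait=0.010))
```

A larger `max_wait` tends to produce fuller batches at the cost of added latency for the first request in each batch, which is the trade-off these frameworks expose through their own configuration.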

Vector database

facebookresearch/faiss - A library for efficient dense vector similarity search and clustering.
milvus-io/milvus - Milvus is a high-performance, cloud-native vector database for scalable vector approximate nearest neighbor search.
qdrant/qdrant - Qdrant is a high-performance, large-scale vector database and search engine for the next generation of artificial intelligence, and also provides cloud services.
chroma-core/chroma - The AI-native open source embedding database.
spotify/annoy - Approximate nearest neighbors in C++/Python, optimized for memory usage and for loading from and saving to disk.
weaviate/weaviate - Weaviate is an open source vector database that stores both objects and vectors, combining vector search with structured filtering, fault tolerance, and scalability.
neuml/txtai - An all-in-one open source embeddings database for semantic search, large language model (LLM) orchestration, and language model workflows.
activeloopai/deeplake - A database for AI that stores vectors, images, text, videos, and other data types. It can be used with LLMs and LangChain to store, query, version, and visualize AI data, and can stream data in real time to PyTorch/TensorFlow.
vespa-engine/vespa - Artificial Intelligence + data, available online at https://vespa.ai.
lancedb/lancedb - A developer-friendly, serverless vector database for AI applications that makes it easy to add long-term memory to large language model (LLM) applications.
marqo-ai/marqo - A unified embedding generation and search engine, also available as a cloud service at cloud.marqo.ai.
nmslib/hnswlib - A header-only C++/Python library for fast approximate nearest neighbor search.
unum-cloud/usearch - A fast open source search and clustering engine for multiple programming languages.
tensorchord/pgvecto.rs - Scalable, low-latency, hybrid-enabled vector search in Postgres. It aims to revolutionize vector search, not the database.
spotify/voyager - A Python and Java library for approximate nearest neighbor searches, focusing on ease of use, simplicity, and deployability.
rapidsai/raft - RAFT contains fundamental algorithms and primitives for machine learning and information retrieval (IR), accelerated with CUDA for high-performance applications.
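
Many of the entries above describe approximate nearest neighbor search over embedding vectors. For orientation, the exact version of that problem can be sketched in a few lines of Python. This is a brute-force scan using cosine similarity, not the algorithm used by any of the listed libraries, and the names `cosine_similarity`, `top_k`, and `TOY_VECTORS` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=2):
    """Return the ids of the k stored vectors most similar to the query.

    Every stored vector is scored, so the cost grows linearly with the
    collection size; approximate indexes exist to avoid this full scan.
    """
    scored = [(cosine_similarity(query, vec), vec_id) for vec_id, vec in vectors.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [vec_id for _, vec_id in scored[:k]]


# Hypothetical toy "embeddings" keyed by id.
TOY_VECTORS = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}
print(top_k([1.0, 0.0, 0.0], TOY_VECTORS, k=2))
```

The libraries and databases in this section generally trade a small amount of recall for much faster queries on large collections, and add persistence, filtering, and scaling on top.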

AI Education

JushBJJ/Mr.-Ranedeer-AI-Tutor - A GPT-4 AI tutor prompt for customizable, personalized learning experiences.
Nutlope/llamatutor - An AI personal tutor built with Llama 3.1.
codeacme17/examor - Lets large language models (LLMs) assist learning for students, scholars, interviewees, and lifelong learners.

AI development tools

jina-ai/jina - Use cloud native technology stack to build multimodal artificial intelligence applications.
iterative/dvc - Data versioning and machine learning experiments.
unifyai/ivy - Convert machine learning code between different frameworks.
HigherOrderCO/HVM - A massively parallel, optimal functional runtime written in Rust.
marimo-team/marimo - A reactive notebook for Python: run reproducible experiments, execute notebooks as scripts, deploy them as applications, and version them with Git.
arogozhnikov/einops - Flexible and powerful tensor operations for readable and reliable code, suitable for PyTorch, JAX, TensorFlow, and others.
replicate/cog - Containers for machine learning.
jessevig/bertviz - BertViz: Visualize attention in natural language processing models such as BERT, GPT2, BART, etc.
AbdBarho/stable-diffusion-webui-docker - Easy Docker setup for Stable Diffusion with a user-friendly interface.
huggingface/safetensors - A simple, safe way to store and distribute tensors.
wangzhaode/mnn-llm - A large language model (LLM) deployment project based on MNN.
ajndkr/lanarky - A web framework for building large language model (LLM) microservices.

Model training

tensorflow/tensorflow - An open source machine learning framework for everyone.
huggingface/transformers - Transformers: state-of-the-art machine learning for PyTorch, TensorFlow, and JAX.
pytorch/pytorch - Tensors and dynamic neural networks in Python with strong GPU acceleration.
hpcaitech/ColossalAI - Making large AI models cheaper, faster, and more accessible.
hiyouga/LLaMA-Factory - Unified and efficient fine-tuning of more than 100 large language models (LLMs) (ACL 2024)
lm-sys/FastChat - An open platform for training, serving, and evaluating large language models, and the release repository for Vicuna and Chatbot Arena.
coqui-ai/TTS - A deep learning toolkit for text-to-speech that has been validated in research and production.
microsoft/DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
ray-project/ray - Ray is an AI computing engine with core distributed runtime and AI libraries for accelerating machine learning workloads.
google-research/google-research - Google Research
google/jax - Composable transformations of Python+NumPy programs: differentiate, vectorize, just-in-time (JIT) compile to GPU/TPU, and more.
open-mmlab/mmdetection - OpenMMLab detection toolbox and benchmark.
tinygrad/tinygrad - If you like PyTorch and Micrograd, then you will also like Tinygrad.
huggingface/diffusers - Diffusers: state-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
mozilla/DeepSpeech - DeepSpeech is an open source voice-to-text engine that can be used in real time on a variety of devices.
modularml/mojo - The Mojo programming language.
microsoft/unilm - Large-scale self-supervised pre-training across tasks, languages, and modalities.
ml-explore/mlx - MLX is an array framework for Apple silicon.
HigherOrderCO/Bend - A massively parallel, high-level programming language.
huggingface/peft - PEFT: state-of-the-art parameter-efficient fine-tuning.
huggingface/candle - A minimalist machine learning framework for the Rust language.
NVIDIA/NeMo - A scalable generative AI framework for researchers and developers working on large language models (LLMs), multimodal AI, and speech AI.
PaddlePaddle/PaddleNLP - An easy-to-use and powerful natural language processing (NLP) and large language model (LLM) library with a large number of excellent models that support a wide range of natural language processing tasks from research to industrial applications.
PaddlePaddle/PaddleSpeech - An easy-to-use voice toolkit contains multiple features and won the NAACL2022 Best Demo Award.
Lightning-AI/litgpt - More than 20 high-performance large language models (LLMs), with recipes for pre-training, fine-tuning, and deploying at scale.
huggingface/trl - Use reinforcement learning to train Transformer language models.
artidoro/qlora - QLoRA: efficient fine-tuning of quantized large language models.
salesforce/LAVIS - LAVIS: a one-stop library for language-vision intelligence.
nerfstudio-project/nerfstudio - A collaboration-friendly studio for neural radiance fields (NeRFs).
mozilla/TTS - Deep learning for speech synthesis (discussion forum: https://discourse.mozilla.org/c/tts)
tracel-ai/burn - Burn is a new dynamic deep learning framework built in Rust, with flexibility, efficiency, and portability as its goals.
facebookresearch/pytorch3d - PyTorch3D is a library from FAIR (Facebook Artificial Intelligence Research) for deep learning with 3D data.
facebookresearch/xformers - Hackable and optimized Transformer building blocks that support composable construction.
OptimalScale/LMFlow - An extensible toolkit for fine-tuning and inference of large foundation models. Large models for all.
OpenAccess-AI-Collective/axolotl - Go ahead and axolotl questions.
FlagOpen/FlagEmbedding - Retrieval and retrieval-augmented large language models (LLMs).
huggingface/accelerate - A simple way to run PyTorch models on a variety of devices and configurations, with automatic mixed precision and support for fully sharded data parallelism (FSDP) and DeepSpeed.
LianjiaTech/BELLE - BELLE (Be Everyone's Large Language model Engine): an open source Chinese dialogue large language model.
cloneofsimo/lora - Quickly fine-tune diffusion models using low-rank adaptation (LoRA).
EleutherAI/gpt-neox - An implementation of model-parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries.
open-mmlab/mmagic - OpenMMLab's multimodal toolbox for AI-generated content (AIGC) and other tasks, with easy-to-use APIs and a model zoo.
facebookresearch/metaseq - Repository for external large-scale work.
Maartengr/BERTopic - Uses BERT and c-TF-IDF to create easily interpretable topics.
Project-MONAI/MONAI - An AI toolkit for healthcare imaging.
yangjianxin1/Firefly - Firefly is a large model training tool that supports training multiple large models, such as Qwen2.5, Qwen2, etc.
google-deepmind/graphcast -
mosaicml/composer - Enhance your model training.
cg123/mergekit - Tools for merging pre-trained large language models.
CarperAI/trlx - A code base for distributed training of language models with reinforcement learning from human feedback (RLHF).
pytorch/torchtune - A PyTorch native post-training library.
google-deepmind/open_spiel - OpenSpiel is a set of environments and algorithms for general reinforcement learning and game search/planning research.
huggingface/autotrain-advanced - AutoTrain Advanced.
InternLM/xtuner - An efficient, flexible and full-featured toolkit for fine-tuning a wide range of large language models (LLMs).
mosaicml/llm-foundry - Large language model (LLM) training code for Databricks foundation models.
baidu-research/warp-ctc - Fast parallel Connectionist Temporal Classification (CTC).
JohnSnowLabs/spark-nlp - State-of-the-art natural language processing.
FlagAI-Open/FlagAI - FlagAI is a toolkit for large-scale models that is fast, easy to use and scalable.
mlfoundations/open_flamingo - An open source framework for training large multimodal models.
OpenLLMAI/OpenRLHF - An easy-to-use, scalable and high-performance framework for reinforcement learning from human feedback (RLHF), supporting full proximal policy optimization (PPO) fine-tuning at the 70-billion-parameter scale, iterative direct preference optimization (DPO), low-rank adaptation (LoRA), RingAttention, and rejection sampling fine-tuning (RFT).
google-deepmind/acme - A library of reinforcement learning components and agents.
open-mmlab/mmpretrain - Pre-training toolbox and benchmarks for OpenMMLab.
shibing624/MedicalGPT - MedicalGPT trains medical GPT models with a ChatGPT-style training pipeline, implementing incremental pre-training, supervised fine-tuning, reinforcement learning from human feedback (RLHF), direct preference optimization (DPO), and odds ratio preference optimization (ORPO).
iryna-kondr/scikit-llm - Seamlessly integrate large language models (LLMs) into scikit-learn.
google-research/scenic - Scenic: A Jax library for computer vision research and more.
facebookresearch/fairscale - PyTorch extension for high performance and large-scale training.
alpa-projects/alpa - Training and serving large-scale neural networks through automatic parallelization.
microsoft/torchscale - Foundation architecture for (multimodal) large language models.
google-deepmind/dm-haiku - A neural network library based on JAX.
eureka-research/Eureka - The official repository of the ICLR 2024 paper "Eureka: Human-Level Reward Design via Coding Large Language Models".
Alpha-VLLM/LLaMA2-Accessory - An open source toolkit for developing large language models.
google-research/t5x -
google-deepmind/alphatensor -
PhoebusSi/Alpaca-CoT -
huggingface/optimum - Accelerate inference and training of Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools.
stochasticai/xTuring - Easily build, customize and control your own large language models (LLMs), from data preprocessing to fine-tuning.
adapter-hub/adapters - A unified library for parameter-efficient and modular transfer learning.
openai/weak-to-strong -
OpenPipe/OpenPipe - Turn expensive prompts into cheap fine-tuned models.
lamini-ai/lamini - The official Python client for the Lamini API.
google-research/big_vision - Official code base for developing vision transformers, SigLIP, multi-layer perceptron mixers (MLP-Mixer), LiT, etc.
young-geng/EasyLM - EasyLM (based on JAX/Flax) provides a one-stop solution for pre-training, fine-tuning, evaluating and serving large language models (LLMs).
pyro-ppl/numpyro - Probabilistic programming with NumPy, powered by JAX for automatic differentiation and just-in-time (JIT) compilation to GPU/TPU/CPU.
eric-mitchell/direct-preference-optimization - A reference implementation of direct preference optimization (DPO).
huggingface/setfit - Efficient few-shot learning with Sentence Transformers.
allenai/open-instruct -
allenai/RL4LMs - A modular reinforcement learning (RL) library for fine-tuning language models according to human preferences.
lxe/simple-llm-finetuner - A simple user interface for fine-tuning large language models (LLMs).
THUDM/P-tuning-v2 - An optimized deep prompt tuning strategy that is comparable to fine-tuning across model scales and tasks.
tensorflow/privacy - A library for training machine learning models with privacy for the training data.
xlang-ai/instructor-embedding - One embedder for any task: instruction-fine-tuned text embeddings (ACL 2023).
unslothai/hyperlearn - Machine learning algorithms are 2-2000 times faster, memory usage is reduced by 50%, and can run on all hardware.
salesforce/ctrl - A conditional Transformer language model for controllable generation.
google-deepmind/optax - Optax is a JAX library for gradient processing and optimization.
google-deepmind/penzai - A JAX toolkit for building, modifying and visualizing neural networks.
microsoft/i-Code -
kubeflow/training-operator - Distributed machine learning training and fine-tuning on Kubernetes.
AetherCortex/Llama-X - Open academic research on improving LLaMA into a state-of-the-art large language model.
salesforce/ALBEF - Code for ALBEF, a new vision-language pre-training method.
kubeflow/katib - Automated machine learning on Kubernetes.
facebookresearch/multimodal - TorchMultimodal is a PyTorch library for training state-of-the-art multimodal, multi-task models at scale.
jina-ai/finetuner - Task-oriented embedding tuning for BERT, CLIP, etc.
salesforce/CodeTF - CodeTF: One-stop Transformer library for the latest code large language models (LLM).
AnswerDotAI/fsdp_qlora - Train large language models (LLMs) using QLoRA and fully sharded data parallelism (FSDP).
nerfstudio-project/nerfacc - A general neural radiance field (NeRF) acceleration toolbox based on PyTorch.
jquesnelle/yarn - YaRN: Efficient context window extension for large language models.
PKU-Alignment/safe-rlhf - Safe RLHF: constrained value alignment via safe reinforcement learning from human feedback.
lucidrains/self-rewarding-lm-pytorch - An implementation of the training framework proposed by MetaAI in the paper "Self-Rewarding Language Models".
OpenLMLab/MOSS-RLHF - Code for "Secrets of RLHF in Large Language Models, Part I: PPO" (proximal policy optimization).
JonasGeiping/cramming - Cramming the training of a BERT-type language model into limited compute.
AlibabaResearch/DAMO-ConvAI - DAMO-ConvAI is the official repository containing the code base for Alibaba DAMO Conversational AI.
databricks/megablocks -
AGI-Edgerunners/LLM-Adapters - Code for the EMNLP 2023 paper "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models".
KhoomeiK/LlamaGym - Fine-tune large language model (LLM) agents through online reinforcement learning.
thunlp/OpenDelta - A plug-and-play library for parameter-efficient tuning (Delta Tuning).
Liuhong99/Sophia - Official implementation of the paper "Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training".
yuchenlin/LLM-Blender - [ACL 2023] LLM-Blender is an ensembling framework that cuts weaknesses through ranking and integrates strengths through generative fusion to enhance the capabilities of large language models (LLMs).
google-deepmind/xmanager - A platform for managing machine learning experiments.
google-deepmind/chex -

AI image generation

AUTOMATIC1111/stable-diffusion-webui - Stable diffusion Web user interface.
lllyasviel/Fooocus - Focus on prompting and generating.
upscayl/upscayl - Upscayl is the #1 free and open source AI image upscaler for Linux, macOS and Windows.
s0md3v/roop - One-click face swap.
invoke-ai/InvokeAI - Invoke is a creative engine for the Stable Diffusion model that provides a WebUI (user interface) and serves as the basis for commercial products.
facefusion/facefusion - An industry-leading facial processing platform.
Sanster/lama-cleaner - An image inpainting tool powered by state-of-the-art AI models that can remove unwanted objects from pictures or replace things in them.
Mikubill/sd-webui-controlnet - WebUI extension for ControlNet.
camenduru/stable-diffusion-webui-colab - Stable Diffusion web UI on Google Colab.
divamgupta/diffusionbee-stable-diffusion-ui - Diffusion Bee is the easiest way to run Stable Diffusion locally on an M1 Mac. It has a one-click installer that requires no dependencies or technical knowledge.
Baiyuetribe/paper2gui - Convert AI papers to graphical user interface (GUI) so that everyone can easily use AI technology.
easydiffusion/easydiffusion - The easiest one-click way to create beautiful artwork on your PC using AI. Enter a text prompt in the browser UI to generate images; no technical knowledge required.
Stability-AI/StableStudio - Community interface for generative artificial intelligence.
carson-katri/dream-textures - Stable Diffusion integrated into Blender.
TheLastBen/fast-stable-diffusion - Fast-stable-diffusion and DreamBooth.
godly-devotion/MochiDiffusion - Run Stable Diffusion locally on your Mac.
HumanAIGC/OutfitAnyone - Ultra-high-quality virtual try-on for any clothing and any person.
sensity-ai/dot - The Deepfake Offensive Toolkit.
leap-ai/headshots-starter -
Nutlope/restorePhotos - Restore old and blurry face photos using AI.
jina-ai/discoart - Create Disco Diffusion artwork.
mlc-ai/web-stable-diffusion - Brings Stable Diffusion models to web browsers; everything runs inside the browser with no server support.
all-in-aigc/aicover - A tool for generating AI covers.

Dataset

huggingface/datasets - The largest hub of ready-to-use datasets for machine learning models, with fast, easy-to-use and efficient data manipulation tools.
BuilderIO/gpt-crawler - Crawl website content from a URL to generate the knowledge file needed to create a custom GPT.
joke2k/faker - Faker is a Python package for generating fake data.
DS4SD/docling - Get your documentation ready for generative artificial intelligence.
openai/tiktoken - tiktoken is a fast BPE tokenizer for use with OpenAI models.
cleanlab/cleanlab - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
karpathy/minbpe - Minimal, clean code for the byte-pair encoding (BPE) algorithm commonly used in large language model (LLM) tokenization.
huggingface/tokenizers - Fast, state-of-the-art tokenizers optimized for research and production.
arsenetar/dupeguru - Find duplicate files.
QuivrHQ/MegaParse - Optimized file parser for lossless large language model (LLM) ingestion, able to parse PDF, Docx, PPTx files into an ideal format suitable for large language model (LLM).
Togethercomputer/RedPajama-Data - The RedPajama-Data repository contains code for preparing large datasets for training large language models.
lk-geimfari/mimesis - Mimesis is a Python data generator that can create various types of fake data in multiple languages.
Instruction-Tuning-with-GPT-4/GPT-4-LLM - Instruction tuning with GPT-4.
yizhongw/self-instruct - Aligns pretrained language models with instruction data generated by the models themselves.
dedupeio/dedupe - A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.
argilla-io/argilla - Argilla is a tool that allows AI engineers and domain experts to create high-quality datasets.
mshumer/gpt-llm-trainer -
life4/textdistance - Compute distances between sequences: over 30 algorithms in pure Python with a common interface and optional use of external libraries.
Docta-ai/docta - A doctor is here to take care of your data.
alibaba/data-juicer - Makes data higher-quality, richer and easier to process for foundation models.
towhee-io/towhee - Towhee is a framework that makes neural data processing pipelines simple and fast.
QData/TextAttack - TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in natural language processing (NLP).
seatgeek/thefuzz - Fuzzy string matching in Python.
ekzhu/datasketch - MinHash, LSH (locality-sensitive hashing), LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW (hierarchical navigable small world graphs).
thunlp/UltraChat - Large-scale, informative and diverse multi-turn chat data and models.
modAL-python/modAL - A modular active learning framework written in Python.
chiphuyen/lazynlp - A library for crawling and cleaning web pages to generate large data sets.
huggingface/datatrove - Provides a set of platform-independent customizable pipeline processing modules, so that data processing no longer depends on scripting.
refuel-ai/autolabel - Use large language models to label, clean and enrich text datasets.
google-deepmind/code_contests -
Tencent/MedicalNet - Many studies have shown that the amount of training data can significantly affect the performance of deep learning. The MedicalNet project provides 3D-ResNet pre-trained models and code.
argilla-io/distilabel - Distilabel is a synthetic data and artificial intelligence feedback framework for engineers who need to build fast, reliable and scalable pipelines based on proven research papers.
google-deepmind/mathematics_dataset - This dataset code generates mathematical question and answer pairs, from a range of question types, at roughly school-level difficulty.
openai/prm800k - 800,000 step-level correctness labels on large language model (LLM) solutions to math problems.
salesforce/WikiSQL - A large annotated semantic parsing corpus for developing natural language interfaces.
anthropics/hh-rlhf - Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback".
moj-analytical-services/splink - Fast, accurate and scalable probabilistic data linkage, supporting multiple SQL backends.
dleemiller/WordLlama - Things you can do with the token embeddings of a large language model (LLM).
AI4Finance-Foundation/FinRL-Meta - FinRL - Meta provides dynamic data sets and market environment for FinRL.
tensorflow/text - Making text a first-class citizen in TensorFlow.
google-research/deduplicate-text-datasets -
allenai/dolma - Data and tools for creating and inspecting OLMo pretrained data.
lilacai/lilac - Curate better data for large language models.
1e0ng/simhash - A Python implementation of Simhash algorithm.
J535D165/recordlinkage - A powerful and modular Python toolkit for record linkage and duplicate detection.
google-deepmind/tree - Tree is a library for processing nested data structures.
xtreme1-io/xtreme1 - Xtreme1 is an integrated multimodal data training platform that supports 3D lidar point cloud, image and large language models (LLM) for data labeling and annotation.
datadreamer-dev/DataDreamer - DataDreamer: Prompt, generate synthetic data, train and align models.
HazyResearch/meerkat - Creative and interactive views of all datasets.

Model evaluation

openai/evals - Evals is a framework for evaluating large language models (LLMs) and their systems, and is also an open source benchmarking registry.
explodinggradients/ragas - Supercharge your evaluation of large language model (LLM) applications.
EleutherAI/lm-evaluation-harness - A framework for few-shot evaluation of language models.
erikbern/ann-benchmarks - Benchmarks for approximate nearest neighbor libraries in Python.
Trusted-AI/adversarial-robustness-toolbox - Adversarial Robustness Toolbox (ART) is a Python library for machine learning security, covering evasion, poisoning, extraction and inference attacks, for both red and blue teams.
open-compass/opencompass - OpenCompass is a large language model (LLM) evaluation platform that supports more than 100 data sets and a variety of models such as Llama3 and Mistral.
Arize-ai/phoenix - Observability and evaluation of artificial intelligence.
NVIDIA/NeMo-Guardrails - NeMo Guardrails is an open source toolkit that easily adds programmable guardrails to conversational systems based on large language models (LLM).
confident-ai/deepeval - Large Language Model (LLM) evaluation framework.
Giskard-AI/giskard - Open source evaluation and testing of artificial intelligence and large language model (LLM) systems.
fchollet/ARC - The Abstraction and Reasoning Corpus (ARC).
llm-attacks/llm-attacks - Universal and transferable attacks on aligned language models.
leondz/garak - Large Language Model (LLM) Vulnerability Scanner.
jeinlee1991/chinese-llm-benchmark - A capability leaderboard for Chinese large models covering 134 commercial and open source models, with capability scores and original outputs.
google/BIG-bench - Beyond the Imitation Game: a collaborative benchmark for measuring and extrapolating the capabilities of language models.
meta-llama/PurpleLlama - A set of tools for evaluating and enhancing the security of large language models (LLMs).
openai/human-eval - Code for the paper "Evaluating Large Language Models Trained on Code".
salesforce/decaNLP - The Natural Language Decathlon: a multitask challenge for natural language processing.
THUDM/AgentBench - A comprehensive benchmark for evaluating large language models (LLMs) as agents (ICLR 2024).
truera/trulens - Evaluation and tracking in large language model (LLM) experiments.
princeton-nlp/SWE-bench - [ICLR 2024] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Lightning-AI/torchmetrics - Machine learning metrics for distributed and scalable PyTorch applications.
openai/simple-evals -
huggingface/evaluate - A library called Evaluate for easy evaluation of machine learning models and datasets.
embeddings-benchmark/mteb - MTEB is a large-scale text embedding benchmark.
Azure/PyRIT - PyRIT is an open source framework that helps security professionals and engineers proactively identify risks in generative artificial intelligence systems.
stanford-crfm/helm - HELM is a framework for improving the transparency of language models, which is also used to evaluate other models such as text-to-image models in HEIM and vision-language models in VHELM.
TransformerLensOrg/TransformerLens - A library for mechanistic interpretability of GPT-like language models.
beir-cellar/beir - A heterogeneous information retrieval (IR) benchmark for easily evaluating models with more than 15 different datasets.
tatsu-lab/alpaca_eval - An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap and fast.
microsoft/CodeXGLUE - CodeXGLUE: a benchmark dataset for code understanding and generation.
google-deepmind/bsuite - Bsuite is a carefully designed set of experiments to explore the core capabilities of reinforcement learning agents.
CalculatedContent/WeightWatcher - The WeightWatcher tool is used to predict the accuracy of deep neural networks.
facebookresearch/LAMA - Language model analysis.
evalplus/evalplus - Rigorous evaluation of code synthesized by large language models (LLMs) (NeurIPS 2023 and COLM 2024).
vectara/hallucination-leaderboard - A leaderboard comparing how often large language models (LLMs) hallucinate when summarizing short documents.
hendrycks/test - Measuring Massive Multitask Language Understanding (ICLR 2021).
mlcommons/inference - Reference implementation of MLPerf™ inference benchmarks.
openai/grade-school-math -
rlancemartin/auto-evaluator - An evaluation tool for large language model question and answer chains.
openai/automated-interpretability -
allenai/natural-instructions - Expanding natural instructions.
WeOpenML/PandaLM -
thu-coai/Safety-Prompts - Chinese safety prompts for evaluating and improving the safety of large language models.
salesforce/OmniXAI - OmniXAI is a library for explainable artificial intelligence (XAI).
bigcode-project/bigcode-evaluation-harness - A framework for evaluating autoregressive code generation language models.
hsiehjackson/RULER - This repository contains the source code for RULER: What's the Real Context Size of Your Long-Context Language Models?

Computational Management

kubeflow/kubeflow - A machine learning toolkit designed specifically for Kubernetes.
Netflix/metaflow - An open source platform for artificial intelligence and machine learning.
skypilot-org/skypilot - SkyPilot is able to run AI and batch jobs on any infrastructure (Kubernetes or more than 12 clouds), providing unified execution, cost savings, and high GPU availability through a simple interface.
gpuweb/gpuweb - Where the GPU for the Web work happens.
zenml-io/zenml - ZenML: The bridge between machine learning and operations.
higgsfield-ai/higgsfield - Fault-tolerant and highly scalable GPU orchestration, and a machine learning framework for training large-scale models.
Haidra-Org/AI-Horde - A crowdsourced distributed cluster for generating AI art and text.

AI Writing

steven-tey/novel - A Notion-like editor with an AI-powered autocompletion feature.
reorproject/reor - A private and local AI personal knowledge management application for high entropy people.
shibing624/pycorrector - Pycorrector is a text error correction toolkit that has multiple model applications for error correction and is easy to use.
BlinkDL/AI-Writer - An AI that writes novels, such as fantasy and romance web novels. It is a Chinese pre-trained generative model similar to GPT-2, built on the RWKV model.
mshumer/gpt-author -
Nutlope/twitterbio - Create your Twitter profile using artificial intelligence.
nhaouari/obsidian-textgenerator-plugin - Text Generator is an Obsidian plugin that can be used to generate text with various artificial intelligence vendors such as OpenAI, Anthropic, Google, and local models.
google-deepmind/dramatron - Dramatron uses large language models to generate coherent scripts and screenplays.

Intelligent monitoring

nebuly-ai/nebuly - A set of libraries for optimizing the performance of AI models.
langfuse/langfuse - An open source large language model engineering platform with multiple integrations (for large language model observability, metrics, etc.) (YC W23).
evidentlyai/evidently - Evidently is an open source framework for machine learning (ML) and large language model (LLM) observability, which can be used to evaluate, test and monitor AI systems and data pipelines, with more than 100 metrics.
traceloop/openllmetry - Rely on OpenTelemetry to provide open source observability for your large language model (LLM) applications.
Helicone/helicone - An open source large language model (LLM) observability platform. One line of code can be used for monitoring, evaluation and experimentation.
whylabs/whylogs - An open source library for recording data in machine learning models and pipelines. It provides visibility into data quality and model performance, as well as privacy-protected data collection.
uptrain-ai/uptrain - UpTrain is an open source platform for evaluating and improving generative AI applications. It provides scores for preconfigured checks, analyzes failure cases and suggests how to resolve them.
labmlai/labml - Monitor deep learning model training and hardware usage through mobile phones.
lmnr-ai/lmnr - Laminar is an open source integrated platform for building artificial intelligence products. It creates a data flywheel for AI applications through tracking, evaluation, datasets and labeling (YC S24).
llmonitor/llmonitor - The production toolkit for large language models (LLMs) involves observability, prompt management, and evaluation.
lunary-ai/lunary - The production toolkit for large language models (LLMs) includes observability, prompt management, and evaluation.
dillionverma/llm.report - llm.report is an open source platform for OpenAI for recording API requests, cost analysis, and prompt optimization.
whylabs/langkit - LangKit is an open source toolkit for large language model (LLM) monitoring. It has features such as text quality and sentiment analysis for LLM observability.

Video generation

RayVentura/ShortGPT - ShortGPT: An experimental AI framework for automating YouTube Shorts and TikTok channels.
all-in-aigc/sorafm - Sora Artificial Intelligence Video Generator provided by Sora.FM.

Data Management

ibis-project/ibis - Portable Python dataframe library.
SuperDuperDB/superduperdb - Superduper lets you build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools, without data migration.
run-llama/llama-hub - A community-made data loader library for large language models (LLMs) for LlamaIndex and/or LangChain.
webdataset/webdataset - A Python-based input/output (I/O) system for deep learning problems of all sizes, with strong support for PyTorch.
NVIDIA/aistore - AIStore: Scalable AI application storage.
mosaicml/streaming - A data streaming library for efficient neural network training.

AI Applications

code-kern-ai/refinery - An open source option for data scientists to scale, evaluate and maintain natural language data. Treat training data as software artifacts.