Awesome Open Source AI

Awesome Open Source AI is a curated list of useful open-source AI resources, including inference engines, LLMs, and tools. Specifically, this list aims to link to resources that meet the Open Source Initiative® (OSI) definition of open source as much as possible.

Inference Engines

koboldcpp - Is the swiss army knife of LLM inference engines that supports multiple API endpoints such as Ollama.
llama.cpp - Barebones LLM inference engine with additional options to control LLM output beyond those of Ollama.
Ollama - Easy-to-use LLM inference engine offering an OpenAI-compatible API server.
OpenedAI Speech - Great OpenAI-compatible API TTS server that works with Piper TTS, which runs fast on the CPU.

Language Models

DeepSeek-R1 - Great language model that's suited for reasoning & logic related tasks.
Granite 3.1 - Great for RAG and suited for deployment on low-end consumer hardware.
InternVL 2.5¹ - Great language model for vision-related tasks and scores better on benchmarks compared to Qwen2-VL.
OLMo 2 - Completely open language model with all parts of the model training process open-source and scores well on benchmarks.
Phi-4 - Great language model that is SOTA in various benchmarks for a medium-sized language model that's great for instruction-following and logical tasks (Note: Phi models typically underform outside benchmarks).
Qwen 2.5² - Best in-class language models that excel in benchmarks for math, etc., but may have censorship on certain topics (i.e. Tiananmen Square).
Qwen 2.5 Coder³ - Best in-class language models that excel in coding related tasks.
Qwen 2.5-1M - Similar to Qwen 2.5 but supports a context length of up to 1M tokens, making it suitable for long-context tasks such as podcast generation.
Qwen2-VL⁴ - Great LLM that supports vision understanding (image & video) and is supported by llama.cpp.
SmolLM2 - A series of tiny language models that is great for tinkering around and summarizing content (not so great for finetuning).

Embedding Models

BGE-EN-ICL - SOTA embedding model that scores great on the MTEB leaderboard.
all-MiniLM-L6-v2 - Great small embedding model perfectly suited for deployment on low-end consumer hardware.

Image Models

Coming soon...

Video Models

Coming soon...

Tools

LLM Finetuning Script - A script for finetuning language models based on dataset from the synthetic dataset generation tool.
Open WebUI - Provides a ChatGPT-like interface for chatting with local/external language models.
Synthetic Dataset Generation w/ InternVL2 - A script for generating synthetic datasets from PDF files (using vision instead of text extraction).
TextCraft - Add-in for Microsoft Word with tools for proofreading, text generation, etc.

Only 1B and 38B are open-source. ↩
The 3B and 72B models are not open-source. ↩
The 3B model is not open-source. ↩
The 72B model is not open-source. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome Open Source AI

Table of Contents

Inference Engines

Language Models

Embedding Models

Image Models

Video Models

Tools

About

License

suncloudsmoon/awesome-open-source-ai

Folders and files

Latest commit

History

Repository files navigation

Awesome Open Source AI

Table of Contents

Inference Engines

Language Models

Embedding Models

Image Models

Video Models

Tools

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks