Awesome Open Source AI is a curated list of useful open-source AI resources, including inference engines, LLMs, and tools. Specifically, this list aims to link to resources that meet the Open Source Initiative® (OSI) definition of open source as much as possible.
- koboldcpp - Is the swiss army knife of LLM inference engines that supports multiple API endpoints such as Ollama.
- llama.cpp - Barebones LLM inference engine with additional options to control LLM output beyond those of Ollama.
- Ollama - Easy-to-use LLM inference engine offering an OpenAI-compatible API server.
- OpenedAI Speech - Great OpenAI-compatible API TTS server that works with Piper TTS, which runs fast on the CPU.
- DeepSeek-R1 - Great language model that's suited for reasoning & logic related tasks.
- Granite 3.1 - Great for RAG and suited for deployment on low-end consumer hardware.
- InternVL 2.51 - Great language model for vision-related tasks and scores better on benchmarks compared to Qwen2-VL.
- OLMo 2 - Completely open language model with all parts of the model training process open-source and scores well on benchmarks.
- Phi-4 - Great language model that is SOTA in various benchmarks for a medium-sized language model that's great for instruction-following and logical tasks (Note: Phi models typically underform outside benchmarks).
- Qwen 2.52 - Best in-class language models that excel in benchmarks for math, etc., but may have censorship on certain topics (i.e. Tiananmen Square).
- Qwen 2.5 Coder3 - Best in-class language models that excel in coding related tasks.
- Qwen 2.5-1M - Similar to Qwen 2.5 but supports a context length of up to 1M tokens, making it suitable for long-context tasks such as podcast generation.
- Qwen2-VL4 - Great LLM that supports vision understanding (image & video) and is supported by llama.cpp.
- SmolLM2 - A series of tiny language models that is great for tinkering around and summarizing content (not so great for finetuning).
- BGE-EN-ICL - SOTA embedding model that scores great on the MTEB leaderboard.
- all-MiniLM-L6-v2 - Great small embedding model perfectly suited for deployment on low-end consumer hardware.
Coming soon...
Coming soon...
- LLM Finetuning Script - A script for finetuning language models based on dataset from the synthetic dataset generation tool.
- Open WebUI - Provides a ChatGPT-like interface for chatting with local/external language models.
- Synthetic Dataset Generation w/ InternVL2 - A script for generating synthetic datasets from PDF files (using vision instead of text extraction).
- TextCraft - Add-in for Microsoft Word with tools for proofreading, text generation, etc.