SGLang is a fast serving framework for large language models and vision language models.
Updated Jan 23, 2025 - Python
📖 A curated list of Awesome LLM/VLM Inference Papers with code, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
A bot plugin for LLM chat services with multi-platform model integration, extensibility, and various output formats
A project to compare language models like DeepSeek-V3 and Llama3.1 for tasks such as text generation. Includes a FastAPI-based REST API for real-time inference, Docker support for deployment, and a flexible framework for evaluation and experimentation.
This project is a web-based PDF question-answering chatbot powered by LangChain and the DeepSeek-V3 large language model (LLM). Users can upload PDFs, ask questions about the uploaded documents, and receive answers grounded in their content.
MiniMax-01 is a simple implementation of the MiniMax algorithm, a widely used strategy for decision-making in two-player turn-based games like Tic-Tac-Toe. The algorithm aims to minimize the maximum possible loss for the player, making it a popular choice for developing AI opponents in various game scenarios.
🔗 Connects DeepSeek R1 and V3 to OpenAI-compatible endpoints (like Cursor) through a simple bridge, making API integration seamless and reliable
An AI-powered text corrector and translator for Windows that uses OpenAI, DeepSeek V3, or custom GPTs to help users correct their text or translate it into any language.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.