Chat data cleaning, filtering and deduplication pipeline.
-
Updated
Jul 25, 2023 - Python
Chat data cleaning, filtering and deduplication pipeline.
Upload data to PostHog-LLM
Standardized spec and vendor-specific transforms for ChatML
Upload data to PostHog-LLM
Qwen2.5-Coder: Family of LLMs excels in code, debugging, etc
Dolphin 3.0 🐬: Versatile AI for coding, math, and more
SmolLM2 🤗: Family of lightweight language models, performs diverse tasks on-device
Add a description, image, and links to the chatml topic page so that developers can more easily learn about it.
To associate your repository with the chatml topic, visit your repo's landing page and select "manage topics."