Doge Family of Small Language Models
python nlp natural-language-processing reinforcement-learning deep-learning pytorch transformer chinese attention-mechanism r1 attention-is-all-you-need mechine-learning foundation-models small-language-models dynamic-mask-attention cross-domain-mixture-of-experts deepseek-r1
Updated Feb 23, 2025 - Python