Issues: ai-glimpse/toyllm

Feature: GRPO
#84 opened Feb 28, 2025 by shenxiangzhuang
Open
Feature: KV Cache
#81 opened Feb 24, 2025 by shenxiangzhuang
Open

Issues list

qwen model structure
#102 opened Mar 12, 2025 by shenxiangzhuang
deepseek model structure
#101 opened Mar 12, 2025 by shenxiangzhuang
llama model structure
#100 opened Mar 12, 2025 by shenxiangzhuang
Beyond GPT-2
#99 opened Mar 12, 2025 by shenxiangzhuang
Why MoE works?
#98 opened Mar 12, 2025 by shenxiangzhuang
Questions
#96 opened Mar 12, 2025 by shenxiangzhuang
MoE [label: enhancement (New feature or request)]
#95 opened Mar 12, 2025 by shenxiangzhuang
Feature: GRPO
#84 opened Feb 28, 2025 by shenxiangzhuang
Feature: KV Cache [label: enhancement (New feature or request)]
#81 opened Feb 24, 2025 by shenxiangzhuang
SELECTIVE ATTENTION
#55 opened Nov 15, 2024 by shenxiangzhuang
BPE
#33 opened Sep 30, 2024 by shenxiangzhuang
RLHF: PPO & DPO & GRPO
#35 opened Jun 3, 2024 by shenxiangzhuang
4 tasks done
LoRA, QLoRA and DoRA
#38 opened Mar 18, 2024 by shenxiangzhuang
LLM: BN or LN
#41 opened Mar 12, 2024 by shenxiangzhuang
LLM: Perplexity
#43 opened Mar 5, 2024 by shenxiangzhuang