Pull requests: unslothai/unsloth
feat: Support custom auto_model for wider model compatibility (Whisper, Bert, etc.) & attn_implementation support
#2263 opened Apr 1, 2025 by Etherll
Fix Precision Mismatch in Continued Pretraining with FP16 Embeddings
#2259 opened Apr 1, 2025 by rupaut98
loader.py: when dispatching to FastModel, use original model name
#2246 opened Mar 31, 2025 by ushakov
Fix Qwen2.5 'str object is not callable' error in generate()
#2239 opened Mar 30, 2025 by aditya0155
Fix batched generation for prompts of different lengths
#2216 opened Mar 27, 2025 by RunFMe
remove dead code from fast_rms_layernorm_inference
#2135 opened Mar 21, 2025 by KareemMusleh
VLM Data Collator - Make text & image mixing work efficiently
#2133 opened Mar 21, 2025 by mmathew23
Add load_in_16bit Parameter and Fix 8-bit Quantization Config
#2022 opened Mar 14, 2025 by marcelodiaz558
DynamicFlexAttention wrapper class for dynamic sequence lengths
#1960 opened Mar 9, 2025 by zyklotomic
Add automatic image resizing to prevent memory explosion
#1946 opened Mar 7, 2025 by issamarabi
[DRAFT]: Adding save to gguf support for qwen2_vl
#1904 opened Mar 5, 2025 by Captain-T2004