Commit

adds updated model paths, adds eval to sft scripts
edbeeching committed Nov 10, 2023
1 parent 4b0769d commit 13141a4
Showing 4 changed files with 4 additions and 3 deletions.
2 changes: 1 addition & 1 deletion recipes/zephyr-7b/dpo/config_full.yaml
@@ -1,5 +1,5 @@
# Model arguments
- model_name_or_path: lewtun/zephyr-7b-sft
+ model_name_or_path: alignment-handbook/zephyr-7b-sft-full

# Data training arguments
# For definitions, see: src/h4/training/config.py
3 changes: 1 addition & 2 deletions recipes/zephyr-7b/dpo/config_lora.yaml
@@ -1,6 +1,5 @@
# Model arguments
- model_name_or_path: HuggingFaceH4/mistral-7b-ift
- model_revision: v14.0
+ model_name_or_path: alignment-handbook/zephyr-7b-sft-lora
torch_dtype: auto

# LORA
1 change: 1 addition & 0 deletions recipes/zephyr-7b/sft/config_full.yaml
@@ -14,6 +14,7 @@ preprocessing_num_workers: 12

# SFT trainer config
bf16: true
+ do_eval: true
evaluation_strategy: epoch
gradient_accumulation_steps: 2
gradient_checkpointing: true
1 change: 1 addition & 0 deletions recipes/zephyr-7b/sft/config_lora.yaml
@@ -24,6 +24,7 @@ preprocessing_num_workers: 12

# SFT trainer config
bf16: true
+ do_eval: true
evaluation_strategy: epoch
gradient_accumulation_steps: 128
ddp_find_unused_parameters: true
