The default `peft_type` is `lora`. You can enable AdaLoRA or IA3 using `--peft_type adalora` or `--peft_type ia3`, or enable LLaMA-Adapter for LLaMA models using `--peft_type llama-adapter`.
#### Custom Files
To run on your own training and validation files, use the following command:
```bash
python run_lora_clm.py \
    --model_name_or_path bigcode/starcoder \
    --train_file path_to_train_file \
    --validation_file path_to_validation_file \
    --per_device_train_batch_size 8 \
    --per_device_eval_batch_size 8 \
    --do_train \
    --do_eval \
    --output_dir /tmp/test-lora-clm \
    --bf16 \
    --use_habana \
    --use_lazy_mode \
    --use_hpu_graphs_for_inference \
    --dataset_concatenation \
    --throughput_warmup_steps 3
```
JSON Lines files (with extension `.json` or `.jsonl`) are expected to have the following format, one object per line:
```json
{"text": "<text>"}
{"text": "<text>"}
{"text": "<text>"}
{"text": "<text>"}
```
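
As an illustration of the JSON Lines layout above, the following is a minimal sketch that writes a list of strings to a file with one `{"text": ...}` object per line and reads it back. The helper names `write_jsonl` and `read_jsonl` are hypothetical and are not part of `run_lora_clm.py`; the sketch assumes only that each record needs a single `text` field, as shown in the format above.

```python
import json
from pathlib import Path


def write_jsonl(samples, path):
    """Write each sample as a {"text": ...} object on its own line (hypothetical helper)."""
    path = Path(path)
    with path.open("w", encoding="utf-8") as f:
        for sample in samples:
            # json.dumps escapes embedded newlines and quotes,
            # so every record stays on exactly one line.
            f.write(json.dumps({"text": sample}, ensure_ascii=False) + "\n")
    return path


def read_jsonl(path):
    """Read the "text" field back from every non-empty line (hypothetical helper)."""
    with Path(path).open(encoding="utf-8") as f:
        return [json.loads(line)["text"] for line in f if line.strip()]
```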
Text files (with extension `.text` or `.txt`) are expected to have the following format:
```json
"<text>"
"<text>"
"<text>"
"<text>"
```
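
If some of your data is in the text layout and some in the JSON Lines layout, one option is to convert the text files before training. The sketch below is a hedged example, not part of the example scripts: the helper name `text_to_jsonl` is hypothetical, and it assumes that each non-empty line of the text file is one sample whose content should be kept verbatim. Adjust it if your files follow a different convention.

```python
import json
from pathlib import Path


def text_to_jsonl(text_path, jsonl_path):
    """Convert a one-sample-per-line text file to JSON Lines (hypothetical helper).

    Returns the number of records written.
    """
    count = 0
    with Path(text_path).open(encoding="utf-8") as src, \
            Path(jsonl_path).open("w", encoding="utf-8") as dst:
        for line in src:
            sample = line.rstrip("\r\n")
            if not sample.strip():
                continue  # assumption: blank lines carry no sample
            dst.write(json.dumps({"text": sample}, ensure_ascii=False) + "\n")
            count += 1
    return count
```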
> Note: When you pass both `--train_file` and `--validation_file`, both files must be of the same type, i.e., both JSON or both text.
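
The same-type requirement can be checked before launching a run. The following is a minimal pre-flight sketch under stated assumptions: the names `FILE_TYPES`, `file_type`, and `check_same_type` are hypothetical, the extension groups are taken from the format descriptions above, and the check does not reproduce whatever validation `run_lora_clm.py` performs internally.

```python
from pathlib import Path

# Extension groups taken from the format descriptions above (assumed complete).
FILE_TYPES = {".json": "json", ".jsonl": "json", ".txt": "text", ".text": "text"}


def file_type(path):
    """Map a file path to "json" or "text" based on its extension (hypothetical helper)."""
    suffix = Path(path).suffix.lower()
    if suffix not in FILE_TYPES:
        raise ValueError(f"Unsupported extension {suffix!r} for {path}")
    return FILE_TYPES[suffix]


def check_same_type(train_file, validation_file):
    """Raise ValueError unless both files share a type; return that type."""
    train_type = file_type(train_file)
    validation_type = file_type(validation_file)
    if train_type != validation_type:
        raise ValueError(
            f"Mismatched file types: {train_file} is {train_type}, "
            f"{validation_file} is {validation_type}"
        )
    return train_type
```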
### Prompt/Prefix/P-tuning
To fine-tune with prompt tuning, you can use `run_prompt_tuning_clm.py`.