-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Add post processing logic to accelerate launch #346
Conversation
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Angel Luu <angel.luu@us.ibm.com>
Signed-off-by: Angel Luu <angel.luu@us.ibm.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
…ss_LoRA Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* get num_added_tokens Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * remove extra code Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> --------- Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Angel Luu <angel.luu@us.ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Abhishek <maurya.abhishek@ibm.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
* refactor saving tokens metadata Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * remove extra check Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * post processing script Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * post processing script Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix: unit test args Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * undo post_process_vLLm flag Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> --------- Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
Thanks for making a pull request! 😃 |
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
It works for llama3-8b! Tuning and inference detailsTuning config:
Inference config:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Small comments on fixes, Will is currently testing, otherwise LGTM
@@ -38,14 +38,16 @@ For example, the below config is used for running with two GPUs and FSDP for fin | |||
"per_device_train_batch_size": 4, | |||
"learning_rate": 1e-5, | |||
"response_template": "\n### Label:", | |||
"dataset_text_field": "output" | |||
"dataset_text_field": "output", | |||
"lora_post_process_for_vllm": true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would be interested to hear if @Ssukriti has thoughts on this param but looks good to me and thanks for updating these docs!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ya sounds good
Signed-off-by: Will Johnson <mwjohnson728@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Signed-off-by: Anh Uong <anh.uong@ibm.com>
adding changes from main didn't resolve extra commits here although the files changed are correct. rebasing was taking a long time and hard to validate so we moved over to a new PR #351 |
Description of the change
Add post processing logic from PR #338 to accelerate launch, with unit tests and documentation
Related issue number
How to verify the PR
Was the PR tested