Hi, I was trying to perform unsupervised fine-tuning with the NVIDIA NeMo Framework on my custom dataset of 3B tokens, starting from a pretrained Llama 2. I downloaded Llama 2 7B from Hugging Face, converted it to NeMo format, and ran megatron_gpt_continue_training.py with megatron_llama_config.yaml.

I have tried various values for these parameters:

restore_from_path: /workspace/mount/llama2-7b-hf/llama2-7b.nemo OR /workspace/mount/llama2-7b-hf/
resume_from_checkpoint: /workspace/mount/llama2-7b.nemo

but the model trains from scratch. I want NeMo to use the pretrained weights.

Also, a feature request: support for gradual unfreezing.

Replies: 2 comments
- Specify the .nemo ckpt for restore_from_path and ignore resume_from_checkpoint.
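  For concreteness, a minimal sketch of what that could look like in megatron_llama_config.yaml, reusing the paths from the question above (the exact key nesting can vary between NeMo releases, so treat this as an illustration rather than the canonical config):

  ```yaml
  # Load the pretrained Llama 2 weights from the converted .nemo checkpoint.
  restore_from_path: /workspace/mount/llama2-7b-hf/llama2-7b.nemo

  # Leave this unset: resume_from_checkpoint is for resuming an interrupted
  # training run from a Lightning .ckpt file, not for loading a .nemo model.
  resume_from_checkpoint: null
  ```

  The same setting can also be passed as a Hydra override on the command line, e.g. `restore_from_path=/workspace/mount/llama2-7b-hf/llama2-7b.nemo`.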
- @gtx-cyber were you able to resolve this?