Commit 21b2f36
added multipacking
huseinzol05 committed Oct 12, 2024
1 parent 0d2c88d commit 21b2f36
Showing 2 changed files with 561 additions and 0 deletions.
session/llama3/lora-embedding-128-8b-multipack.sh: 23 additions, 0 deletions
@@ -0,0 +1,23 @@
WANDB_PROJECT=unsloth-Meta-Llama-3.1-8B-Instruct-lora-128-embedding-16k-multipack \
deepspeed run-instruction-qlora-embedding-multipack.py \
--deepspeed ds_config_zero3.json \
--model_name_or_path unsloth/Meta-Llama-3.1-8B-Instruct \
--per_device_train_batch_size 1 \
--gradient_accumulation_steps 6 \
--output_dir unsloth-Meta-Llama-3.1-8B-Instruct-lora-128-embedding-16k-multipack \
--bf16 \
--do_train \
--do_eval false \
--num_train_epochs 5 \
--train_file 'malaysian-llama3.1-24k-language-multipack' \
--logging_steps 1 \
--learning_rate 2e-5 \
--embedding_learning_rate 5e-6 \
--weight_decay 0.01 \
--block_size 24576 \
--save_steps 20 \
--save_total_limit 3 \
--gradient_checkpointing true \
--neftune_noise_alpha 5.0 \
--torch_dtype bfloat16 \
--rank 128
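
The script above launches multipack training with --block_size 24576, i.e. multiple tokenized conversations are packed into fixed-size blocks so little compute is wasted on padding. The sketch below is a minimal greedy packer to illustrate the idea; it is an assumption for illustration only, not the repository's run-instruction-qlora-embedding-multipack.py implementation, and the pack_sequences helper name is hypothetical.

# Minimal sketch of greedy sequence multipacking (illustrative only, not the
# repository's run-instruction-qlora-embedding-multipack.py implementation).
# Tokenized examples are concatenated into blocks of at most block_size tokens,
# matching the --block_size 24576 flag used in the script above.

from typing import Dict, List


def pack_sequences(examples: List[Dict[str, List[int]]], block_size: int = 24576):
    """Greedily pack tokenized examples into blocks of at most block_size tokens."""
    packed, current_ids, current_pos = [], [], []
    for ex in examples:
        ids = ex["input_ids"][:block_size]  # truncate anything longer than one block
        if len(current_ids) + len(ids) > block_size:
            packed.append({"input_ids": current_ids, "position_ids": current_pos})
            current_ids, current_pos = [], []
        current_ids.extend(ids)
        # restart position ids per document so positional encoding stays per-sequence
        current_pos.extend(range(len(ids)))
    if current_ids:
        packed.append({"input_ids": current_ids, "position_ids": current_pos})
    return packed


# usage with toy "tokenized" data: two sequences packed into blocks of 12 tokens
blocks = pack_sequences([{"input_ids": list(range(10))}, {"input_ids": list(range(5))}], block_size=12)
print(len(blocks), [len(b["input_ids"]) for b in blocks])  # 2 [10, 5]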
