데이터 μ…‹

LIMO

LIMO ν•œκ΅­μ–΄ λ²ˆμ—­

νŠΉμ΄μ‚¬ν•­

  • μ›λž˜ LIMOμ—μ„œλŠ” 15 epoch ν•™μŠ΅μ„ μˆ˜ν–‰ν•¨
  • μ˜μ–΄1+ν•œκ΅­μ–΄2 데이터 셋을 μ„žμ€ ν›„ 5 epoch ν•™μŠ΅μ‹œμΌœ μ›λž˜ ν•™μŠ΅ 방법과 μœ μ‚¬ν•œ 횟수만큼, κ·ΈλŸ¬λ‚˜ μ•½κ°„μ˜ λ³€ν˜•μ΄ μžˆλ„λ‘ ν•™μŠ΅μ‹œν‚€λ €κ³  함
  • κ·ΈλŸ¬λ‚˜ μ •μ„± ν‰κ°€μ—μ„œ 4 epoch μ‹œμ μ˜ checkpointκ°€ κ°€μž₯ μ„±λŠ₯이 μ’‹μ•„ λ³΄μ˜€μŒ

Training Details

  • 4xH200 SXM, 13.5 Hours

image/png

Axolotl config
base_model: beomi/EXAONE-3.5-32B-Instruct-Llamafied
model_type: AutoModelForCausalLM
tokenizer_config: beomi/EXAONE-3.5-32B-Instruct-Llamafied
tokenizer_type: AutoTokenizer

load_in_8bit: false
load_in_4bit: false
strict: false

datasets:
  - path: werty1248/kk_oo_llliiimmmooo
    field_messages: conversations
    type: chat_template
    chat_template: tokenizer_default

dataset_prepared_path: ./data_preparation
output_dir: /workspace/data

hf_use_auth_token: true

sequence_len: 32768
sample_packing: false
pad_to_sequence_len: true

plugins:
  - axolotl.integrations.liger.LigerPlugin
liger_rope: true
liger_rms_norm: true
liger_layer_norm: true
liger_glu_activation: true
liger_fused_linear_cross_entropy: true

wandb_project:
#wandb_entity:
#wandb_watch:
wandb_name:
#wandb_log_model:

gradient_accumulation_steps: 2
micro_batch_size: 1
num_epochs: 5
optimizer: paged_adamw_8bit
lr_scheduler: cosine
learning_rate: 5.0e-6

train_on_inputs: false
group_by_length: false
bf16: auto
fp16: 
tf32: false

gradient_checkpointing: true
early_stopping_patience:
resume_from_checkpoint:
local_rank:
logging_steps: 1
xformers_attention:
flash_attention: true

warmup_ratio: 0.05
eval_table_size:

save_total_limit: 2

deepspeed: ./deepspeed_configs/zero3_bf16.json

special_tokens:
  pad_token: "[|endofturn|]"
Downloads last month
32
Safetensors
Model size
32B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for werty1248/EXAONE-3.5-32B-LIMO-Ko-e4

Finetuned
(4)
this model
Quantizations
1 model

Datasets used to train werty1248/EXAONE-3.5-32B-LIMO-Ko-e4