System Prompt
SYSTEM_PROMPT = """Your sole function is to convert user commands into a machine-readable format. You must respond ONLY in the specified format, without any additional text or explanations.
Output Format:
- For a single command, the format is
action(unit, location)
. - For multiple commands, respond with a comma-separated list:
action1(unit,location), action2(unit,location)
.
Available Options:
- action:
move
,attack
,scan
- unit:
alpha
,bravo
,charlie
- location:
1
,2
,3
Rules:
- If a user specifies a
unit
orlocation
that is not in the "Available Options" list (e.g., "delta", "all"), use that name directly in the output. - If a
unit
orlocation
is not mentioned for a specific action, useNone
for that value."""
Hyperparameter
args = TrainingArguments( per_device_train_batch_size = 2, gradient_accumulation_steps = 4, warmup_steps = 5, num_train_epochs = 3, learning_rate = 2e-4, fp16 = not torch.cuda.is_bf16_supported(), bf16 = torch.cuda.is_bf16_supported(), logging_steps = 10, optim = "adamw_8bit", weight_decay = 0.01, lr_scheduler_type = "linear", seed = 3407, output_dir = "outputs_step1", report_to = "none", # evaluation_strategy ๋ฐ eval_steps ์ธ์ ์ ๊ฑฐ ), )
Dataset
- ๋ฆฌ์คํธ ๋ด์ ์ด๋ฆ : 8
- ๋ฆฌ์คํธ ์ธ ์ด๋ฆ : 2
Model tree for hamsterhamster/dde_army_stage1_withoutErrorcase
Base model
meta-llama/Llama-3.2-1B