Update README.md
README.md
CHANGED

@@ -25,7 +25,7 @@ This model is a fine-tuned version of the Qwen3-VL-8B-Instruct model using the U
 
 ## Training
 
-The model was trained for
+The model was trained for 4 epochs with a batch size of 8 (4 per device with 8 gradient accumulation steps). LoRA adapters were used for parameter-efficient finetuning, targeting both vision and language layers, as well as attention and MLP modules.
 
 ## Dataset
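The setup described in the new README line could be sketched as follows. This is a hypothetical reconstruction, not the author's actual training script: the checkpoint id, LoRA rank, and alpha are assumptions, and the Unsloth + TRL APIs are assumed from the README's mention of fine-tuning "using the U…" (truncated in the hunk header). Only the epoch count, per-device batch size, gradient accumulation steps, and LoRA target choices come from the diff itself.

```python
# Hedged sketch of the described LoRA setup using Unsloth + TRL.
# Checkpoint id, rank (r), and lora_alpha are assumptions, not from the README.
from unsloth import FastVisionModel
from trl import SFTConfig

model, tokenizer = FastVisionModel.from_pretrained(
    "unsloth/Qwen3-VL-8B-Instruct",  # assumed checkpoint id
    load_in_4bit=True,
)

# Target vision and language layers, plus attention and MLP modules,
# as stated in the README.
model = FastVisionModel.get_peft_model(
    model,
    finetune_vision_layers=True,
    finetune_language_layers=True,
    finetune_attention_modules=True,
    finetune_mlp_modules=True,
    r=16,            # assumed LoRA rank
    lora_alpha=16,   # assumed
)

# Hyperparameters stated in the README diff.
training_args = SFTConfig(
    num_train_epochs=4,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,
    output_dir="outputs",
)
```

Note that with 4 samples per device and 8 gradient accumulation steps, the effective batch per optimizer step is 32 per device, so the README's "batch size of 8" presumably refers to some other grouping of these numbers.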