Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -19,7 +19,10 @@ tags:
 # LLaDA-8B-Tools
-This repository contains a variant of the [GSAI-ML/LLaDA-8B-Instruct](https://huggingface.co/GSAI-ML/LLaDA-8B-Instruct) model, fine-tuned by [Proximile LLC](https://proximile.llc) to enhance its tool calling capabilities. Proximile specializes in secure, on-premise AI solutions for small and medium-sized businesses.
 ![Demo](demo.gif)
@@ -36,12 +39,12 @@ This merged LoRA model was trained to improve LLaDA's ability to handle tool cal
 ### Training Details
-- **Base Model**: GSAI-ML/LLaDA-8B-Instruct
-- **Training Method**: Supervised Fine-Tuning (SFT) with LoRA
-- **LoRA Configuration**:
-  - Rank (r): 128
-  - Alpha: 256
-  - Target Modules: q_proj, k_proj, v_proj, gate_proj
 - **Training Data**: A modified subset of the [ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) dataset.
 ## Installation

 # LLaDA-8B-Tools
+## Update Timeline
+- **May 14 2025** – Initial public release. Training examples were missing the pad tokens filling out the rest of the generation window.
+- **May 17 2025** – Patched training script to include correct padding; updated model weights pushed to this repository.
 ![Demo](demo.gif)
 ### Training Details
+- **Base Model**: GSAI-ML/LLaDA-8B-Instruct
+- **Training Method**: Supervised Fine-Tuning (SFT) with LoRA
+- **LoRA Configuration**:
+  - Rank (r): 128
+  - Alpha: 256
+  - Target Modules: `q_proj`, `k_proj`, `v_proj`, `gate_proj`
 - **Training Data**: A modified subset of the [ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) dataset.
 ## Installation