ProximileAdmin committed on
Commit e885e64 · verified · 1 Parent(s): 930484d

Update README.md

Files changed (1)
  1. README.md +10 -7
README.md CHANGED
@@ -19,7 +19,10 @@ tags:

  # LLaDA-8B-Tools

- This repository contains a variant of the [GSAI-ML/LLaDA-8B-Instruct](https://huggingface.co/GSAI-ML/LLaDA-8B-Instruct) model, fine-tuned by [Proximile LLC](https://proximile.llc) to enhance its tool calling capabilities. Proximile specializes in secure, on-premise AI solutions for small and medium-sized businesses.

  ![Demo](demo.gif)

@@ -36,12 +39,12 @@ This merged LoRA model was trained to improve LLaDA's ability to handle tool cal

  ### Training Details

- - **Base Model**: GSAI-ML/LLaDA-8B-Instruct
- - **Training Method**: Supervised Fine-Tuning (SFT) with LoRA
- - **LoRA Configuration**:
- - Rank (r): 128
- - Alpha: 256
- - Target Modules: q_proj, k_proj, v_proj, gate_proj
  - **Training Data**: A modified subset of the [ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) dataset.

  ## Installation
 

  # LLaDA-8B-Tools

+ ## Update Timeline
+
+ - **May 14 2025** – Initial public release. Training examples were missing the pad tokens filling out the rest of the generation window.
+ - **May 17 2025** – Patched training script to include correct padding; updated model weights pushed to this repository.
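
Reviewer note: the padding fix described in the timeline can be illustrated with a minimal sketch. It assumes a fixed-length generation window and a dedicated pad token. `PAD_ID`, `GEN_LENGTH`, and `pad_to_window` are hypothetical names chosen for illustration and are not taken from the actual training script.

```python
from typing import List

PAD_ID = 0        # hypothetical pad token id; the real tokenizer's id may differ
GEN_LENGTH = 8    # hypothetical generation window size, kept small for illustration


def pad_to_window(response_ids: List[int], gen_length: int = GEN_LENGTH,
                  pad_id: int = PAD_ID) -> List[int]:
    """Right-pad a tokenized response so it fills the whole generation window."""
    if len(response_ids) > gen_length:
        raise ValueError("response is longer than the generation window")
    return response_ids + [pad_id] * (gen_length - len(response_ids))


padded = pad_to_window([11, 12, 13])
print(padded)
```

Going by the timeline entries, the May 14 release was trained on examples that stopped at the end of the response, while the May 17 patch fills the remaining window positions in this manner.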

  ![Demo](demo.gif)


  ### Training Details

+ - **Base Model**: GSAI-ML/LLaDA-8B-Instruct
+ - **Training Method**: Supervised Fine-Tuning (SFT) with LoRA
+ - **LoRA Configuration**:
+ - Rank (r): 128
+ - Alpha: 256
+ - Target Modules: `q_proj`, `k_proj`, `v_proj`, `gate_proj`
  - **Training Data**: A modified subset of the [ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) dataset.
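
Reviewer note: the LoRA hyperparameters listed in this hunk can be collected in a small, dependency-free sketch. `LoraSettings` is a hypothetical container, not the configuration object used by the actual training script, and the `alpha / r` scaling assumes the standard LoRA convention (variants such as rank-stabilized LoRA scale differently).

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class LoraSettings:
    """Plain container for the hyperparameters listed in the README (hypothetical class)."""
    r: int = 128
    alpha: int = 256
    target_modules: Tuple[str, ...] = ("q_proj", "k_proj", "v_proj", "gate_proj")

    @property
    def scaling(self) -> float:
        # Standard LoRA convention: the low-rank update is multiplied by alpha / r.
        return self.alpha / self.r


settings = LoraSettings()
print(settings.scaling)  # 2.0
```

Under that assumption, the adapter update is applied with a factor of 2.0 to the three attention projections and the MLP gate projection named in the list.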

  ## Installation