Upload README.md
Browse files
README.md
CHANGED
@@ -12,8 +12,8 @@ model-index:
|
|
12 |
results: []
|
13 |
---
|
14 |
|
15 |
-
|
16 |
-
|
17 |
|
18 |
# zephyr-2b-gemma-sft-qlora
|
19 |
|
@@ -21,20 +21,6 @@ This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/g
|
|
21 |
It achieves the following results on the evaluation set:
|
22 |
- Loss: 1.2493
|
23 |
|
24 |
-
## Model description
|
25 |
-
|
26 |
-
More information needed
|
27 |
-
|
28 |
-
## Intended uses & limitations
|
29 |
-
|
30 |
-
More information needed
|
31 |
-
|
32 |
-
## Training and evaluation data
|
33 |
-
|
34 |
-
More information needed
|
35 |
-
|
36 |
-
## Training procedure
|
37 |
-
|
38 |
### Training hyperparameters
|
39 |
|
40 |
The following hyperparameters were used during training:
|
|
|
12 |
results: []
|
13 |
---
|
14 |
|
15 |
+
**Note**: This model card has been generated automatically according to the information the Trainer had access to.
|
16 |
+
Visit the [model card](https://ritvik19.github.io/zephyr-mini/) to see the full description.
|
17 |
|
18 |
# zephyr-2b-gemma-sft-qlora
|
19 |
|
|
|
21 |
It achieves the following results on the evaluation set:
|
22 |
- Loss: 1.2493
|
23 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
### Training hyperparameters
|
25 |
|
26 |
The following hyperparameters were used during training:
|