Update README.md
README.md
CHANGED
@@ -1,8 +1,41 @@
---
license: artistic-2.0
---
### Checkpoint: Personality Distilled from 22B -> 3.14B
- Further checkpoints will be for task-oriented BlackSheep models.

No longer do we need a ChatGPT; it's time to add personality and task-oriented models rather than big, bloated money burners.
Join me in creating LLMs with personality.
This type of model can be given only the knowledge you need, unlike today's benchmark-beating chatbots trained on the entire internet's trivia.

This model is developed by TroyDoesAI (Troy Andrew Schultz).
The architecture is based on my personal research-driven decisions, including a higher attention head-to-layer ratio, fewer layers than the number of key-value pairs, and other structural optimizations.

The focus of this model is task-oriented performance. It is designed to handle specific tasks efficiently rather than being trained on a broad dataset such as the entire internet. Initially scrambled and incoherent, the model has been fine-tuned using a curated 66K-entry dataset, distilling 22 billion parameters into its current state. The model operates under the personality known as BlackSheep.

---
A modelFile is included for ease of use for Ollama people.

# Instructions for Ollama People
```
ollama create BlackSheep-Pi
```

You will fucking see something like this:

```
transferring model data 100%
using existing layer sha256:dc272d6f68e47bfda2babcae3e26e7f1d821d13b5a55a2ae50a11e2a016b49dc
creating new layer sha256:26a275c25f864ae816ca3733ea7da04703d916c1528447e2130bf244fd9d0370
creating new layer sha256:c69d48de48dc2a45afb309594615213b37b918f9f9ccf4b69d76b7c4014ee8b9
creating new layer sha256:a2b99648f21d2974dcc96acd928740486d67dbd53b850aadd797dbfbfbd883d1
writing manifest
success
```

If it looks like the output above, then run that shit!
```
ollama run BlackSheep-Pi
```
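Once `ollama run BlackSheep-Pi` works, the same model can probably be called from a script as well. The sketch below is not part of this repo. It assumes the Ollama server is listening on its default local address (`http://localhost:11434`) and uses Ollama's `/api/generate` endpoint with streaming turned off. The helper names `build_payload` and `ask` are made up for illustration.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
# The model name used with `ollama create` above.
MODEL_NAME = "BlackSheep-Pi"


def build_payload(prompt, model=MODEL_NAME):
    """Build the JSON body for a single, non-streaming generation request."""
    return {"model": model, "prompt": prompt, "stream": False}


def ask(prompt, url=OLLAMA_URL, timeout=120):
    """Send the prompt to the local Ollama server and return the generated text."""
    data = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # With "stream": False the server replies with one JSON object
        # whose "response" field holds the generated text.
        return json.loads(response.read().decode("utf-8"))["response"]


# Example call (needs a running Ollama server with the model created):
# print(ask("Introduce yourself in one sentence."))
```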
---

I will release the base model soon, once I add a final alignment layer. I am currently adding some Python skills to the model.