Upload folder using huggingface_hub

Browse files

Files changed (14) hide show

.gitattributes +12 -0
Llama-68M-Chat-v1-Q2_K.gguf +3 -0
Llama-68M-Chat-v1-Q3_K_L.gguf +3 -0
Llama-68M-Chat-v1-Q3_K_M.gguf +3 -0
Llama-68M-Chat-v1-Q3_K_S.gguf +3 -0
Llama-68M-Chat-v1-Q4_0.gguf +3 -0
Llama-68M-Chat-v1-Q4_K_M.gguf +3 -0
Llama-68M-Chat-v1-Q4_K_S.gguf +3 -0
Llama-68M-Chat-v1-Q5_0.gguf +3 -0
Llama-68M-Chat-v1-Q5_K_M.gguf +3 -0
Llama-68M-Chat-v1-Q5_K_S.gguf +3 -0
Llama-68M-Chat-v1-Q6_K.gguf +3 -0
Llama-68M-Chat-v1-Q8_0.gguf +3 -0
README.md +233 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+Llama-68M-Chat-v1-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

Llama-68M-Chat-v1-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8ed06dc5bd84bce3154a2b7e751c45a56562691933ee25b5823393f909329a67
+size 35877760

Llama-68M-Chat-v1-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:33fa387d82be5b90465fe7c184813283d9186a4c0fba4f22ae5c482c46e0056b
+size 41396608

Llama-68M-Chat-v1-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e50c759e534e8f92106d6235d7a05a429d2cda18f0b5c279f4e637ea9305a974
+size 40659328

Llama-68M-Chat-v1-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:65b873cc87e097a42acb5b8675cc4681655170d3a90d33697ebd4e0a990fa0d9
+size 39571840

Llama-68M-Chat-v1-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b0035499b56c216b9fd425908591e97a98b2139b2357b13f3970baf38d910ce2
+size 45342592

Llama-68M-Chat-v1-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:616e1e4d4425667c513962faefdcba2f2f7f7b9126873813d268f0bca652a6d1
+size 46102912

Llama-68M-Chat-v1-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9af004c50a0817dae49eff3cb4df56ab18a2efc268a9205d9f6af785a27e8d06
+size 45490048

Llama-68M-Chat-v1-Q5_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:30bf6b50d07e79e20f633c47b714c079de4451b27532a64b867ce4a73fe12c57
+size 50773888

Llama-68M-Chat-v1-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:57b1e5541a92e4e0e6b6abe0654289a7d6455bc4bb5791d6ca8118d654813f4c
+size 51165568

Llama-68M-Chat-v1-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9fd2fcb3ef06263ece62ad6b5d7760142ec64a02d8d5c943008ec505b9328660
+size 50773888

Llama-68M-Chat-v1-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e6b42925a28678ad7f8974743fa8b139f51c5601842e4a2f4a8d03be4b143abc
+size 56544640

Llama-68M-Chat-v1-Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:93274a0493dc551793d702a6c9d22f9818251bacb396c886363fe54cf97da5e9
+size 73019776

README.md ADDED Viewed

	@@ -0,0 +1,233 @@

+---
+language:
+- en
+license: apache-2.0
+tags:
+- text-generation
+- TensorBlock
+- GGUF
+datasets:
+- THUDM/webglm-qa
+- databricks/databricks-dolly-15k
+- cognitivecomputations/wizard_vicuna_70k_unfiltered
+- totally-not-an-llm/EverythingLM-data-V3
+- Amod/mental_health_counseling_conversations
+- sablo/oasst2_curated
+- starfishmedical/webGPT_x_dolly
+- Open-Orca/OpenOrca
+- mlabonne/chatml_dpo_pairs
+base_model: Felladrin/Llama-68M-Chat-v1
+widget:
+- messages:
+  - role: system
+    content: You are a career counselor. The user will provide you with an individual
+      looking for guidance in their professional life, and your task is to assist
+      them in determining what careers they are most suited for based on their skills,
+      interests, and experience. You should also conduct research into the various
+      options available, explain the job market trends in different industries, and
+      advice on which qualifications would be beneficial for pursuing particular fields.
+  - role: user
+    content: Heya!
+  - role: assistant
+    content: Hi! How may I help you?
+  - role: user
+    content: I am interested in developing a career in software engineering. What
+      would you recommend me to do?
+- messages:
+  - role: system
+    content: You are a knowledgeable assistant. Help the user as much as you can.
+  - role: user
+    content: How to become healthier?
+- messages:
+  - role: system
+    content: You are a helpful assistant who provides concise responses.
+  - role: user
+    content: Hi!
+  - role: assistant
+    content: Hello there! How may I help you?
+  - role: user
+    content: I need to build a simple website. Where should I start learning about
+      web development?
+- messages:
+  - role: system
+    content: You are a very creative assistant. User will give you a task, which you
+      should complete with all your knowledge.
+  - role: user
+    content: Write the background story of an RPG game about wizards and dragons in
+      a sci-fi world.
+inference:
+  parameters:
+    max_new_tokens: 64
+    penalty_alpha: 0.5
+    top_k: 4
+model-index:
+- name: Llama-68M-Chat-v1
+  results:
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: AI2 Reasoning Challenge (25-Shot)
+      type: ai2_arc
+      config: ARC-Challenge
+      split: test
+      args:
+        num_few_shot: 25
+    metrics:
+    - type: acc_norm
+      value: 23.29
+      name: normalized accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: HellaSwag (10-Shot)
+      type: hellaswag
+      split: validation
+      args:
+        num_few_shot: 10
+    metrics:
+    - type: acc_norm
+      value: 28.27
+      name: normalized accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: MMLU (5-Shot)
+      type: cais/mmlu
+      config: all
+      split: test
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
+      value: 25.18
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: TruthfulQA (0-shot)
+      type: truthful_qa
+      config: multiple_choice
+      split: validation
+      args:
+        num_few_shot: 0
+    metrics:
+    - type: mc2
+      value: 47.27
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Winogrande (5-shot)
+      type: winogrande
+      config: winogrande_xl
+      split: validation
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
+      value: 54.3
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: GSM8k (5-shot)
+      type: gsm8k
+      config: main
+      split: test
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
+      value: 0.0
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/Llama-68M-Chat-v1
+      name: Open LLM Leaderboard
+---
+<div style="width: auto; margin-left: auto; margin-right: auto">
+<img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<div style="display: flex; justify-content: space-between; width: 100%;">
+    <div style="display: flex; flex-direction: column; align-items: flex-start;">
+        <p style="margin-top: 0.5em; margin-bottom: 0em;">
+            Feedback and support: TensorBlock's  <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
+        </p>
+    </div>
+</div>
+## Felladrin/Llama-68M-Chat-v1 - GGUF
+This repo contains GGUF format model files for [Felladrin/Llama-68M-Chat-v1](https://huggingface.co/Felladrin/Llama-68M-Chat-v1).
+The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
+## Prompt template
+```
+<|im_start|>system
+{system_prompt}<|im_end|>
+<|im_start|>user
+{prompt}<|im_end|>
+<|im_start|>assistant
+```
+## Model file specification
+| Filename | Quant type | File Size | Description |
+| -------- | ---------- | --------- | ----------- |
+| [Llama-68M-Chat-v1-Q2_K.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q2_K.gguf) | Q2_K | 0.033 GB | smallest, significant quality loss - not recommended for most purposes |
+| [Llama-68M-Chat-v1-Q3_K_S.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q3_K_S.gguf) | Q3_K_S | 0.037 GB | very small, high quality loss |
+| [Llama-68M-Chat-v1-Q3_K_M.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q3_K_M.gguf) | Q3_K_M | 0.038 GB | very small, high quality loss |
+| [Llama-68M-Chat-v1-Q3_K_L.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q3_K_L.gguf) | Q3_K_L | 0.039 GB | small, substantial quality loss |
+| [Llama-68M-Chat-v1-Q4_0.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q4_0.gguf) | Q4_0 | 0.042 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
+| [Llama-68M-Chat-v1-Q4_K_S.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q4_K_S.gguf) | Q4_K_S | 0.042 GB | small, greater quality loss |
+| [Llama-68M-Chat-v1-Q4_K_M.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q4_K_M.gguf) | Q4_K_M | 0.043 GB | medium, balanced quality - recommended |
+| [Llama-68M-Chat-v1-Q5_0.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q5_0.gguf) | Q5_0 | 0.047 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
+| [Llama-68M-Chat-v1-Q5_K_S.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q5_K_S.gguf) | Q5_K_S | 0.047 GB | large, low quality loss - recommended |
+| [Llama-68M-Chat-v1-Q5_K_M.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q5_K_M.gguf) | Q5_K_M | 0.048 GB | large, very low quality loss - recommended |
+| [Llama-68M-Chat-v1-Q6_K.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q6_K.gguf) | Q6_K | 0.053 GB | very large, extremely low quality loss |
+| [Llama-68M-Chat-v1-Q8_0.gguf](https://huggingface.co/tensorblock/Llama-68M-Chat-v1-GGUF/tree/main/Llama-68M-Chat-v1-Q8_0.gguf) | Q8_0 | 0.068 GB | very large, extremely low quality loss - not recommended |
+## Downloading instruction
+### Command line
+Firstly, install Huggingface Client
+```shell
+pip install -U "huggingface_hub[cli]"
+```
+Then, downoad the individual model file the a local directory
+```shell
+huggingface-cli download tensorblock/Llama-68M-Chat-v1-GGUF --include "Llama-68M-Chat-v1-Q2_K.gguf" --local-dir MY_LOCAL_DIR
+```
+If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
+```shell
+huggingface-cli download tensorblock/Llama-68M-Chat-v1-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
+```