mmnga commited on
Commit
fb9b83e
·
verified ·
1 Parent(s): 7626e5f

Add files using upload-large-folder tool

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Llama-3.1-Nemotron-Nano-8B-v1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-Nemotron-Nano-8B-v1-IQ4_NL.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:edcdff5b6926c0bf9f7a9246c745d1abd00564a50a9ab231130b77c61f78e344
3
+ size 4677991488
README.md ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ license: unknown
4
+ language:
5
+ - en
6
+ - ja
7
+ datasets:
8
+ - TFMC/imatrix-dataset-for-japanese-llm
9
+ base_model:
10
+ - nvidia/Llama-3.1-Nemotron-Nano-8B-v1
11
+ ---
12
+
13
+ # Llama-3.1-Nemotron-Nano-8B-v1-gguf
14
+ [nvidiaさんが公開しているLlama-3.1-Nemotron-Nano-8B-v1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1)のggufフォーマット変換版です。
15
+
16
+ imatrixのデータは[TFMC/imatrix-dataset-for-japanese-llm](https://huggingface.co/datasets/TFMC/imatrix-dataset-for-japanese-llm)を使用して作成しました。
17
+
18
+ ## Usage
19
+
20
+ ```
21
+ git clone https://github.com/ggerganov/llama.cpp.git
22
+ cd llama.cpp
23
+ cmake -B build -DGGML_CUDA=ON
24
+ cmake --build build --config Release
25
+ build/bin/llama-cli -m 'Llama-3.1-Nemotron-Nano-8B-v1-gguf' -n 128 -c 128 -p 'あなたはプロの料理人です。レシピを教えて' -cnv
26
+ ```