JohnRoger commited on
Commit
fc08144
·
verified ·
1 Parent(s): cc70a38

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +82 -0
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: huihui-ai/Huihui-MoE-12B-A4B-abliterated
4
+ library_name: transformers
5
+ license_link: https://huggingface.co/Qwen/Qwen3-4B/blob/main/LICENSE
6
+ pipeline_tag: text-generation
7
+ tags:
8
+ - moe
9
+ - llama-cpp
10
+ - gguf-my-repo
11
+ extra_gated_prompt: '**Usage Warnings**
12
+
13
+
14
+ “**Risk of Sensitive or Controversial Outputs**“: This model’s safety filtering
15
+ has been significantly reduced, potentially generating sensitive, controversial,
16
+ or inappropriate content. Users should exercise caution and rigorously review generated
17
+ outputs.
18
+
19
+ “**Not Suitable for All Audiences**:“ Due to limited content filtering, the model’s
20
+ outputs may be inappropriate for public settings, underage users, or applications
21
+ requiring high security.
22
+
23
+ “**Legal and Ethical Responsibilities**“: Users must ensure their usage complies
24
+ with local laws and ethical standards. Generated content may carry legal or ethical
25
+ risks, and users are solely responsible for any consequences.
26
+
27
+ “**Research and Experimental Use**“: It is recommended to use this model for research,
28
+ testing, or controlled environments, avoiding direct use in production or public-facing
29
+ commercial applications.
30
+
31
+ “**Monitoring and Review Recommendations**“: Users are strongly advised to monitor
32
+ model outputs in real-time and conduct manual reviews when necessary to prevent
33
+ the dissemination of inappropriate content.
34
+
35
+ “**No Default Safety Guarantees**“: Unlike standard models, this model has not undergone
36
+ rigorous safety optimization. huihui.ai bears no responsibility for any consequences
37
+ arising from its use.'
38
+ ---
39
+
40
+ # JohnRoger/Huihui-MoE-12B-A4B-abliterated-Q8_0-GGUF
41
+ This model was converted to GGUF format from [`huihui-ai/Huihui-MoE-12B-A4B-abliterated`](https://huggingface.co/huihui-ai/Huihui-MoE-12B-A4B-abliterated) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
42
+ Refer to the [original model card](https://huggingface.co/huihui-ai/Huihui-MoE-12B-A4B-abliterated) for more details on the model.
43
+
44
+ ## Use with llama.cpp
45
+ Install llama.cpp through brew (works on Mac and Linux)
46
+
47
+ ```bash
48
+ brew install llama.cpp
49
+
50
+ ```
51
+ Invoke the llama.cpp server or the CLI.
52
+
53
+ ### CLI:
54
+ ```bash
55
+ llama-cli --hf-repo JohnRoger/Huihui-MoE-12B-A4B-abliterated-Q8_0-GGUF --hf-file huihui-moe-12b-a4b-abliterated-q8_0.gguf -p "The meaning to life and the universe is"
56
+ ```
57
+
58
+ ### Server:
59
+ ```bash
60
+ llama-server --hf-repo JohnRoger/Huihui-MoE-12B-A4B-abliterated-Q8_0-GGUF --hf-file huihui-moe-12b-a4b-abliterated-q8_0.gguf -c 2048
61
+ ```
62
+
63
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
64
+
65
+ Step 1: Clone llama.cpp from GitHub.
66
+ ```
67
+ git clone https://github.com/ggerganov/llama.cpp
68
+ ```
69
+
70
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
71
+ ```
72
+ cd llama.cpp && LLAMA_CURL=1 make
73
+ ```
74
+
75
+ Step 3: Run inference through the main binary.
76
+ ```
77
+ ./llama-cli --hf-repo JohnRoger/Huihui-MoE-12B-A4B-abliterated-Q8_0-GGUF --hf-file huihui-moe-12b-a4b-abliterated-q8_0.gguf -p "The meaning to life and the universe is"
78
+ ```
79
+ or
80
+ ```
81
+ ./llama-server --hf-repo JohnRoger/Huihui-MoE-12B-A4B-abliterated-Q8_0-GGUF --hf-file huihui-moe-12b-a4b-abliterated-q8_0.gguf -c 2048
82
+ ```