miike-ai
/

r1-12b

miike-ai commited on Mar 2

Commit

65a4b42

verified ·

1 Parent(s): 6fc6088

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,38 +1,24 @@
----
-base_model:
-- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
-library_name: transformers
-tags:
-- mergekit
-- merge
----
-# models
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the Passthrough merge method.
-### Models Merged
-The following models were included in the merge:
-* [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-slices:
-  - sources:
-    - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
-      layer_range: [0, 24]
-  - sources:
-    - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
-      layer_range: [8, 32]
-merge_method: passthrough
-dtype: bfloat16
-```

+```python
+import transformers
+import torch
+model_id = "miike-ai/r1-12b"
+pipeline = transformers.pipeline(
+    "text-generation",
+    model=model_id,
+    model_kwargs={"torch_dtype": torch.bfloat16},
+    device_map="auto",
+)
+messages = [
+    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
+    {"role": "user", "content": "Who are you?"},
+]
+outputs = pipeline(
+    messages,
+    max_new_tokens=8192,
+)
+print(outputs[0]["generated_text"][-1])
+```