Update README.md
Browse files
README.md
CHANGED
@@ -102,6 +102,37 @@ outputs = model.generate(**inputs, max_new_tokens=20)
|
|
102 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
103 |
```
|
104 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
105 |
> [!TIP]
|
106 |
> Unlike previous Mistral models, Mistral Nemo requires smaller temperatures. We recommend using a temperature of 0.3.
|
107 |
|
|
|
102 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
103 |
```
|
104 |
|
105 |
+
## Accelerate mode:
|
106 |
+
|
107 |
+
```py
|
108 |
+
from transformers import AutoModelForCausalLM, AutoTokenizer
|
109 |
+
from accelerate import Accelerator
|
110 |
+
|
111 |
+
# Initialize the accelerator
|
112 |
+
accelerator = Accelerator()
|
113 |
+
|
114 |
+
# Define the model ID
|
115 |
+
model_id = "EpistemeAI/Fireball-12B"
|
116 |
+
|
117 |
+
# Load the tokenizer
|
118 |
+
tokenizer = AutoTokenizer.from_pretrained(model_id)
|
119 |
+
|
120 |
+
# Load the model and prepare it for distributed setup using accelerate
|
121 |
+
model = AutoModelForCausalLM.from_pretrained(model_id)
|
122 |
+
|
123 |
+
# Move the model to the appropriate device using accelerate
|
124 |
+
model = accelerator.prepare(model)
|
125 |
+
|
126 |
+
# Prepare inputs
|
127 |
+
inputs = tokenizer("Hello my name is", return_tensors="pt").to(accelerator.device)
|
128 |
+
|
129 |
+
# Generate outputs with the model
|
130 |
+
outputs = model.generate(**inputs, max_new_tokens=20)
|
131 |
+
|
132 |
+
# Decode and print the outputs
|
133 |
+
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
134 |
+
```
|
135 |
+
|
136 |
> [!TIP]
|
137 |
> Unlike previous Mistral models, Mistral Nemo requires smaller temperatures. We recommend using a temperature of 0.3.
|
138 |
|