NX-AI/xLSTM-7b

Korbinian Pöppel committed on Dec 11, 2024

Commit fbd0d01

1 Parent(s): a7ad556

Make model card more informative.

Files changed (3)

MMLUvsTrainToken.svg ADDED

README.md CHANGED

@@ -31,4 +31,25 @@ tokenizers = AutoTokenizer.from_pretrained("NX-AI/xLSTM-7b")
 xlstm(tokenizer("Hello xLSTM, how are you doing?"))
 ```
-License: NXAI Community License (see `LICENSE` file)

+## Speed results
+Generation speed using `torch.cuda.graph` and `torch.compile` optimizations:
+![generation speed](plot_tokens_per_sec.svg)
+## Performance
+![mmlu_train_token](MMLUvsTrainToken.svg)
+Using HuggingFace's `lm_eval`:
+| BBH   | MMLU-Pro | Math  | MUSR  | GPQA  | IfEval |
+|-------|----------|-------|-------|-------|--------|
+| 0.381 | 0.242    | 0.036 | 0.379 | 0.280 | 0.244  |
+Using HuggingFace's `lighteval` in the Leaderboard-v1 settings:
+| Arc-Challenge (25-shot) | MMLU (5-shot) | Hellaswag (10-shot) | Winogrande (5-shot) | TruthfulQA (0-shot) | GSM8k (5-shot) | OpenbookQA (5-shot) | PiQA (5-shot) |
+|-------------------------|---------------|---------------------|---------------------|---------------------|----------------|---------------------|---------------|
+| 0.584                   | 0.589         | 0.710               | 0.742               | 0.420               | 0.004          | 0.443               | 0.817         |
+## License
+NXAI Community License (see `LICENSE` file)

plot_tokens_per_sec.svg ADDED