Create README.md
README.md
ADDED
@@ -0,0 +1,75 @@
+---
+license: apache-2.0
+language:
+- en
+- it
+base_model:
+- mistralai/Magistral-Small-2506
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- ita
+- italian
+- anita
+- magistral
+- 24b
+- uniba
+- bari
+- italy
+- italia
+- Conversational
+- LLaMantino
+---
+
+<img src="https://huggingface.co/m-polignano/ANITA-NEXT-24B-Magistral-2506-ITA/resolve/main/Anita-Next_full.png" alt="anita_next" border="0" width="600px">
+<hr>
+<!--<img src="https://i.ibb.co/6mHSRm3/llamantino53.jpg" width="200"/>-->
+<h3><i>"Built on <b>mistralai/Magistral-Small-2506</b>"</i></h3>
+<p style="text-align:justify;"><b>ANITA-NEXT-24B-Magistral-2506-ITA</b> is a <b>Thinking Model</b> in the <a href="https://arxiv.org/abs/2405.07101"><b>ANITA</b></a> <i>family of Large Language Models</i>.
+The model is a fine-tuned version of <a href="https://huggingface.co/mistralai/Magistral-Small-2506"><b>Magistral-Small-2506</b></a> (a fine-tuned <b>Mistral model</b>).
+This model version aims to be a <b>Multilingual Model</b> 🏁 (EN 🇺🇸 + ITA 🇮🇹) suitable for further fine-tuning on specific tasks in Italian.</p>
+
+❗❗❗Use at your own risk. The model may hallucinate and generate incorrect, invented, offensive, unethical, or dangerous responses. We are not responsible for any dangerous/offensive/criminal use. The model is released for research purposes only.❗❗❗
+
+
+The 🌟**ANITA project**🌟 *(**A**dvanced **N**atural-based interaction for the **ITA**lian language)*
+aims to provide Italian NLP researchers with an improved model for Italian-language 🇮🇹 use cases.
+
+The **NEXT** family includes **four models**:
+- m-polignano/ANITA-NEXT-24B-Magistral-2506-ITA - **General Purpose**
+- m-polignano/ANITA-NEXT-24B-Dolphin-Mistral-UNCENSORED-ITA - **Uncensored**
+- m-polignano/ANITA-NEXT-24B-Magistral-2506-VISION-ITA - **Vision-Language**
+- m-polignano/ANITA-NEXT-20B-GPT-OSS-ITA - **Agentic Ready**
+
+<hr>
+
+**Full Model**: [m-polignano/ANITA-NEXT-24B-Magistral-2506-ITA](https://huggingface.co/m-polignano/ANITA-NEXT-24B-Magistral-2506-ITA)
+
+<hr>
+
+<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/made with unsloth.png" width="200px" align="center" />
+
+For *Ollama inference*, follow the [Unsloth Documentation](https://docs.unsloth.ai/basics/running-and-saving-models/saving-to-ollama).
+
+<hr>
+
+
+## Citation instructions
+```bibtex
+@misc{polignano2024advanced,
+title={Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA},
+author={Marco Polignano and Pierpaolo Basile and Giovanni Semeraro},
+year={2024},
+eprint={2405.07101},
+archivePrefix={arXiv},
+primaryClass={cs.CL}
+}
+```
+```bibtex
+@article{rastogi2025magistral,
+title={Magistral},
+author={Rastogi, Abhinav and Jiang, Albert Q and Lo, Andy and Berrada, Gabrielle and Lample, Guillaume and Rute, Jason and Barmentlo, Joep and Yadav, Karmesh and Khandelwal, Kartik and Chandu, Khyathi Raghavi and others},
+journal={arXiv preprint arXiv:2506.10910},
+year={2025}
+}
+```