Momento-No-More
Collection
2 items
โข
Updated
This model is based on Llama-3.1-70B-Instruct, fine-tuned on the ToolQA dataset for multi-step tool-use reasoning tasks.
For detailed information on the training methodology, architecture, and evaluations, please refer to our paper:
Alakuijala, M., Gao, Y., Ananov, G., Kaski, S., Marttinen, P., Ilin, A., & Valpola, H. (2025). Memento No More: Coaching AI Agents to Master Multiple Tasks via Hints Internalization. arXiv preprint arXiv:2502.01562.
Base model
meta-llama/Llama-3.1-70B