yagao403
/

llama3.1-70B-momento-no-more-ToolQA

Model card Files Files and versions Community

Llama-3.1-70B-Instruct + ToolQA (Finetuned)

This model is based on Llama-3.1-70B-Instruct, fine-tuned on the ToolQA dataset for multi-step tool-use reasoning tasks.

Training Details

Dataset: ToolQA – a benchmark designed for evaluating agents' tool-use capabilities in complex environments.
Training Framework: Memento-No-More – a novel framework for teaching models to internalize hints and perform multi-skill reasoning.
Fine-tuning Rounds: 3
Model Base: Llama-3.1-70B-Instruct

Reference

For detailed information on the training methodology, architecture, and evaluations, please refer to our paper:

Alakuijala, M., Gao, Y., Ananov, G., Kaski, S., Marttinen, P., Ilin, A., & Valpola, H. (2025). Memento No More: Coaching AI Agents to Master Multiple Tasks via Hints Internalization. arXiv preprint arXiv:2502.01562.

Downloads last month: 6

Safetensors

Model size

70.6B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for yagao403/llama3.1-70B-momento-no-more-ToolQA

Base model

meta-llama/Llama-3.1-70B

Finetuned

meta-llama/Llama-3.1-70B-Instruct

Finetuned

(78)

this model

Collection including yagao403/llama3.1-70B-momento-no-more-ToolQA

Momento-No-More

2 items • Updated Jun 1