Haritz Puerto

haritzpuerto

https://haritzpuerto.github.io

HaritzPuerto

AI & ML interests

Reasoning in LLMs, AI safety, agents

Recent Activity

published a model about 2 months ago

compass-group-tue/nemotron-6-traits

published a model about 2 months ago

compass-group-tue/Nemotron-Traits-Seed-44

updated a model about 2 months ago

compass-group-tue/nemotron-traits

Posts 3

📜 Accepted at ACL 2025! Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs
We propose fine-tuning LLMs to generate diverse chains of thought (DCoT) in a single inference step. This enables within-inference refinement of the CoTs, with no external feedback needed!
🔗 https://arxiv.org/abs/2407.03181

Articles 1

Problem Solving with Language Models

Collections 4

Papers 13

arxiv:2605.28591

arxiv:2602.24210

arxiv:2506.15674

arxiv:2506.11097

Models 42

haritzpuerto/microsoft-Phi-4-14B-IF-RT

Text Generation • Updated Mar 2 • 6

haritzpuerto/microsoft-Phi-4-14B-IF-FA

Text Generation • Updated Mar 2 • 3

haritzpuerto/microsoft-Phi-4-14B-IF-Avg

Text Generation • Updated Mar 2 • 2

haritzpuerto/unsloth-Phi-4-3.8B-IF-RT

Text Generation • Updated Mar 2 • 9

haritzpuerto/unsloth-Phi-4-3.8B-IF-FA

Text Generation • Updated Mar 2 • 2

haritzpuerto/unsloth-Qwen3-14B-IF-RT

Text Generation • Updated Mar 2 • 4

haritzpuerto/unsloth-Qwen3-14B-IF-FA

Text Generation • Updated Mar 2 • 2

haritzpuerto/unsloth-Qwen3-14B-IF-Avg

Text Generation • Updated Mar 2 • 1

haritzpuerto/unsloth-Qwen3-8B-IF-RT

Text Generation • Updated Mar 2 • 6

haritzpuerto/unsloth-Qwen3-8B-IF-FA

Text Generation • Updated Mar 2 • 6

Datasets 24

haritzpuerto/instruction-following-reasoning-traces

Viewer • Updated Mar 2 • 7k • 36

haritzpuerto/math-if

Viewer • Updated Mar 2 • 422 • 48

haritzpuerto/ifeval-lrm

Viewer • Updated Mar 2 • 541 • 26

haritzpuerto/password_eval-contextual-integrity

Viewer • Updated Mar 2 • 1k • 53

haritzpuerto/controlling-reasoning-models-privacy-outputs

Updated Mar 2 • 38

haritzpuerto/PEEP-contextual-integrity

Viewer • Updated Feb 26 • 2.06k • 56

haritzpuerto/OpenThoughts3-5k_len

Viewer • Updated Aug 11, 2025 • 38.7k • 9

haritzpuerto/benchmark

Viewer • Updated May 9, 2025 • 16.4k • 31

haritzpuerto/nq-300

Viewer • Updated Mar 26, 2025 • 1.5k • 21

haritzpuerto/datasets

Viewer • Updated Mar 23, 2025 • 3.19k • 22
