Data artifacts related to the paper "ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning".

Columbia NLP (university)
AI & ML interests: Natural language processing group at Columbia University
Columbia University - NLP
models (20)

Columbia-NLP/LION-Gemma-2b-sft-v1.0 • Text Generation • 27 downloads
Columbia-NLP/LION-Gemma-2b-dpo-v1.0 • Text Generation • 5 downloads
Columbia-NLP/LION-Gemma-2b-odpo-v1.0 • Text Generation • 11 downloads • 4 likes
Columbia-NLP/LION-LLaMA-3-8b-sft-v1.0 • Text Generation • 10 downloads
Columbia-NLP/LION-LLaMA-3-8b-dpo-v1.0 • Text Generation • 12 downloads • 2 likes
Columbia-NLP/LION-LLaMA-3-8b-odpo-v1.0 • Text Generation • 11 downloads • 2 likes
Columbia-NLP/llama3-8b-instruct-rewriting-r-Decor • Text Generation • 9 downloads
Columbia-NLP/llama3-8b-instruct-rewriting-nr-Decor • Text Generation • 5 downloads
Columbia-NLP/llama2-7b-rewriting-r-Decor • Text Generation • 5 downloads
Columbia-NLP/llama2-7b-rewriting-nr-Decor • Text Generation • 5 downloads

datasets (19)

Columbia-NLP/PUPA • 901 rows • 64 downloads • 1 like
Columbia-NLP/ExACT-VWA • 176 rows • 45 downloads
Columbia-NLP/DPO-hh-rlhf • 169k rows • 59 downloads
Columbia-NLP/DPO-PKU-SafeRLHF • 136k rows • 63 downloads • 1 like
Columbia-NLP/DPO-HelpSteer • 9.17k rows • 62 downloads
Columbia-NLP/DPO-tldr-summarisation-preferences • 177k rows • 79 downloads
Columbia-NLP/DPO-py-dpo-v0.1 • 9.47k rows • 43 downloads
Columbia-NLP/DPO-UltraFeedback_binarized • 62.7k rows • 57 downloads
Columbia-NLP/DPO-distilabel-intel-orca-dpo-pairs_cleaned • 12.8k rows • 44 downloads
Columbia-NLP/DPO-distilabel-capybara-dpo-7k-binarized • 7.56k rows • 60 downloads