Judgments of WildChat-50m models and DPO datasets derived from those judgments.
-
nyu-dice-lab/nvidia_NVLM-D-72B-jdgfct-LogicalCorrectness
Viewer • Updated • 984k -
nyu-dice-lab/meta-llama_Llama-3.1-70B-Instruct-jdgfct-LogicalCorrectness
Viewer • Updated • 983k -
nyu-dice-lab/meta-llama_Llama-3.1-70B-Instruct-jdgfct-LogicalEfficiency
Viewer • Updated • 983k -
nyu-dice-lab/nvidia_NVLM-D-72B-jdgfct-LogicalEfficiency
Viewer • Updated • 984k