Upload bias_description_dataset-biggenbench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub 2f80cca verified felipemaiapolo commited on 26 days ago
Upload bias_dataset-biggenbench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub 9eed333 verified felipemaiapolo commited on 26 days ago
Upload bias_description_dataset-biggenbench_judge-gpt-4o-mini_logprob-False_max-evals-5000.npy with huggingface_hub ac30693 verified felipemaiapolo commited on 26 days ago
Upload dataset-chatbot_arena_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub 7872b10 verified felipemaiapolo commited on 27 days ago
Upload dataset-chatbot_arena_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub 478b9f0 verified felipemaiapolo commited on 27 days ago
Upload dataset-chatbot_arena_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub 2bde5af verified felipemaiapolo commited on 27 days ago
Upload dataset-biggenbench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub ce7df6f verified felipemaiapolo commited on 28 days ago
Upload description_dataset-financebench_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub fe6fc19 verified felipemaiapolo commited on 28 days ago
Upload description_dataset-financebench_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub ee136a6 verified felipemaiapolo commited on 28 days ago
Upload description_dataset-financebench_judge-llama-3.1-8b_logprob-True_max-evals-5000.npy with huggingface_hub 85cb99f verified felipemaiapolo commited on 28 days ago
Upload dataset-biggenbench_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub 16eb723 verified felipemaiapolo commited on 28 days ago
Upload description_dataset-chatbot_arena_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub 071a6c9 verified felipemaiapolo commited on 29 days ago
Upload description_dataset-biggenbench_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub 42d18ca verified felipemaiapolo commited on 29 days ago
Upload description_dataset-chatbot_arena_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub 9469cb2 verified felipemaiapolo commited on 29 days ago
Upload description_dataset-financebench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub 3085283 verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-selene-1-mini_logprob-True_max-evals-5000.npy with huggingface_hub 9229f97 verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-llama-3.1-8b_logprob-True_max-evals-5000.npy with huggingface_hub d04a0e6 verified felipemaiapolo commited on 29 days ago
Upload dataset-biggenbench_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub ea6142d verified felipemaiapolo commited on 29 days ago
Upload description_dataset-financebench_judge-prometheus_logprob-True_max-evals-5000.npy with huggingface_hub 8b30c4c verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub ff34b0c verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-prometheus_logprob-True_max-evals-5000.npy with huggingface_hub 09bc26f verified felipemaiapolo commited on 29 days ago
Upload description_dataset-financebench_judge-selene-1-mini_logprob-True_max-evals-5000.npy with huggingface_hub 2872d1c verified felipemaiapolo commited on 29 days ago
Upload description_dataset-chatbot_arena_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub 6af9839 verified felipemaiapolo commited on 29 days ago
Upload description_dataset-biggenbench_judge-llama-3.1-8b_logprob-False_max-evals-5000.npy with huggingface_hub f404673 verified felipemaiapolo commited on 29 days ago
Upload description_dataset-biggenbench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub fc8a028 verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-prometheus_logprob-False_max-evals-5000.npy with huggingface_hub cc00368 verified felipemaiapolo commited on 29 days ago
Upload dataset-financebench_judge-selene-1-mini_logprob-False_max-evals-5000.npy with huggingface_hub bce955f verified felipemaiapolo commited on 29 days ago
Upload dataset-biggenbench_judge-prometheus_logprob-False_max-evals-3.npy with huggingface_hub 878204f verified felipemaiapolo commited on about 1 month ago
Upload dataset-biggenbench_judge-llama-3.1-8b_logprob-True_max-evals-5000.npy with huggingface_hub e030353 verified felipemaiapolo commited on about 1 month ago
Upload dataset-chatbot_arena_judge-selene-1-mini_logprob-True_max-evals-5000.npy with huggingface_hub 3841eee verified felipemaiapolo commited on about 1 month ago
Upload dataset-biggenbench_judge-selene-1-mini_logprob-True_max-evals-5000.npy with huggingface_hub 2e9c620 verified felipemaiapolo commited on about 1 month ago
Upload dataset-chatbot_arena_judge-prometheus_logprob-True_max-evals-5000.npy with huggingface_hub a25c505 verified felipemaiapolo commited on about 1 month ago
Upload dataset-biggenbench_judge-prometheus_logprob-True_max-evals-5000.npy with huggingface_hub 2157666 verified felipemaiapolo commited on about 1 month ago
Upload dataset-chatbot_arena_judge-llama-3.1-8b_logprob-True_max-evals-5000.npy with huggingface_hub 6058531 verified felipemaiapolo commited on about 1 month ago