RoleMRC A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Junrulu/RoleMRC Preview • Updated Mar 20 • 52 • 3 RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Paper • 2502.11387 • Published Feb 17 jiazhengli/Llama-3.1-8B-RoleMRC-dpo 8B • Updated Mar 11 • 1 jiazhengli/Llama-3.1-8B-RoleMRC-sft 8B • Updated Mar 11 • 1
RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Paper • 2502.11387 • Published Feb 17
MCTS with Preference Optimisation Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring jiazhengli/Rationale_MCTS Viewer • Updated Oct 14, 2024 • 8.71k • 20 • 2 jiazhengli/Synthetic_Rationale Viewer • Updated Oct 14, 2024 • 32.9k • 31 • 1 jiazhengli/deberta-v3-large-Rationale-to-Score Text Classification • 0.4B • Updated Jul 4, 2024 • 4 • 1 jiazhengli/Meta-Llama-3-8B-QLoRA-Assessment-Rationale-sft Updated Oct 14, 2024 • 2
jiazhengli/deberta-v3-large-Rationale-to-Score Text Classification • 0.4B • Updated Jul 4, 2024 • 4 • 1
RoleMRC A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Junrulu/RoleMRC Preview • Updated Mar 20 • 52 • 3 RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Paper • 2502.11387 • Published Feb 17 jiazhengli/Llama-3.1-8B-RoleMRC-dpo 8B • Updated Mar 11 • 1 jiazhengli/Llama-3.1-8B-RoleMRC-sft 8B • Updated Mar 11 • 1
RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Paper • 2502.11387 • Published Feb 17
MCTS with Preference Optimisation Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring jiazhengli/Rationale_MCTS Viewer • Updated Oct 14, 2024 • 8.71k • 20 • 2 jiazhengli/Synthetic_Rationale Viewer • Updated Oct 14, 2024 • 32.9k • 31 • 1 jiazhengli/deberta-v3-large-Rationale-to-Score Text Classification • 0.4B • Updated Jul 4, 2024 • 4 • 1 jiazhengli/Meta-Llama-3-8B-QLoRA-Assessment-Rationale-sft Updated Oct 14, 2024 • 2
jiazhengli/deberta-v3-large-Rationale-to-Score Text Classification • 0.4B • Updated Jul 4, 2024 • 4 • 1