SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions Paper • 2309.07045 • Published Sep 13, 2023
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement Paper • 2502.16776 • Published Feb 24 • 6
SocialEval: Evaluating Social Intelligence of Large Language Models Paper • 2506.00900 • Published Jun 1
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 15 days ago • 184
CPM: A Large-scale Generative Chinese Pre-trained Language Model Paper • 2012.00413 • Published Dec 1, 2020
CPM-2: Large-scale Cost-effective Pre-trained Language Models Paper • 2106.10715 • Published Jun 20, 2021 • 1
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation Paper • 2108.12960 • Published Aug 30, 2021
Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation Paper • 2204.10703 • Published Apr 22, 2022
A Benchmark for Understanding and Generating Dialogue between Characters in Stories Paper • 2209.08524 • Published Sep 18, 2022
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning Paper • 2409.12452 • Published Sep 19, 2024 • 1
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis Paper • 2406.19934 • Published Jun 28, 2024
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback Paper • 2402.01469 • Published Feb 2, 2024 • 1
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics Paper • 2105.08920 • Published May 19, 2021
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models Paper • 2503.02324 • Published Mar 4
A Survey on Personalized Alignment -- The Missing Piece for Large Language Models in Real-World Applications Paper • 2503.17003 • Published Mar 21
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation Paper • 2504.02438 • Published Apr 3 • 1