Zhuoran Jin's picture

5 5 16

Zhuoran Jin

jinzhuoran

·

jinzhuoran

AI & ML interests

NLP

Recent Activity

liked a dataset 6 days ago

Pokerwf/KnowLogic

upvoted a paper about 1 month ago

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

authored a paper about 1 month ago

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

View all activity

Organizations

None yet

authored 2 papers about 1 month ago

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Paper • 2506.04142 • Published Jun 4 • 27

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Paper • 2506.04141 • Published Jun 4 • 29

authored a paper 7 months ago

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Paper • 2412.13746 • Published Dec 18, 2024 • 9

authored 2 papers about 1 year ago

Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models

Paper • 2402.18154 • Published Feb 28, 2024

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Paper • 2406.10890 • Published Jun 16, 2024 • 1