Auto-Arena

community

Activity Feed

Recent Activity

xww033

authored a paper 14 days ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published 17 days ago • 92

xww033

authored 5 papers 18 days ago

Exploiting Reasoning Chains for Multi-hop Science Question Answering

Paper • 2109.02905 • Published Sep 7, 2021 • 1

From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

Paper • 2212.04755 • Published Dec 9, 2022

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Paper • 2410.01428 • Published Oct 2, 2024 • 1

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Paper • 2410.10858 • Published Oct 7, 2024

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published 20 days ago • 105

isakzhang

authored 3 papers 4 months ago

On the Multi-turn Instruction Following for Conversational Web Agents

Paper • 2402.15057 • Published Feb 23, 2024 • 1

SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia

Paper • 2502.06298 • Published Feb 10 • 1

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published Mar 2 • 65

chiayewken

authored 3 papers 8 months ago

PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

Paper • 2403.13315 • Published Mar 20, 2024

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Paper • 2410.10858 • Published Oct 7, 2024

Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models

Paper • 2409.14277 • Published Sep 22, 2024

xww033

authored a paper 11 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 59

isakzhang

authored 2 papers 11 months ago

Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

Paper • 2405.20267 • Published May 30, 2024 • 1

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 59

xww033

authored a paper about 1 year ago

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Paper • 2406.16377 • Published Jun 24, 2024 • 13

ruochenzhao

authored a paper about 1 year ago

Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

Paper • 2405.20267 • Published May 30, 2024 • 1

isakzhang

updated a Space about 1 year ago

README

📊

isakzhang

authored 2 papers about 1 year ago

Zero-Shot Text Classification via Self-Supervised Tuning

Paper • 2305.11442 • Published May 19, 2023 • 1

Easy-to-Hard Learning for Information Extraction

Paper • 2305.09193 • Published May 16, 2023

Team members 4
