University of Waterloo

university

Verified

https://uwaterloo.ca/

Recent Activity

dora2023 submitted a paper 12 days ago

Diversed Model Discovery via Structured Table Discovery

HideOnBush submitted a paper about 2 months ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

lllqaq authored a paper about 2 months ago

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

Papers

submitted a paper to Daily Papers 12 days ago

Diversed Model Discovery via Structured Table Discovery

Paper • 2605.22766 • Published 14 days ago • 6

authored a paper about 2 months ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

submitted a paper to Daily Papers about 2 months ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 95

submitted a paper to Daily Papers 2 months ago

SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

Paper • 2603.13669 • Published Mar 14 • 1

submitted a paper to Daily Papers 5 months ago

Shape of Thought: When Distribution Matters More than Correctness in Reasoning Tasks

Paper • 2512.22255 • Published Dec 24, 2025 • 6

submitted a paper to Daily Papers 6 months ago

ModelTables: A Corpus of Tables about Models

Paper • 2512.16106 • Published Dec 18, 2025 • 10

authored a paper 7 months ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24, 2025 • 22

authored a paper 7 months ago

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26, 2025 • 26

authored 2 papers 8 months ago

Bench-NPIN: Benchmarking Non-prehensile Interactive Navigation

Paper • 2505.12084 • Published May 17, 2025 • 2

Real-Time Navigation for Autonomous Surface Vehicles In Ice-Covered Waters

Paper • 2302.11601 • Published Feb 22, 2023

authored a paper 10 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8, 2025 • 42

authored 2 papers about 1 year ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2, 2024 • 37

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26, 2025 • 19

authored 7 papers about 1 year ago

Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement Learning

Paper • 2503.06034 • Published Mar 8, 2025 • 1

Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality

Paper • 2505.02466 • Published May 5, 2025 • 1

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20, 2025 • 24

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1, 2025 • 43

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models

Paper • 2310.07712 • Published Oct 11, 2023 • 1

PixelWorld: Towards Perceiving Everything as Pixels

Paper • 2501.19339 • Published Jan 31, 2025 • 17

DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers

Paper • 2502.18460 • Published Feb 25, 2025 • 3