Proof of Concept Lab

community

AI & ML interests

We know something ...

Recent Activity

Gracjan authored a paper 3 days ago

What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

Gracjan authored a paper 4 days ago

Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models

rahid authored a paper 4 months ago

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

View all activity

PoCLab's activity

Gracjan

authored a paper 3 days ago

What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

Paper • 2406.03361 • Published Jun 5, 2024

Gracjan

authored a paper 4 days ago

Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models

Paper • 2409.12969 • Published Sep 2, 2024

rahid

authored a paper 4 months ago

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Paper • 2411.13543 • Published Nov 20, 2024 • 18

Gracjan

authored a paper 6 months ago

When All Options Are Wrong: Evaluating Large Language Model Robustness with Incorrect Multiple-Choice Options

Paper • 2409.00113 • Published Aug 27, 2024 • 2