hal

community

https://github.com/benediktstroebl/agent-eval-harness/tree/main

benediktstroebl

AI & ML interests

None defined yet.

Recent Activity

benediktstroebl

updated a dataset 3 days ago

agent-evals/hal_traces

Updated 3 days ago • 450

xuetianci99

updated a dataset 7 days ago

agent-evals/hal_traces

Updated 3 days ago • 450

Peterkirgis

updated a dataset 10 days ago

agent-evals/hal_traces

Updated 3 days ago • 450

siegelz

updated a dataset 21 days ago

agent-evals/hal_traces

Updated 3 days ago • 450

yifeizhou

updated a dataset 29 days ago

agent-evals/hal_traces

Updated 3 days ago • 450

boyiwei

authored a paper 30 days ago

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Paper • 2412.07097 • Published Dec 10, 2024 • 1

benediktstroebl

authored a paper about 1 month ago

Dynamic Risk Assessments for Offensive Cybersecurity Agents

Paper • 2505.18384 • Published May 23 • 7

boyiwei

authored a paper about 1 month ago

Dynamic Risk Assessments for Offensive Cybersecurity Agents

Paper • 2505.18384 • Published May 23 • 7

sayashk

authored a paper about 2 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

yifeizhou

authored a paper 2 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 43

yifeizhou

authored a paper 3 months ago

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Paper • 2503.15478 • Published Mar 19 • 12

yifeizhou

authored 7 papers 6 months ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

Aligning Large Language Models with Representation Editing: A Control Perspective

Paper • 2406.05954 • Published Jun 10, 2024

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Paper • 2405.10292 • Published May 16, 2024 • 2

ronch99

authored 2 papers 9 months ago

Automatic Evaluation of Attribution by Large Language Models

Paper • 2305.06311 • Published May 10, 2023

eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data

Paper • 2402.08831 • Published Feb 13, 2024

Team members 10
