STAIR - a thu-ml Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

thu-ml 's Collections

STAIR

STAIR

updated Feb 26

Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)

thu-ml/STAIR-Llama-3.1-8B-SFT

Text Generation • Updated Feb 25 • 11
thu-ml/STAIR-Qwen2-7B-SFT

Text Generation • Updated Feb 25 • 29 • 1
thu-ml/STAIR-SFT

Viewer • Updated Feb 25 • 20k • 133
thu-ml/STAIR-Prompts

Viewer • Updated Feb 25 • 63k • 69
STAIR: Improving Safety Alignment with Introspective Reasoning

Paper • 2502.02384 • Published Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3

Text Generation • Updated Feb 26 • 12 • 1
thu-ml/STAIR-Llama-3.1-8B-DPO-3

Text Generation • Updated Feb 26 • 6

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs