Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
thu-ml
's Collections
STAIR
STAIR
updated
27 days ago
Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)
Upvote
1
thu-ml/STAIR-Llama-3.1-8B-SFT
Text Generation
•
Updated
28 days ago
•
38
thu-ml/STAIR-Qwen2-7B-SFT
Text Generation
•
Updated
28 days ago
•
53
•
1
thu-ml/STAIR-SFT
Viewer
•
Updated
28 days ago
•
20k
•
160
thu-ml/STAIR-Prompts
Viewer
•
Updated
28 days ago
•
63k
•
108
STAIR: Improving Safety Alignment with Introspective Reasoning
Paper
•
2502.02384
•
Published
Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3
Text Generation
•
Updated
27 days ago
•
32
•
1
thu-ml/STAIR-Llama-3.1-8B-DPO-3
Text Generation
•
Updated
27 days ago
•
34
Upvote
1
Share collection
View history
Collection guide
Browse collections