Models related to paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
韩沛煊
HakHan
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
19 days ago
SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals
updated
a model
19 days ago
HakHan/SafeSwitch
updated
a model
19 days ago
HakHan/SafeSwitch