Amir
sahsaeedi
·
AI & ML interests
NLP, RLHF, Alignment
Recent Activity
authored
a paper
4 days ago
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs'
Memorization
authored
a paper
4 days ago
Triple Preference Optimization: Achieving Better Alignment with Less
Data in a Single Step Optimization
authored
a paper
4 days ago
When "Competency" in Reasoning Opens the Door to Vulnerability:
Jailbreaking LLMs via Novel Complex Ciphers