Vasily Konovalov
Vasily
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
When Models Lie, We Learn: Multilingual Span-Level Hallucination
Detection with PsiloQA
upvoted
a
paper
3 days ago
The Rogue Scalpel: Activation Steering Compromises LLM Safety
upvoted
a
paper
3 days ago
OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features