B-score: Detecting biases in large language models using response history • Paper • 2505.18545 • Published 18 days ago