I'm trying to use this model in medicine where explain ability place a big role. When I set the output_attentions parameter to True. I get None back. Perhaps it is being cleared during the forward pass?
Anyone know how to get that attention scores from this model?