Enable flash_attention_2 support since the underlying Mistral model supports it
#3 · opened by winglian
No description provided.
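For context, a minimal sketch of what this change enables: loading a Mistral-based model with FlashAttention 2 via the standard transformers API. The model id below is illustrative, not necessarily the checkpoint this repo hosts.

```python
# Minimal sketch (not code from this PR): enabling flash_attention_2
# through the transformers loading API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"  # illustrative; substitute the actual repo

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,               # FlashAttention 2 requires fp16/bf16
    attn_implementation="flash_attention_2",  # uses the flash-attn kernels if installed
)
```

This requires the `flash-attn` package and a supported GPU; otherwise loading falls back to (or should be switched to) the default attention implementation.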
lievan changed pull request status to merged