mlx-lm fails to quantize the larger sibling of the linear attention model (Ring Flash Linear v2)