fix: partition adapter mask when batch size is specified e6e3a6f verified jupyterjazz committed 20 days ago
fix: update frequencies when updating the rope base value (#40) 8b2ad1e verified jupyterjazz committed 26 days ago
fix mixed precision loading with recent transformers versions (#39) bda5e8d verified jupyterjazz committed on Aug 27