Article
The Transformers Library: standardizing model definitions
By
and 3 others
β’
β’
110Nice blog!
@osanseviero
we have been doing this in TGI and TEI for a while ;)
Padding free implementations also make dynamic batching easier to implement and more predictable in memory.