Add link to paper
#10 opened 3 months ago
by
nielsr

[work in progress] Upload optimized language model w/ WebGPU-compatible GQA
#9 opened 4 months ago
by
Xenova

Is there a code for converting the model to onnx?
π
π
4
#7 opened 6 months ago
by
2U1
ONNX decoder model uses non-standard operators
4
#6 opened 6 months ago
by
robertknight