Add link to paper
#10 opened about 2 months ago
by
nielsr

[work in progress] Upload optimized language model w/ WebGPU-compatible GQA
#9 opened 2 months ago
by
Xenova

Is there a code for converting the model to onnx?
π
3
#7 opened 4 months ago
by
2U1
ONNX decoder model uses non-standard operators
4
#6 opened 4 months ago
by
robertknight