Add/update the quantized ONNX model files and README.md for Transformers.js v3
Applied Quantizations
✅ Based on vision_model.onnx with slimming
↳ ✅ int8: vision_model_int8.onnx (added)
↳ ✅ uint8: vision_model_uint8.onnx (added)
↳ ✅ q4: vision_model_q4.onnx (added)
↳ ✅ q4f16: vision_model_q4f16.onnx (added)
↳ ✅ bnb4: vision_model_bnb4.onnx (added)
❌ Based on model.onnx with slimming
None
↳ ✅ fp16: model_fp16.onnx (added)
↳ ❌ int8: model_int8.onnx (added but JS-based E2E test failed)
/home/ubuntu/src/tjsmigration/node_modules/.pnpm/[email protected]/node_modules/onnxruntime-node/dist/backend.js:25
__classPrivateFieldGet(this, _OnnxruntimeSessionHandler_inferenceSession, "f").loadModel(pathOrBuffer, options);
^
Error: Could not find an implementation for ConvInteger(10) node with name '/vision_model/embeddings/patch_embedding/Conv_quant'
at new OnnxruntimeSessionHandler (/home/ubuntu/src/tjsmigration/node_modules/.pnpm/[email protected]/node_modules/onnxruntime-node/dist/backend.js:25:92)
at Immediate.<anonymous> (/home/ubuntu/src/tjsmigration/node_modules/.pnpm/[email protected]/node_modules/onnxruntime-node/dist/backend.js:67:29)
at process.processImmediate (node:internal/timers:485:21)
Node.js v22.16.0
↳ ✅ uint8: model_uint8.onnx (added)
↳ ✅ q4: model_q4.onnx (added)
↳ ✅ q4f16: model_q4f16.onnx (added)
↳ ✅ bnb4: model_bnb4.onnx (added)
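
Since model_int8.onnx fails to load under onnxruntime-node (no implementation for the ConvInteger node) while the other variants pass, a caller may want to skip the failing variant. The following is a minimal sketch of that idea, not part of this PR: `pickDtype` and `FAILED_E2E` are hypothetical names, and the failure set simply mirrors the results listed above for model.onnx.

```javascript
// Hypothetical: dtypes whose JS-based E2E test failed for model.onnx in this PR.
// model_int8.onnx could not be loaded because ConvInteger(10) has no implementation.
const FAILED_E2E = new Set(["int8"]);

// Hypothetical helper: return the first preferred dtype that is not known to fail.
function pickDtype(preferred, failed = FAILED_E2E) {
  for (const dtype of preferred) {
    if (!failed.has(dtype)) return dtype;
  }
  throw new Error(`No usable dtype among: ${preferred.join(", ")}`);
}

// int8 is skipped, so the next preference is chosen.
const dtype = pickDtype(["int8", "uint8", "q4"]);
console.log(dtype); // uint8
```

The selected value could then be supplied as the `dtype` option when loading the model with Transformers.js v3. Whether a fallback of this kind is appropriate depends on the accuracy and size trade-offs of each variant, which this PR does not report.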
✅ Based on text_model.onnx with slimming
↳ ✅ int8: text_model_int8.onnx (added)
↳ ✅ uint8: text_model_uint8.onnx (added)
↳ ✅ q4: text_model_q4.onnx (added)
↳ ✅ q4f16: text_model_q4f16.onnx (added)
↳ ✅ bnb4: text_model_bnb4.onnx (added)
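
The files added here follow a visible naming pattern: the base model name, an underscore, the quantization type, then `.onnx`. As an illustration only, the sketch below encodes that pattern; `quantizedFileName` and `parseQuantizedFileName` are hypothetical helpers, and the list of types is limited to the ones that appear in this PR.

```javascript
// Quantization types as they appear in the file names listed in this PR.
const KNOWN_DTYPES = ["fp16", "int8", "uint8", "q4", "q4f16", "bnb4"];

// Hypothetical helper: ("text_model", "q4f16") -> "text_model_q4f16.onnx".
function quantizedFileName(baseName, dtype) {
  if (!KNOWN_DTYPES.includes(dtype)) {
    throw new Error(`Unknown dtype: ${dtype}`);
  }
  return `${baseName}_${dtype}.onnx`;
}

// Hypothetical inverse: split a file name into its base name and dtype.
// Returns null for names without a known dtype suffix (e.g. "vision_model.onnx").
function parseQuantizedFileName(fileName) {
  const match = /^(.+)_([a-z0-9]+)\.onnx$/.exec(fileName);
  if (!match || !KNOWN_DTYPES.includes(match[2])) return null;
  return { baseName: match[1], dtype: match[2] };
}

console.log(quantizedFileName("text_model", "q4f16")); // text_model_q4f16.onnx
console.log(parseQuantizedFileName("vision_model_int8.onnx")); // { baseName: 'vision_model', dtype: 'int8' }
```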