The UltraSharp V2 FP16 ONNX model produces an inference error.

#3
by nukui - opened

I am using ONNX Runtime version 1.22.0, which performs well for FP32 model inference and successfully runs the UltraSharp Lite FP16 model. However, for the UltraSharp FP16 model, it returns a tensor filled with NaN values.
Can you explain your approach to running inference on the FP16 ONNX model? I'd really appreciate it.
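For reference, this is a minimal sketch of how FP16 inference with ONNX Runtime is typically set up; the file name, input shape, and NCHW layout are assumptions for illustration, not details from this repository:

```python
# Minimal FP16 inference sketch with ONNX Runtime.
# Assumptions: single image input in NCHW layout, values in [0, 1],
# and a hypothetical model file name.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession(
    "4xUltraSharpV2_fp16.onnx",  # hypothetical path
    providers=["CPUExecutionProvider"],
)
input_name = session.get_inputs()[0].name

# An FP16 model expects float16 tensors; feeding float32 raises a type error.
image = np.random.rand(1, 3, 64, 64).astype(np.float16)
output = session.run(None, {input_name: image})[0]

# If the conversion itself is broken, the result shows up as NaN/Inf here.
print("contains NaN:", np.isnan(output).any())
```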

@nukui Apologies. I should not have uploaded an FP16 version; the DAT2 architecture is not numerically stable in that format, so the conversion is not usable.
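As a rough illustration of why such a conversion can break, a static scan like the sketch below can flag FP32 weights that overflow FP16's representable range (about 65504). The file name is hypothetical, and this check cannot catch runtime activation overflow, which is the more likely failure mode for DAT-style models:

```python
# Hedged sketch: scan FP32 initializers for values outside FP16 range
# before converting. The model path is an assumption for illustration.
import numpy as np
import onnx
from onnx import numpy_helper

FP16_MAX = np.finfo(np.float16).max  # 65504.0

model = onnx.load("4xUltraSharpV2_fp32.onnx")  # hypothetical path
for init in model.graph.initializer:
    weights = numpy_helper.to_array(init)
    if weights.dtype == np.float32 and weights.size:
        peak = np.abs(weights).max()
        if peak > FP16_MAX:
            print(f"{init.name}: max |w| = {peak:.1f} overflows FP16")
```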

Thanks for your reply; the other models work great!

nukui changed discussion status to closed
