Could you please convert your model to ONNX? I want to test it with TensorRT for inference.

I am trying to convert it to ONNX myself but am getting the following error:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper__index_select)
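This error typically surfaces during torch.onnx.export when the model weights and the tracing inputs live on different devices, or when an index tensor fed to index_select is created on the CPU inside forward. Below is a minimal sketch of a device-consistent export; the model, input shape, and file name are placeholders, not this repo's actual model:

```python
import torch
import torch.nn as nn

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Placeholder model -- substitute the actual transformer from this repo.
model = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True),
    num_layers=2,
).to(device)
model.eval()

# The dummy input used for tracing must live on the SAME device as the
# model weights; a CPU tensor here is a common cause of the
# index_select device-mismatch error above.
dummy_input = torch.randn(1, 16, 64, device=device)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    opset_version=13,
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
)
```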
It seems that some of the model's layers are running on the CPU even though I set the device to GPU. A profiling screenshot is attached below. Please let me know whether this is expected. How can we run the complete transformer model (so that all tensors are generated) on the GPU?
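One way to locate the stragglers is to report every parameter and buffer that is not on a CUDA device. A sketch assuming a generic torch.nn.Module; report_cpu_tensors is a hypothetical helper, not part of this repo:

```python
import torch

def report_cpu_tensors(model: torch.nn.Module) -> None:
    """Print any parameters or buffers that are not on a CUDA device."""
    for name, param in model.named_parameters():
        if param.device.type != "cuda":
            print(f"parameter {name} is on {param.device}")
    for name, buf in model.named_buffers():
        if buf.device.type != "cuda":
            print(f"buffer {name} is on {buf.device}")

# Usage (model is whatever module you profiled):
# report_cpu_tensors(model)
```

Note that tensors created on the fly inside forward (for example, position indices built with torch.arange without a device= argument) will not appear in this report, since they are neither parameters nor buffers; those are a common cause of index_select running on the CPU even when model.to("cuda") was called.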