I'm currently using DETR for object detection, and I want to convert the model as follows: PyTorch -> ONNX -> TensorRT. I have the conversion code working and have tested the model, achieving the same accuracy in all three formats. The problem is that the model is in FP32, and when I convert the whole thing to FP16 I lose a lot of accuracy. My idea is to convert only some layers to FP16 and leave the rest in FP32 to preserve as much accuracy as possible.
My question is: how do I convert specific layers of the TensorRT model to FP16? I couldn't find any documentation on this. Any and all help is appreciated.
Infer using mixed precision in TensorRT
147 views · Asked by Faisal Hejary · 0 answers
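Since there are no answers yet, here is a minimal sketch of one approach. TensorRT's Python builder API exposes per-layer precision control: enable the FP16 builder flag globally, then pin individual layers back to FP32 via `layer.precision` and `layer.set_output_type`, and set `BuilderFlag.OBEY_PRECISION_CONSTRAINTS` so the builder honors those pins instead of treating them as hints. The file name `detr.onnx` and the layer-selection policy below are assumptions for illustration, and this targets the TensorRT 8.x API; it needs an NVIDIA GPU and a TensorRT install to actually run.

```python
# Sketch: build a mixed-precision TensorRT engine from an ONNX file,
# keeping numerically sensitive layers in FP32 (TensorRT 8.x Python API).
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

# "detr.onnx" is a hypothetical path to your exported model.
with open("detr.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("ONNX parse failed")

config = builder.create_builder_config()
# Allow FP16 kernels globally...
config.set_flag(trt.BuilderFlag.FP16)
# ...but force the builder to honor the per-layer precisions set below,
# rather than treating them as optional hints.
config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)

for i in range(network.num_layers):
    layer = network.get_layer(i)
    # Example policy (an assumption -- tune for your model): keep
    # softmax layers in FP32, let everything else run in FP16.
    if layer.type == trt.LayerType.SOFTMAX:
        layer.precision = trt.float32
        layer.set_output_type(0, trt.float32)
    else:
        layer.precision = trt.float16

engine_bytes = builder.build_serialized_network(network, config)
```

If you prefer not to write builder code, `trtexec` exposes the same mechanism on the command line via `--fp16` combined with `--layerPrecisions=<layerName>:fp32,...` and `--precisionConstraints=obey`. A practical workflow is to start with everything in FP16, compare per-layer outputs against the FP32 engine (e.g. with Polygraphy), and pin back to FP32 only the layers where the error is concentrated.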