Convert a float32 tflite model to int8


Is it possible to convert a float32 tflite model to a uint8 tflite model? I read this page https://www.tensorflow.org/lite/performance/post_training_quantization, but it only covers loading Keras or TensorFlow models, not .tflite files.

2 Answers

bsbatu

You cannot make changes to the architecture by loading a serialized tflite binary.

To quantize the model, you need the model file in one of the formats accepted by TFLiteConverter. You can check the following page for the accepted formats: https://www.tensorflow.org/api_docs/python/tf/lite/TFLiteConverter
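
For example, with the original model available as a SavedModel, a minimal post-training quantization sketch looks like this (the path is a placeholder; `tf.lite.Optimize.DEFAULT` without calibration data applies dynamic-range quantization):

```python
import tensorflow as tf

# Load from one of the formats TFLiteConverter accepts, e.g. a SavedModel
converter = tf.lite.TFLiteConverter.from_saved_model("path/to/saved_model")  # placeholder path
converter.optimizations = [tf.lite.Optimize.DEFAULT]  # dynamic-range quantization

tflite_model = converter.convert()
with open("model_quant.tflite", "wb") as f:
    f.write(tflite_model)
```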

mrtpk

It is possible. When we convert a TF model to TFLite float32, the weights are not quantized, so we can extract those weights and initialize a TF model with them. If the TFLite model is already quantized to int8 or uint8, it is a different story.
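
As a quick way to verify that the float32 weights are readable from the .tflite file, here is a minimal sketch using tf.lite.Interpreter (the file name is a placeholder):

```python
import tensorflow as tf

# Load the float32 .tflite file
interpreter = tf.lite.Interpreter(model_path="model_float32.tflite")  # placeholder
interpreter.allocate_tensors()

# Constant tensors hold the unquantized float32 weights
for detail in interpreter.get_tensor_details():
    try:
        weights = interpreter.get_tensor(detail["index"])
        print(detail["name"], weights.shape, weights.dtype)
    except ValueError:
        # Activation tensors have no data until inference runs, so skip them
        pass
```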

Start by converting the tflite float32 model back to the corresponding TensorFlow model. You can use this repo: https://github.com/PINTO0309/tflite2tensorflow
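
Its README documents a CLI roughly along these lines; treat the flags as a sketch and check the repo for the exact, version-specific invocation:

```
tflite2tensorflow \
  --model_path model_float32.tflite \
  --flatc_path ../flatc \
  --schema_path ../schema.fbs \
  --output_pb
```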

Then convert the float32 TensorFlow model to int8. Ideally you would use samples from the real dataset for the post-training quantization, but a dummy dataset with a similar data distribution also works for calibration. If you insist on converting to uint8, you will have to use an older TensorFlow release; current TensorFlow no longer supports uint8 conversion. A sketch of the int8 conversion follows below.
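
A minimal sketch of the int8 post-training quantization, assuming the model recovered in the previous step was exported as a SavedModel (the paths and the input shape are placeholders you would replace with your own):

```python
import numpy as np
import tensorflow as tf

def representative_dataset():
    # Calibration data: ideally real samples; here a dummy dataset
    # with a similar distribution (placeholder shape 1x224x224x3)
    for _ in range(100):
        yield [np.random.rand(1, 224, 224, 3).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_saved_model("recovered_saved_model")  # placeholder path
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# Force full-integer quantization of ops, inputs, and outputs
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8   # tf.uint8 only works in older TF releases
converter.inference_output_type = tf.int8

tflite_int8_model = converter.convert()
with open("model_int8.tflite", "wb") as f:
    f.write(tflite_int8_model)
```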