About INT8 and UINT8 #432
-
Excuse me. I think your quantized pruned YOLOv5 model is trained with UINT8 weights and INT32 biases.
Replies: 2 comments
-
Hello, has this problem been solved somehow?
-
Hi @hunterchenghx @rui-shen-afk, weight qconfig properties can be modified directly using `weight_config_kwargs`; however, for TensorRT support the current best pathway is to set `tensorrt: True` for any `QuantizationModifier` in your recipe.

sparseml/src/sparseml/pytorch/sparsification/quantization/modifier_quantization.py (Line 142 in d845052)

Hope this helps, let me know if you have any other questions.
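For reference, a recipe fragment enabling this might look like the sketch below. Only the `tensorrt: True` field comes from the discussion above; the other fields (`start_epoch`, `disable_quantization_observer_epoch`) are illustrative placeholders and should be checked against your SparseML version's `QuantizationModifier` docs:

```yaml
# Hedged sketch of a SparseML recipe snippet, not an exact config.
# `tensorrt: True` requests TensorRT-compatible quantization settings
# (e.g. symmetric INT8 weights) instead of the default UINT8 scheme.
modifiers:
    - !QuantizationModifier
        start_epoch: 0.0                          # illustrative value
        disable_quantization_observer_epoch: 2.0  # illustrative value
        tensorrt: True
```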