Description
Hi,
I’m trying to convert models from PyTorch → ONNX → TensorRT. Ideally, I would like to use INT8 and support dynamic input sizes.
I can create an INT8-calibrated engine if I use builder.build_cuda_engine(network), and I can add optimization profiles for dynamic input support if I use builder.build_engine(network, config) — but I can’t get both at once.
The latter option always seems to ignore the int8_calibrator, regardless of whether I set it on the builder or on the config object, and even if I remove the dynamic-shape optimization profiles entirely (see the code snippet below).
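To make the two paths concrete, here is roughly what I mean (simplified; my_calibrator and profile stand in for the real objects built in the full snippet further down):

# Path A: INT8 calibration works for me here, but no dynamic shapes
builder.int8_mode = True
builder.int8_calibrator = my_calibrator
engine = builder.build_cuda_engine(network)

# Path B: optimization profiles work here, but the calibrator seems ignored
config.int8_calibrator = my_calibrator
config.add_optimization_profile(profile)
engine = builder.build_engine(network, config)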
Please let me know if what I’m trying here is unsupported, or if there is another way to make this work.
Thanks!
Environment
TensorRT Version:
GPU Type: T4
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvcr.io/nvidia/pytorch:20.11-py3
Relevant Files
Steps To Reproduce
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_file_path, input_name, int8_calibrator=None, max_batch_size=1,
                 img_size=None, min_size=None, max_size=None):
    # initialize TensorRT engine and parse ONNX model
    with trt.Builder(TRT_LOGGER) as builder, builder.create_builder_config() as config:
        network_creation_flag = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
        network = builder.create_network(network_creation_flag)
        parser = trt.OnnxParser(network, TRT_LOGGER)

        # parse ONNX
        with open(onnx_file_path, 'rb') as model:
            print('Beginning ONNX file parsing')
            if not parser.parse(model.read()):
                for i in range(parser.num_errors):
                    print(parser.get_error(i))
                return None
        print('Completed parsing of ONNX file')

        # allow TensorRT to use up to 8GB of GPU memory for tactic selection
        config.max_workspace_size = 8 << 30

        # use FP16 mode if possible
        if builder.platform_has_fast_fp16:
            builder.fp16_mode = True
            print('USING FP16!!!')

        if int8_calibrator is not None:
            # tried setting the calibrator on both the builder and the config;
            # neither seems to trigger calibration when build_engine() is used
            builder.int8_mode = True
            config.int8_calibrator = int8_calibrator
            builder.int8_calibrator = int8_calibrator
            print('USING INT8!!!', builder.platform_has_fast_int8)

        # Dynamic input support - commented out for testing
        # (INT8 calibration is still not triggered even without these profiles)
        # if img_size is not None:  # dynamic
        #     opt_min, opt_max = min(img_size), max(img_size)
        #
        #     # landscape profile
        #     profile = builder.create_optimization_profile()
        #     profile.set_shape(input_name,
        #                       min=(1, 3, min_size, opt_max),
        #                       opt=(max_batch_size, 3, opt_min, opt_max),
        #                       max=(max_batch_size, 3, opt_max, opt_max))
        #     config.add_optimization_profile(profile)
        #
        #     # portrait profile
        #     profile = builder.create_optimization_profile()
        #     profile.set_shape(input_name,
        #                       min=(1, 3, opt_max, min_size),
        #                       opt=(max_batch_size, 3, opt_max, opt_min),
        #                       max=(max_batch_size, 3, opt_max, opt_max))
        #     config.add_optimization_profile(profile)

        # generate TensorRT engine optimized for the target platform
        print('Building an engine...')
        # engine = builder.build_cuda_engine(network)   # INT8 calibration works with this path
        engine = builder.build_engine(network, config)  # calibrator appears to be ignored here
        print("Completed creating Engine")
        return engine
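In case it matters, the calibrator I pass in as int8_calibrator follows the standard IInt8EntropyCalibrator2 pattern from the Python samples. This is a minimal sketch, not my exact code — the class name, batch data, and cache file name here are illustrative:

import numpy as np
import pycuda.driver as cuda
import pycuda.autoinit  # creates a CUDA context
import tensorrt as trt

class EntropyCalibrator(trt.IInt8EntropyCalibrator2):
    """Feeds pre-processed batches to TensorRT during INT8 calibration."""

    def __init__(self, batches, cache_file='calibration.cache'):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.batches = batches      # list of np.float32 arrays, shape (N, 3, H, W)
        self.index = 0
        self.cache_file = cache_file
        # single device buffer, sized for the largest batch
        self.device_input = cuda.mem_alloc(max(b.nbytes for b in batches))

    def get_batch_size(self):
        return self.batches[0].shape[0]

    def get_batch(self, names):
        if self.index >= len(self.batches):
            return None             # None tells TensorRT the calibration data is exhausted
        batch = np.ascontiguousarray(self.batches[self.index])
        cuda.memcpy_htod(self.device_input, batch)
        self.index += 1
        return [int(self.device_input)]

    def read_calibration_cache(self):
        try:
            with open(self.cache_file, 'rb') as f:
                return f.read()
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, 'wb') as f:
            f.write(cache)

# illustrative usage (paths and names are placeholders):
# calib = EntropyCalibrator(list_of_float32_nchw_batches)
# engine = build_engine('model.onnx', 'input', int8_calibrator=calib)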