buildCudaEngine failed on windows 10 with TensorRT-5.0.4.3

simon1228 · January 11, 2019, 6:50pm

Hi,

Win10
RTX 2080
nvidia driver version: 417.35
CUDA version: 10
CUDNN version: 7.3.1 or 7.4.2
Python version [3.6]
pytorch 1.0

I tried to import an ONNX model into TensorRT using the sample project "sampleONNXMNIST" that ships with the TensorRT-5.0.4.3 SDK. The ONNX model was trained in and exported from PyTorch 1.0. The model parses successfully with nvonnxparser, but buildCudaEngine fails. The error messages are:

ERROR: c:\p4sw\sw\gpgpu\MachineLearning\DIT\release\5.0\builder\cudnnBuilderUtils.cpp (255) - Cuda Error in nvinfer1::cudnn::findFastestTactic: 4
ERROR: c:\p4sw\sw\gpgpu\MachineLearning\DIT\release\5.0\engine\runtime.cpp (30) - Cuda Error in nvinfer1::`anonymous-namespace'::DefaultAllocator::free: 4

The code looks like this:

IBuilder* builder = createInferBuilder(gLogger);
nvinfer1::INetworkDefinition* network = builder->createNetwork();

// Parse the ONNX model into the network definition
auto parser = nvonnxparser::createParser(*network, gLogger);
if (!parser->parseFromFile(modelFile.c_str(), verbosity))
{
    std::string msg("failed to parse onnx file");
    gLogger.log(nvinfer1::ILogger::Severity::kERROR, msg.c_str());
    exit(EXIT_FAILURE);
}

// Build the engine
builder->setMaxBatchSize(maxBatchSize);
std::size_t x = builder->getMaxWorkspaceSize(); // read before the new size is set
builder->setMaxWorkspaceSize(3600_MB);
printf("%zu\n", x); // %zu matches std::size_t; %ld expects a 32-bit long on Windows
samplesCommon::enableDLA(builder, gUseDLACore);
ICudaEngine* engine = builder->buildCudaEngine(*network);
assert(engine);

I also tried a few different values for setMaxWorkspaceSize, still with no luck, although the error message can differ. I have attached the model file I used. Thanks.

simon1228 · January 11, 2019, 7:02pm

I always use maxBatchSize = 1.

NVES · January 11, 2019, 9:39pm

We are reviewing and will keep you updated.

simon1228 · January 15, 2019, 5:01pm

NVES · January 23, 2019, 5:35am

Hello,

Engineering has committed the fix for the next version of TensorRT. In the meantime, as a workaround, engineering recommends using cuDNN 7.3.0 from the cuDNN Archive on NVIDIA Developer.

simon1228 · January 23, 2019, 2:46pm

I tried cuDNN 7.3.0 and buildCudaEngine now succeeds. Thank you!
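Since the outcome depends on which cuDNN build is actually loaded, it may be worth confirming the version at runtime. cuDNN 7.x reports its version as a single packed integer (major * 1000 + minor * 100 + patch level), for example from cudnnGetVersion(). The sketch below only decodes such a value; the struct and function names are hypothetical, and the call into cuDNN itself is left out so the snippet compiles without the SDK.

```cpp
#include <cstddef>

// Hypothetical holder for a decoded cuDNN version.
struct CudnnVersionParts
{
    int majorVersion;
    int minorVersion;
    int patchLevel;
};

// Decodes the packed integer used by cuDNN 7.x:
// major * 1000 + minor * 100 + patch level (7.3.0 is reported as 7300).
// In a real program the argument would come from cudnnGetVersion().
constexpr CudnnVersionParts decodeCudnnVersion(std::size_t packed)
{
    return CudnnVersionParts{
        static_cast<int>(packed / 1000),
        static_cast<int>((packed % 1000) / 100),
        static_cast<int>(packed % 100)};
}
```

A decoded result of 7, 3, 0 would correspond to the workaround version mentioned above, while 7, 3, 1 or 7, 4, 2 would correspond to the versions listed in the original post.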

simon1228 · January 24, 2019, 7:04pm

Hi,
I did get some results from TensorRT; however, the results look different from, and worse than, the PyTorch inference results. Do you know whether anything in TensorRT, cuDNN, or ONNX could cause such a difference? Thanks again for your help.
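One way to make "different and worse" concrete is to run the same input through both frameworks and measure the largest element-wise gap between the two output tensors. This does not identify the cause, but it gives a number to track while changing one component at a time. The sketch below assumes both outputs have already been copied into host-side float vectors; the function name is hypothetical.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Returns the largest element-wise absolute difference between two outputs,
// or -1.0 if the vectors have different lengths and cannot be compared.
inline double maxAbsDifference(const std::vector<float>& reference,
                               const std::vector<float>& candidate)
{
    if (reference.size() != candidate.size())
    {
        return -1.0;
    }
    double worst = 0.0;
    for (std::size_t i = 0; i < reference.size(); ++i)
    {
        const double diff =
            std::fabs(static_cast<double>(reference[i]) - static_cast<double>(candidate[i]));
        worst = std::max(worst, diff);
    }
    return worst;
}
```

Here `reference` would hold the PyTorch output and `candidate` the TensorRT output for the same input. A gap near float rounding error suggests the engine matches, while a large gap points to a real mismatch such as preprocessing, layer export, or the library versions in use.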
