TAO 5.0.0. TF1 Container fail to run tao model yolo_v4 dataset_convert command

pramaniksandip · October 2, 2023, 2:18pm

Please provide the following information when requesting support.

• Hardware (T4/V100/Xavier/Nano/etc) A100
• Network Type SSD

I am using nvcr.io/nvidia/tao/tao-toolkit:5.0.0-tf1.15.5 container. The container is running successful in Kubernetes environment in DGX A100 in a GPU.

But following command though error.
root@tao-f865859d9-6scz9:/workspace# tao model yolo_v4 dataset_convert -d /workspace/nemo/data/nvidia-tao/tensorflow/yolo_v4/specs/yolo_v4_tfrecords_kitti_train.txt -o /workspace/nemo/data/kitti_data/train/tfrecords/train
~/.tao_mounts.json wasn’t found. Falling back to obtain mount points and docker configs from ~/.tao_mounts.json.
Please note that this will be deprecated going forward.
Traceback (most recent call last):
File “/usr/local/bin/tao”, line 8, in
sys.exit(main())
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_cli/entrypoint/tao_launcher.py”, line 134, in main
instance.launch_command(
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_cli/components/instance_handler/local_instance.py”, line 356, in launch_command
docker_logged_in(required_registry=task_map[task].docker_registry)
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_cli/components/instance_handler/utils.py”, line 151, in docker_logged_in
data = load_config_file(docker_config)
File “/usr/local/lib/python3.8/dist-packages/nvidia_tao_cli/components/instance_handler/utils.py”, line 84, in load_config_file
assert os.path.exists(config_path), (
AssertionError: Config path must be a valid unix path. No file found at: /root/.docker/config.json. Did you run docker login?
root@tao-f865859d9-6scz9:/workspace#

List of tao packages inside the POD
root@tao-f865859d9-6scz9:/workspace# pip list|grep tao
nvidia-eff-tao-encryption 0.1.7
nvidia-tao 5.0.0
nvidia-tao-tf1 5.0.0.151.dev0
root@tao-f865859d9-6scz9:/workspace#

Morganh · October 3, 2023, 4:04am

How did you trigger this container? Could you share the full command?

pramaniksandip · October 3, 2023, 3:09pm

@Morganh

Please find the command to trigger container,

apiVersion: apps/v1
kind: Deployment
metadata:
name: tao
spec:
selector:
matchLabels:
run: tao
replicas: 1
template:
metadata:
labels:
run: tao
spec:
nodeName: dgxa100
containers:
- name: tao
image: nvcr.io/nvidia/tao/tao-toolkit:5.0.0-tf1.15.5
command: [“/bin/sh”, “-c”]
args:
- jupyter lab --allow-root --ip=0.0.0.0 --NotebookApp.token=‘tao’;
resources:
limits:
nvidia.com/${profile}: 1
volumeMounts:
- name: tao-data
mountPath: /workspace/nemo/data
ports:
- containerPort: 8888
volumes:
- name: tao-data
hostPath:
path: /home/tao
type: Directory

Morganh · October 5, 2023, 3:50pm

There is no update from you for a period, assuming this is not an issue anymore. Hence we are closing this topic. If need further support, please open a new one. Thanks

You already run inside the tao docker.
So, please run without tao model.
For example,
root@tao-f865859d9-6scz9:/workspace# yolo_v4 dataset_convert xxx

system · October 24, 2023, 2:42am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Error in Generating TFrecords for yolov4 TAO Toolkit	38	1491	May 17, 2022
An error occurred while preparing the data set using TAO TAO Toolkit	14	1497	October 19, 2021
AssertionError: Config path must be a valid unix path. TAO TOOLKIT TFRECORD CONVERT TAO Toolkit docker	8	954	February 20, 2023
Dataset_convert error in tao TAO Toolkit	3	486	September 16, 2022
Problem with tlt file mounting TAO Toolkit	29	2564	January 6, 2022
About tao_mounts.json and docker container stop in traning cell TAO Toolkit	7	968	July 6, 2022
Issue with tlt.components.docker_handler.docker_handler: Stopping container TAO Toolkit	17	1264	July 4, 2022
Tao toolkit container not installing TAO Toolkit	27	3260	June 20, 2022
Tao yolov3 custom dataset training, container issue TAO Toolkit inception	3	645	July 21, 2022
Dataset_convert tool is running properly but the TFrecords aren't getting created in output folder TAO Toolkit	32	1836	May 10, 2022

TAO 5.0.0. TF1 Container fail to run tao model yolo_v4 dataset_convert command

Related topics