GPUs

Deploying Hugging Face Models Using Triton Inference Server

You may need to set LD_LIBRARY_PATH so the dynamic linker can find the NVIDIA driver libraries:

export LD_LIBRARY_PATH=/usr/local/nvidia/lib64
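
The export above overwrites whatever LD_LIBRARY_PATH already held. A minimal sketch of a gentler variant follows, assuming a POSIX shell. The helper name add_lib_dir is hypothetical (not part of any NVIDIA or Triton tooling); it prepends the directory only if it exists and is not already listed.

```shell
# Hypothetical helper: prepend a directory to LD_LIBRARY_PATH
# only if it exists and is not already on the path.
add_lib_dir() {
    dir="$1"
    # Skip directories that are not present on this machine.
    [ -d "$dir" ] || return 1
    case ":${LD_LIBRARY_PATH:-}:" in
        *":$dir:"*)
            # Already listed; leave the value unchanged.
            ;;
        *)
            # Prepend, adding a colon only when there is an existing value.
            LD_LIBRARY_PATH="$dir${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
            ;;
    esac
    export LD_LIBRARY_PATH
}

# Same directory as in the note above.
add_lib_dir /usr/local/nvidia/lib64 \
    || echo "directory not found; LD_LIBRARY_PATH unchanged" >&2
```

Calling it twice with the same directory leaves the variable unchanged the second time, so it should be safe to put in a shell startup file.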

I think this is one of the samples in the CUDA code link, and that it can be used as a smoke test for GPU accessibility.
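
Separately from that CUDA sample, a quicker check might be to ask the driver directly. This is only a sketch, assuming nvidia-smi is installed and on PATH; the function name gpu_smoke_test and its return codes are my own choices, not from the linked material. It uses nvidia-smi -L, which lists the GPUs the driver can see.

```shell
# Hypothetical smoke test: report whether the NVIDIA driver can see any GPU.
# Return codes (my own convention): 0 = GPU visible, 1 = none accessible,
# 2 = nvidia-smi not installed or not on PATH.
gpu_smoke_test() {
    if ! command -v nvidia-smi >/dev/null 2>&1; then
        echo "nvidia-smi not found on PATH"
        return 2
    fi
    # nvidia-smi -L prints one line per GPU it can see.
    if gpus=$(nvidia-smi -L 2>/dev/null) && [ -n "$gpus" ]; then
        echo "GPU visible:"
        echo "$gpus"
        return 0
    fi
    echo "nvidia-smi ran but no GPU is accessible"
    return 1
}

# Run the check without aborting the calling script on failure.
gpu_smoke_test || true
```

This only shows that the driver sees a device. It does not confirm that a CUDA program can actually run on it, which is what a compiled sample would exercise.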