What about for commercial cards? :) Specifically vgpu. You wouldn't happen to know where/how to get the toolkit? I'll have to check my driver packages but I don't think it's included with the grid drivers. Hopefully it doesn't require a consumer driver package.
Discussion
Should work with any NVIDIA card with Cuda drivers. When you say vGPU, what type of setup?
I built and tested it using the 3070 I have at my desk. I’ll go test it on an H200 now and report back.
Confirmed working on an H200. Getting 170mm keys/sec 🔥

Thanks for following up. With vGPU it's nvidia grid virtual gpu. So it's a pair of tesla p100s with a "custom" gpu profiles into virtual machines. If you're unfamiliar it requires nvidia proprietary driver distributions to work. All drivers and guest software (usermode executable etc) come packaged together. and I don't see the toolkit bundled with it.
That said I also am unfamiliar with the toolkit on linux. Not sure if it's available through standard dnf distribution channels. Specifically rhel.
Not totally sure about RHEL but I bet we can get it figured out.
When you run nvcc —version does it fail?
RHEL packages appear to be publicly available without any sort of restrictions
https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo
nvcc is not available on the system. Yeah the other issue is that I'm forced to use a really old driver and older kernel. So the target machine specifically is running rocky 9 with kernel 6.14 on driver 550.163 with cuda 12.4. It looks like the toolkits available from those links you shared require cuda 13 or newer. I think I have to find an old package.
Found this which sounds like it might match your setup
Thanks for sharing those links, it means I just need to look harder! I've had better luck using the .run self contained files than the rpm packages (I don't think nvidia actually tests them XD)
You bet, not a problem. If you get them installed all you would need to do is update the makefile to CCAP=60 and set NOSTR_BLOCKS_PER_GRID = 1120 in GPURummage.h