!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

286 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda57 Servers

You have reached the beginning of time (for this room).


SenderMessageTime
19 Nov 2025
@jfly:matrix.orgJeremy Fleischman (jfly)

Do the systemd NixOS containers provide their own copy of NVIDIA's driver? If not, they wouldn't have libcuda.so available.

afaik, they do not automatically do anything (please correct me if i'm wrong). i making them get their own libcuda.so by explicitly configuring them with hardware.graphics.enable = true; and hardware.graphics.extraPackages.

mounting the cuda runtime from the host makes sense, though! thanks for the link to this nvidia-container-toolkit

18:39:03
@lt1379:matrix.orgLun What's the current best practice / future plans for impure GPU tests? Is the discussion in https://github.com/NixOS/nixpkgs/issues/225912 up to date? cc SomeoneSerge (back on matrix) 18:43:23
@ss:someonex.netSomeoneSerge (back on matrix)

Do the systemd NixOS containers provide their own copy of NVIDIA's driver? If not, they wouldn't have libcuda.so available.

They don't (unless forced). Libcuda and its closure are mounted from the host.

20:10:33
@ss:someonex.netSomeoneSerge (back on matrix) The issue is maybe growing stale, but I'd say there haven't been any fundamental updates.
One bit it doesn't mention is that we rewrote most of the tests in terms of a single primitive, cudaPackages.writeGpuTestPython (can be overridden for e.g. rocm; could be moved outside cuda-modules).
It's now also clear that the VM tests can also be done, we'd just have to use a separate marker to signal that a builder exposes an nvidia device with a vfio driver.
If we replace the sandboxing mechanism (e.g. with microvms) it'll get trickier... but again, a low-bandwidth baseline with vfio is definitely achievable.
And there's still the issue of describing constraints, like listing the architectures or like memory quotas: we need a pluggable mechanism for assessing which builders are compatible with the derivation?
20:37:12
@ss:someonex.netSomeoneSerge (back on matrix) *

The issue is maybe growing stale, but I'd say there haven't been any fundamental updates.

  • One bit it doesn't mention is that we rewrote most of the tests in terms of a single primitive, cudaPackages.writeGpuTestPython (can be overridden for e.g. rocm; could be moved outside cuda-modules).
  • It's now also clear that the VM tests can also be done, we'd just have to use a separate marker to signal that a builder exposes an nvidia device with a vfio driver.
  • If we replace the sandboxing mechanism (e.g. with microvms) it'll get trickier... but again, a low-bandwidth baseline with vfio is definitely achievable.
  • And there's still the issue of describing constraints, like listing the architectures or like memory quotas: we need a pluggable mechanism for assessing which builders are compatible with the derivation? Maybe a proxy instead...
20:37:53
@ss:someonex.netSomeoneSerge (back on matrix) Also note that we still mount libcuda from /run/current-system instead of /run/booted-system... 20:39:08
@jfly:matrix.orgJeremy Fleischman (jfly) Ah that sort of sounds like a bug since we'd want to be compatible with the host kernel? 21:28:58
@apyh:matrix.orgapyhyeah, current system means that updating nvidia drivers with a rebuild switch breaks all CUDA until a reboot21:34:12
@apyh:matrix.orgapyh(experience this semi-frequently)21:34:20
20 Nov 2025
@user12592851:matrix.orgJohn joined the room.05:54:29
@ser:sergevictor.euser(ial)i have a Debian host with nvidia gpu which runs incus and in incus i have nixos containers. how can i utilise cuda programs in such container?10:24:20
@plan9better:matrix.orgplan9better joined the room.12:41:04
@ss:someonex.netSomeoneSerge (back on matrix)Hi. How do you use cuda in a non-NixOS container with Incus? Does it use CDI?13:22:58
@ser:sergevictor.euser(ial)with debian container i use built-in incus "nvidia.runtime" which passes the host NVIDIA and CUDA runtime libraries into the instance13:30:32
@ser:sergevictor.euser(ial)but nixos naturally does not seek for these libraries in that place13:31:15
@ser:sergevictor.euser(ial)does it mean that i need full libraries in nixos container which are with identical version as on debian host?13:32:26
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8) GaƩtan Lepage: I've got to package ONNX/ONNX Runtime/ONNX TensorRT for C++; if I upstream the PR do you think you'd have the bandwidth to look at it? I'd likely follow what I did here: https://github.com/ConnorBaker/cuda-packages/tree/8a317116a07717b13e0608f47b78bd6d75f8bb99/pkgs/development/libraries
That is, the sort of cursed double-build in a single derivation which produces both the C++ binaries and a python wheel, so the python3Packages entry essentially turns into installing a wheel.
14:04:07
@keiichi:matrix.orgtetoare there differences between https://nix-community.cachix.org and https://cache.nixos-cuda.org . My goal is to gain access to cuda-enable packages for unstable14:24:20
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)community cache is no longer being populated, use the latter14:27:28

Show newer messages


Back to Room ListRoom Version: 9