!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

288 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda56 Servers

Load older messages


SenderMessageTime
8 Dec 2024
@kaya:catnip.eekaya 𖀐 * Pretty sure the biggest difference between the flake and my upstreaming attempt is that in the flake i have allowUnfree and cudaSupport as true, but those options should carry over (i assume) as i also have those enabled on my nixos config and im building with --impure16:50:40
9 Dec 2024
@ss:someonex.netSomeoneSerge (back on matrix)

For some reason its complaining about CUDA_HOME being missing even though im specifying it which im kind of confused

It might just want some more components than just nvcc and cudart. Also off the top of my head not sure which outputs are propagated into the symlinkJoin

00:11:43
@ss:someonex.netSomeoneSerge (back on matrix)Could you publish the full logs?00:11:56
@hexa:lossy.networkhexa (UTC+1)is there a relationship between cuda and the open nvidia kmod?03:40:17
@hexa:lossy.networkhexa (UTC+1)because my cuda things stopped working some time after migrating to 24.1103:40:27
@hexa:lossy.networkhexa (UTC+1)though nvidia-smi is working03:40:47
@hexa:lossy.networkhexa (UTC+1)but ollama and wyoming-faster-whisper can't init cuda03:41:08
@hexa:lossy.networkhexa (UTC+1)will try to drop hardening next, as usual πŸ˜„ 03:42:19
@hexa:lossy.networkhexa (UTC+1)ok, DevicePolicy related πŸ™‚ 03:46:59
@hexa:lossy.networkhexa (UTC+1)ok, apparently not15:44:08
@hexa:lossy.networkhexa (UTC+1)nvidia_uvm doesn't get loaded at boot anymore15:44:25
@hexa:lossy.networkhexa (UTC+1) * it seems like nvidia_uvm doesn't get loaded at boot anymore 15:44:31
@ss:someonex.netSomeoneSerge (back on matrix)https://github.com/NixOS/nixpkgs/issues/33418015:52:57
@hexa:lossy.networkhexa (UTC+1)
fbdcdde Kiskae             2024-05-22 13:46 +0200 308β”‚             # Don't add `nvidia-uvm` to `kernelModules`, because we want
fbdcdde Kiskae             2024-05-22 13:46 +0200 309β”‚             # `nvidia-uvm` be loaded only after `udev` rules for `nvidia` kernel
fbdcdde Kiskae             2024-05-22 13:46 +0200 310β”‚             # module are applied.
fbdcdde Kiskae             2024-05-22 13:46 +0200 311β”‚             #
fbdcdde Kiskae             2024-05-22 13:46 +0200 312β”‚             # Instead, we use `softdep` to lazily load `nvidia-uvm` kernel module
fbdcdde Kiskae             2024-05-22 13:46 +0200 313β”‚             # after `nvidia` kernel module is loaded and `udev` rules are applied.
fbdcdde Kiskae             2024-05-22 13:46 +0200 314β”‚             extraModprobeConfig = ''
fbdcdde Kiskae             2024-05-22 13:46 +0200 315β”‚               softdep nvidia post: nvidia-uvm
fbdcdde Kiskae             2024-05-22 13:46 +0200 316β”‚             '';
16:03:03
@ss:someonex.netSomeoneSerge (back on matrix)Yeah, somehow softdep breaks with the open driver?16:03:59
@hexa:lossy.networkhexa (UTC+1)nope, reformat16:03:59
@hexa:lossy.networkhexa (UTC+1)ok, has been there since 2023.1116:04:16
@hexa:lossy.networkhexa (UTC+1)and yeah, I'm on the open driver16:04:19
@ss:someonex.netSomeoneSerge (back on matrix)https://github.com/NixOS/nixpkgs/issues/334180#issuecomment-228451881616:04:30
@ss:someonex.netSomeoneSerge (back on matrix)atry16:04:31
@ss:someonex.netSomeoneSerge (back on matrix) * Atry16:04:33
@hexa:lossy.networkhexa (UTC+1)we should probably deduplicate all issues to this one16:05:04
11 Dec 2024
@magic_rb:matrix.redalder.org@magic_rb:matrix.redalder.org joined the room.00:50:41
@magic_rb:matrix.redalder.org@magic_rb:matrix.redalder.org

cross post from #dev:nixos.org

anyone touch the nvidia driver code? packaging i mean. The long standing bug of "use xrandr twice and you get a segfault in X11" doesn't happen to me anymore apparently

00:51:08
@hexa:lossy.networkhexa (UTC+1)

copying path '/nix/store/62vk99s9kdcjj4x64wcw22a7rwbfnm36-python3.12-onnxruntime-1.20.1' from 'ssh://hexa@build2.darmstadt.ccc.de'

20:11:21
@hexa:lossy.networkhexa (UTC+1)πŸ₯³20:11:23
@hexa:lossy.networkhexa (UTC+1)now if only the cuda build was working20:11:35
@hexa:lossy.networkhexa (UTC+1)https://github.com/microsoft/onnxruntime/issues/22855#issue-266288204720:58:10
@hexa:lossy.networkhexa (UTC+1)ah, very coool 🫠20:58:14
12 Dec 2024
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8) Ah right that thing
I was told CMake is supposed to understand it’s a header-only library and not try to actually link against a shared object file, so not sure why it’s doing exactly that and causing the build to fail
01:48:43

Show newer messages


Back to Room ListRoom Version: 9