!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

310 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
60 Servers



3 Mar 2025
@little_dude:matrix.org little_dude *

Hello, I'm having trouble getting CUDA working on NixOS. I made a post on Discourse but I thought I'd ask here since it's specific to CUDA: https://discourse.nixos.org/t/ollama-cuda-driver-library-init-failure-3/61068/2

In short, I installed ollama, but ollama reports:

Unable to load cudart library /nix/store/lgmvgx3r1pbpd40crz2nnliakfxh19f8-nvidia-x11-570.124.04-6.12.17/lib/libcuda.so.570.124.04: cuda driver library init failure: 3

I guess error 3 corresponds to the cudaErrorInitializationError described here but that doesn't help me much.

I think my issue is that I have CUDA 12.4 installed, when my GPU supports CUDA 12.8? Although I'm not certain what the CUDA version reported by nvidia-smi really means (whether it is the only version supported by my GPU, or whether it's the maximum version). Would you have an idea about where to go from here?

12:50:49
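One way to narrow down the "init failure: 3" message is to call the driver's `cuInit` directly and look at the raw return code. The sketch below is only an illustration, not something from the thread. It assumes the number in Ollama's message is a CUDA driver API `CUresult` (where 3 is `CUDA_ERROR_NOT_INITIALIZED`) rather than a runtime API error, and that `libcuda.so.1` can be found by the loader (on NixOS that may require passing a full path, for example under `/run/opengl-driver/lib`). The helper names `describe_cu_result` and `try_cu_init` are made up for this example.

```python
import ctypes

# A small subset of CUresult values from the CUDA driver API (cuda.h).
CU_RESULT_NAMES = {
    0: "CUDA_SUCCESS",
    1: "CUDA_ERROR_INVALID_VALUE",
    2: "CUDA_ERROR_OUT_OF_MEMORY",
    3: "CUDA_ERROR_NOT_INITIALIZED",
    100: "CUDA_ERROR_NO_DEVICE",
    999: "CUDA_ERROR_UNKNOWN",
}


def describe_cu_result(code):
    """Map a CUresult integer to its symbolic name (hypothetical helper)."""
    return CU_RESULT_NAMES.get(code, f"unrecognised CUresult {code}")


def try_cu_init(library="libcuda.so.1"):
    """Load the driver library and call cuInit(0).

    Returns the integer CUresult, or None if the library cannot be loaded.
    The default library name is an assumption; pass a full path if needed.
    """
    try:
        libcuda = ctypes.CDLL(library)
    except OSError:
        return None
    libcuda.cuInit.argtypes = [ctypes.c_uint]
    libcuda.cuInit.restype = ctypes.c_int
    return libcuda.cuInit(0)


if __name__ == "__main__":
    code = try_cu_init()
    if code is None:
        print("could not load libcuda.so.1")
    else:
        print(code, describe_cu_result(code))
```

If this small script reports the same code as Ollama, the problem is probably in the driver setup rather than in Ollama itself.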
@little_dude:matrix.org little_dude Also I hope it's ok to cross-post like this. Apologies if that seems pushy. 12:51:36
@little_dude:matrix.org little_dude * I also thought the error might be because I'm using the GPU to run Wayland, but I assume that just like CPUs, GPUs can run multiple workloads? Or does CUDA need to have exclusive access to the GPU? (I know these are very naive questions, I just never dealt with GPUs before) 12:56:23
@ruroruro:matrix.org ruro connor (he/him) (UTC-8): I just noticed that pkgs/development/libraries/science/math/tensorrt/extension.nix is a thing. At first glance, this code seems dead to me (or at least I wasn't able to find a place where it is called from)? It seems that nowadays all of the TensorRT-related code lives in pkgs/development/cuda-modules. The last commit (excluding automated reformatting) that touched pkgs/development/libraries/science/math/tensorrt seems to be 8e800cedaf24f5ad9717463b809b0beef7677000 authored by you in 2023. That commit also removed pkgs/development/libraries/science/math/tensorrt/generic.nix. So I am guessing that you forgot to also delete the extension.nix? 13:43:50
@connorbaker:matrix.org connor (he/him) ruro: yes, seems likely :l 16:39:14
@connorbaker:matrix.org connor (he/him)

little_dude: it's fine to cross-post!

Sorry it's not working, my only suggestion would be to try running it with whatever flags Ollama needs to enable debugging and/or LD_DEBUG=libs to make sure it's finding and loading the correct libraries.

The version difference between the CUDA driver version and the CUDA library version is fine -- it just means you can run CUDA libraries up to and including 12.8.

The GPU definitely supports multiple workloads, so that shouldn't be a problem either.

I'm strapped for time so I probably won't be able to help debug or troubleshoot, but I think some other people in here use ollama, so they might be able to chime in.

16:46:44
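The `LD_DEBUG=libs` suggestion above can be scripted so that only the CUDA-related loader lines are shown. This is a rough sketch under a few assumptions: the program is dynamically linked against glibc (whose loader writes `LD_DEBUG` output to stderr by default), and the interesting libraries have "libcuda" in their name. The function names are invented for this example, and the command to run is whatever you would normally launch.

```python
import os
import subprocess


def cuda_loader_lines(loader_output, needles=("libcuda",)):
    """Keep only the lines that mention a CUDA library.

    "libcuda" is a prefix of "libcudart", so the default needle
    matches both the driver and the runtime library.
    """
    return [
        line
        for line in loader_output.splitlines()
        if any(needle in line for needle in needles)
    ]


def run_with_ld_debug(command):
    """Run `command` (a list of arguments) with LD_DEBUG=libs set.

    Returns the filtered loader lines captured from stderr.
    """
    env = dict(os.environ, LD_DEBUG="libs")
    result = subprocess.run(
        command,
        env=env,
        capture_output=True,
        text=True,
        errors="replace",  # tolerate non-UTF-8 bytes in the program's output
    )
    return cuda_loader_lines(result.stderr)
```

For example, `run_with_ld_debug(["ollama", "serve"])` would return the loader's search and load messages for the CUDA libraries, which shows which store paths they were actually picked up from. Note that a long-running command like that one only returns once it exits.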
4 Mar 2025
@stick:matrix.org stick I have prepared a cudaPackages_12 update from 12.4 to 12.8 here: https://github.com/NixOS/nixpkgs/pull/386983. Can you have a look? I also included a nixpkgs-review result (229 marked as broken / 219 failed to build / 2455 packages built), but I am having a hard time figuring out which build failures are new and which were already happening before. Can you advise on the best way to proceed? Please comment on GitHub, I am not always following the discussion here. 10:48:13
@stick:matrix.org stick * An ideal thing for me would be if someone indicated the list of packages that really need to have their build fixed before the merge happens, and I will (try to) work on fixing these. 10:53:33
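To separate new build failures from ones that already existed, one option is to collect the failed package names from two runs (one on the base branch, one on the PR) and compare them as sets. The sketch below assumes you already have those two lists. It does not parse nixpkgs-review output, and the package names in the usage example are placeholders.

```python
def split_failures(baseline_failed, pr_failed):
    """Classify failures by comparing a baseline run with a PR run.

    Both arguments are iterables of package attribute names that
    failed to build in the respective run.
    """
    baseline = set(baseline_failed)
    pr = set(pr_failed)
    return {
        "new": sorted(pr - baseline),           # fail only with the PR applied
        "pre_existing": sorted(pr & baseline),  # already failing before the PR
        "fixed": sorted(baseline - pr),         # failing before, building now
    }


# Placeholder names, for illustration only.
report = split_failures(
    baseline_failed=["pkgA", "pkgB"],
    pr_failed=["pkgB", "pkgC"],
)
print(report["new"])  # ['pkgC']
```

Only the "new" bucket would need attention before the merge; the "pre_existing" bucket can be tracked separately.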
@ss:someonex.net SomeoneSerge (matrix works sometimes) In addition to Connor's suggestions, can you check what the output is when you run cudaPackages.saxpy? 11:26:55
