NixOS CUDA - Public Room Timeline - Matrix Static

	NixOS CUDA	310 Members
	CUDA packages maintenance and support in nixpkgs \| https://github.com/orgs/NixOS/projects/27/ \| https://nixos.org/manual/nixpkgs/unstable/#cuda	60 Servers


Sender	Message	Time
3 Mar 2025
little_dude	* Hello, I'm having trouble getting CUDA working on NixOS. I made a post on Discourse but I thought I'd ask here since it's specific to CUDA: https://discourse.nixos.org/t/ollama-cuda-driver-library-init-failure-3/61068/2 In short, I installed ollama, but ollama reports: Unable to load cudart library /nix/store/lgmvgx3r1pbpd40crz2nnliakfxh19f8-nvidia-x11-570.124.04-6.12.17/lib/libcuda.so.570.124.04: cuda driver library init failure: 3. I guess error 3 corresponds to the `cudaErrorInitializationError` described here but that doesn't help me much. I think my issue is that I have CUDA 12.4 installed, when my GPU supports CUDA 12.8? Although I'm not certain what the CUDA version reported by `nvidia-smi` really means (whether it is the only version supported by my GPU, or whether it's the maximum version). Would you have an idea about where to go from here?	12:50:49
little_dude	Also I hope it's ok to cross-post like this. Apologies if that seems pushy.	12:51:36
little_dude	* I also thought the error might be because I'm using the GPU to run Wayland, but I assume that just like CPUs, GPUs can run multiple workloads? Or does CUDA need to have exclusive access to the GPU? (I know these are very naive questions, I just never dealt with GPUs before)	12:56:23
ruro	connor (he/him) (UTC-8): I just noticed that `pkgs/development/libraries/science/math/tensorrt/extension.nix` is a thing. At first glance, this code seems dead to me (or at least I wasn't able to find a place where it is called from)? It seems that nowadays all of the TensorRT-related code lives in `pkgs/development/cuda-modules`. The last commit (excluding automated reformatting) that touched `pkgs/development/libraries/science/math/tensorrt` seems to be `8e800cedaf24f5ad9717463b809b0beef7677000` authored by you in 2023. That commit also removed `pkgs/development/libraries/science/math/tensorrt/generic.nix`. So I am guessing that you forgot to also delete the `extension.nix`?	13:43:50
connor (he/him)	ruro: yes, seems likely :l	16:39:14
connor (he/him)	little_dude: it's fine to cross-post! Sorry it's not working, my only suggestion would be to try running it with whatever flags Ollama needs to enable debugging and/or `LD_DEBUG=libs` to make sure it's finding and loading the correct libraries. The version difference across CUDA driver version and CUDA library version is fine -- just means you can run CUDA libraries using up to and including 12.8. The GPU definitely supports multiple workloads, so that shouldn't be a problem either. I'm strapped for time so I probably won't be able to help debug or troubleshoot, but I think some other people in here use ollama, so they might be able to chime in.	16:46:44
4 Mar 2025
stick	I have prepared a cudaPackages_12 update from 12.4 to 12.8 (https://github.com/NixOS/nixpkgs/pull/386983). Can you have a look? I also included a nixpkgs-review result: 229 marked as broken / 219 failed to build / 2455 packages built. I am having a hard time figuring out which build failures are new and which were happening even before. Can you advise on the best way to proceed? Please comment on GitHub, I am not always following the discussion here.	10:48:13
stick	* an ideal thing for me would be if someone indicated the list of packages that really need to have the build fixed before the merge happens and I will (try to) work on fixing these	10:53:33
SomeoneSerge (matrix works sometimes)	In addition to Connor's suggestions, can you check what is the output when you run `cudaPackages.saxpy`?	11:26:55
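
As a complement to the debugging suggestions in this thread, here is a minimal sketch for checking whether the CUDA driver library loads and initializes at all, independently of Ollama. It calls the driver API's `cuInit` through `ctypes`. The function name `probe_cuda_driver` and the default library name `libcuda.so.1` are assumptions made for illustration; on NixOS the driver library may need to be given by full path instead.

```python
import ctypes


def probe_cuda_driver(library="libcuda.so.1"):
    """Try to load the CUDA driver library and call cuInit(0).

    Returns a (stage, code) tuple:
      ("load", None)   the library could not be opened
      ("symbol", None) the library has no cuInit symbol
      ("init", code)   cuInit ran; code 0 means success
    The helper name and default library name are illustrative assumptions.
    """
    try:
        lib = ctypes.CDLL(library)
    except OSError:
        # The dynamic loader could not find or open the library.
        return ("load", None)

    try:
        cu_init = lib.cuInit
    except AttributeError:
        # The file loaded but does not export cuInit.
        return ("symbol", None)

    # CUresult cuInit(unsigned int Flags); Flags must be 0.
    cu_init.argtypes = [ctypes.c_uint]
    cu_init.restype = ctypes.c_int
    return ("init", cu_init(0))
```

Calling `probe_cuda_driver()` and printing the result shows whether the failure happens while loading the library or while initializing the driver. A non-zero code in the `"init"` case would suggest the problem lies with the driver setup rather than with Ollama itself.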
