NixOS CUDA | 316 Members | |
| CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda | 63 Servers |
| Sender | Message | Time |
|---|---|---|
| 7 May 2024 | ||
Hello, I was wondering whether cudaPackages.cudatoolkit with Nix would allow me to use multiple versions of CUDA on my machine, either with NixOS or by just using the Nix package manager. | 02:31:16 | |
| yklcs: Cuda can be tricky but I've had good luck using nix shell and specific versions of cudatoolkit. | 04:58:33 | |
In reply to @brandon:matrix.radiation.ioThanks. Do you have any .nix files to share? | 05:38:34 | |
| https://nixos.wiki/wiki/CUDA yklcs | 10:06:58 | |
Hey, I tried switching from virtualisation.docker.enableNvidia = true; to the more recent virtualisation.containers.cdi.dynamic.nvidia.enable = true;, hardware.nvidia-container-toolkit.enable = true; and features.cdi = true;. I'm using Docker daemon and client at v25, and since switching to the new configuration options, I see the following when trying to start containers with GPUs:``Searching online for a little, most of the people running into that issue didn't install the CTK properly. However, that shouldn't be the case with the options mentioned above, or am I wrong? Does anyone of you have another idea? | 14:57:05 | |
Hey, I tried switching from virtualisation.docker.enableNvidia = true; to the more recent virtualisation.containers.cdi.dynamic.nvidia.enable = true;, hardware.nvidia-container-toolkit.enable = true; and features.cdi = true;. I'm using Docker daemon and client at v25, and since switching to the new configuration options, I see the following when trying to start containers with GPUs: | 14:57:12 | |
In reply to @msanft:matrix.orgCan you try my suggestion above? | 15:24:31 | |
In reply to @trexd:matrix.orgThis one Moritz Sanft | 15:24:43 | |
| Ohh, that seems helpful! Will try! | 15:25:25 | |
| That works. Thank you! | 15:34:30 | |
| 8 May 2024 | ||
| 00:55:57 | ||
| 00:56:09 | ||
| I am re-emerging from the exhaustion surrounding travel and interviews; will be hammering the PR I have open into shape tomorrow; hopefully ready for review and merge soon so we can get new releases of CUDA, CUDN , etc. | 03:02:17 | |
| 12:47:16 | ||
| Urhgh SomeoneSerge (Way down Hadestown) what are your thoughts on having something akin to | 21:20:31 | |
The top-level pkgs is supposed to be that, but I guess we fail. Well, we definitely do because tensorflow | 21:23:13 | |
| 9 May 2024 | ||
| 17:11:24 | ||
| 10 May 2024 | ||
| is it too complicated maintaining the closure with packages for each component? i.e. would a single input simplify? | 14:31:54 | |
In reply to @justbrowsing:matrix.orgIs this in reference to the above or about the redistributable packaging in general? | 14:47:33 | |
| why not both? | 15:13:10 | |
| (more the latter) | 15:13:57 | |
| 15:23:52 | ||
Ah for the latter the trouble is mostly around Nixpkgs expecting certain outputs to behave in certain ways (like dev including a dependency on out) and us using the outputs as components rather than full outputs | 16:00:12 | |
| For the former the issue is mostly around different packages in the global scope requiring different versions of CUDA (like PyTorch and Tensorflow use different versions of CUDA) | 16:00:48 | |
| 12 May 2024 | ||
| staging-next has been merged to master a few minutes ago. Looks like most of the CUDA stuff is broken... | 16:26:19 | |
| 16:30:23 | ||
| Uh oh | 19:03:43 | |
| Gaétan Lepage: can you send me a few reproducers? I’m going to rebase the PR I have outstanding on Monday and will pick those up so I’d like to know ahead of time what to look for | 19:04:18 | |
| I am currently in the middle of many rebuilds. My JAX update PR was basically ready and now I have some kWh to spare rebuilding everything ^^ I am not sure yet about the failures. I have re-tried building the packages that were supposidly failing and it seems to work fine now. | 19:06:32 | |
| I'll let you know if I spot anything fishy. | 19:06:43 | |