NixOS CUDA | 289 Members | |
| CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda | 58 Servers |
| Sender | Message | Time |
|---|---|---|
| 12 Apr 2025 | ||
| 12:56:11 | ||
| 13 Apr 2025 | ||
| 11:43:29 | ||
| Hi everyone! I am looking at a bug we have with CDI (Container Device Interface, for forwarding GPU's to containers): https://github.com/NixOS/nixpkgs/issues/397065 I think the user has a correct configuration (unless there are settings that were not mentioned in the issue), my main question is when using the datacenter driver, why the nvidia-container-toolkit is reporting:
Do you have any idea on why NVML would not be present in this environment? | 11:45:34 | |
| HI! I've a small announcement to make. I've been failing badly to keep up with the backlog as a maintainer, even though I'm recently able to spend some more time on Nixpkgs&c. Working in occasional 1:1 meetings, otoh, has always felt comparatively productive. We've just had another call with Gaétan Lepage and I find it was nice, so I now want to try the following: https://md.someonex.net/s/9S4E00sIb# This is not exactly "official", I'm not posting this e.g. on Discourse until I'm more confident, but as such it's an open invitation. | 14:52:27 | |
Indeed, it was great! We were able to finally finish fixint mistral-rs's cuda support! | 15:24:02 | |
* Indeed, it was great! We were able to finally finish fixing mistral-rs's cuda support! | 15:24:22 | |
| 15 Apr 2025 | ||
| BTW folks, if you have a moment, I'd love to get this one merged: https://github.com/NixOS/nixpkgs/pull/367769 | 06:26:28 | |
connor (he/him) (UTC-7): did you use something like josh for cuda-legacy? I suspect this produced at least a few pings 😅 | 13:35:27 | |
| I used https://github.com/newren/git-filter-repo — what would have pinged people? | 13:37:06 | |
| User handles in commit messages xD | 13:37:28 | |
| Hi! Given https://github.com/NixOS/nixpkgs/pull/362197 had conflicts recently due to the treewide formatting I closed it, and reopened it at https://github.com/NixOS/nixpkgs/pull/398993. I think we can merge this one too | 21:24:34 | |
| We have been going back and forth with the author for a while, and I thought it would be good to go ahead on our side | 21:25:10 | |
| Thanks! | 21:29:32 | |
| 17 Apr 2025 | ||
| 09:38:30 | ||
| is this the right place to ask questions / get pointes on how to properly setup cuda container toolkit? For docker it seems to work when enabling deprecated | 11:01:34 | |
| was not lucky at all with containerd for k3s | 11:02:09 | |
| for anybody stumbling over this: I'm pretty sure im on the right track using CDIs, having it work with docker (& compose). Should have read the docs properly. The relevant section section from the nixOS CUDA docs that got me here was all the way at the bottom: https://nixos.wiki/wiki/Nvidia#NVIDIA%20Docker%20not%20Working | 14:38:50 | |
| from all I understand this gives a lot more flexibility to pass accelerators of different vendors to containerized workloads 🥳 | 14:39:36 | |
Yes, CDI is the supported way (and has received a lot of care from @ereslibre), enableNvidia relies on end-of-life runtime wrappers | 16:18:38 | |
Did you manage to get containerd to work? | 16:20:27 | |
| +1, let us know if you run into any issues when enabling CDI :) | 19:31:30 | |
| 18 Apr 2025 | ||
| SomeoneSerge (UTC+U[-12,12]) I removed all the module system stuff from https://github.com/connorbaker/cuda-packages | 11:24:48 | |
| ereslibre: I got it to run with docker but still struggling to getting it to run with containerd and k8s-device-plugin. | 20:46:45 | |
In reply to @luke-skywalker:matrix.orgInteresting. If you feel like it, please open an issue and we can follow up. I did not try to run CDI with either of those | 20:48:38 | |
| 20 Apr 2025 | ||
Updated https://github.com/NVIDIA/build-system-archive-import-examples/issues/5 to reflect preference for the.note.dlopen section over eager-loading | 09:34:53 | |
| 18:15:53 | ||
| 21 Apr 2025 | ||
| Redacted or Malformed Event | 13:54:54 | |
| @luke-skywalker:matrix.org: the moderation bot is configured to drop all media in nixos spaces because there was a spam campaign disseminating csam matrix-wide, it's an unfortunate situation but the mods don't really have any other tools at their disposal | 19:48:15 | |
| 22 Apr 2025 | ||
| 09:32:38 | ||
| ereslibre: where exactly should I make an issue? on nixpkgs gituhb? If so how do I indicate that its about the nvidia-container-toolkit? So far, the attached config got me to:
| 10:56:26 | |