NixOS CUDA | 332 Members | |
| CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda | 63 Servers |
| Sender | Message | Time |
|---|---|---|
| 19 May 2026 | ||
| 16:41:32 | ||
| 20:15:25 | ||
| hey folks, I'm looking for feedback on this PR :: https://github.com/NixOS/nixpkgs/pull/515928 My goal with these changes is to speed up my NVIDIA Jetson Orin Nano builds by adding dGPU architectures targets for Jetson Xavier (sm_72), Jetson Orin (sm_87), Jetson Thor (sm_110). I'm happy to iterate more on the changes if there is feedback :) | 20:22:59 | |
| if/when this gets merged, I was planning on raising a follow-up fix to the nixos-cuda inference package to support these new params. https://github.com/nixos-cuda/infra | 20:25:01 | |
Hi! Just so you know, aarch64-linux is not supported in our current CI. | 21:24:45 | |
hi! I did see that there wasn't a callout for an aarch64-linux system in the infra package. I was going to include more thoughts in the follow-up CR, but the few options I see are (a) enable cross compilation via qemu on current hardware (ie :: https://github.com/81reap/jetpack-nixos/blob/d20caea8befd5fdad99efc9fad5527e70124ca98/README.md?plain=1#L339-L351); (b) onboard my own Jetson Nano as a target to build things for the infra pipeline; or (c) I donate $250 for the NVIDIA Jetson Orin Nano Super Developer Kit to be purchased by one of the current maintainers | 22:51:06 | |
| that said, I think the lowest bar to test this on the current CI is to enable cross complication. But that's just my current opinion. Happy to chat more and bar raise these thoughts :) | 22:52:04 | |
| 20 May 2026 | ||
| Our infra is fairly limited in terms of capacity and doing builds through QEMU would be prohibitively expensive. Not to say this can't happen, but it's certainly not a platform we build for or test. | 02:05:15 | |
| Gaétan Lepage: for gpu-burn: https://github.com/NixOS/nixpkgs/pull/522144 I still need to look into CUDA compat/fix the assumption in the logic that cudaVariant only ever has a major version suffix | 05:12:41 | |
| And then also investigate the NVCC fatal error encountered when using family feature sets with baseline ones | 05:15:40 | |
| For the CUDA variant stuff, look for changes here:
| 05:20:42 | |
| If you're able to poke at the gpu-burn PR I'd appreciate it. I've been running benchmarks with https://github.com/ConnorBaker/nix/tree/vibe-coding/optimise-and-gc-throughput-baseline-bench-rig-616df9797 and https://github.com/ConnorBaker/nix/tree/vibe-coding/optimise-and-gc-throughput before submitting PRs upstream to parallelize/make faster store optimise and gc. They've been running for two days and I don't want to fully load the system while it's doing that. | 05:23:23 | |
| Thanks Prayag Bhakar for your suggestions. We are actively working in the background to secure "sponsorships" and get a legitimate compute capacity, but this is not done yet. | 08:34:33 | |
| I see, thanks for the input connor (burnt/out) (UTC-8) Gaétan Lepage just to clarify a few things for my understanding (1) This is only a blocker for updating the cuda infra pipeline, right? Or is this also a blocker for updating Nixpkgs to properly support aarch64? (2) How does the current infra work? Is its machines hosted by volunteers/maintainers? I'm trying to understand what "legitimate compute capacity means" and if solution B or C would be viable | 13:18:50 | |
| Hey, sorry for disappearing on the github ticket, v limited bandwidth currently. The first and most important point, elaborating on Gaétan's reply: our infra currently lacks aarch64 builders, and frankly we as of now haven't even a fraction of the x8664 capacity that we need. We are currently working on securing proper funding for the hardware and for the general effort, and first and foremost for reducing the harm and the extra load that CUDA imposes on the rest of Nixpkgs maintainers. It's been in the works for almost two years now. Recently our entire team, including our new "manager" member @dhofer:matrix.org, has been fully dedicating to making this happen, but, while there's been some very modest progress, it's going to take a while before we start testing and caching Jetsons or in fact anything besides the vanilla x86-64-linux nixpkgs simply due to cashflow considerations. We are actively looking for companies willing to properly pay for the service of testing and keeping the Jetson ecosystem "green" & functional here upstream in Nixpkgs, but so far there's no ETAs for this specific effort. In our personal projects we normally use very different kind of hardware, so it's not a priority for any of the maintainers. Regarding your other messages, just some clarifications:
Now to what is a priority to us, fot example: I'm happy to brainstorm with anyone about how do we get rid of the backendStdenv thing and how to cleanly model coprocessors/accelerators in the elaborated-system structure, whether to make coprocessors part of a system "quadruple" (contrast to triple), and whether the specific cudaCapabilities and/or rocm gpuTargets should become a part of... well, I suppose, the system "polycule". | 18:22:16 | |
| This too would be very welcome, but before we can make any use of that we need to secure the general aarch64 cpu build capacity! | 18:24:17 | |
| no worries SomeoneSerge (matrix works sometimes) I'm grateful for any time folks are taking out of their day to help me out. I figured the conversation may be more fruitful in the matrix channel
ah, sorry I was under the impression it was close enough. Since option A is off the table, do options B and C have any legs? I understand that there is a process to secure more compute, but can that compute come from volunteers like me? From my understanding even with just a Jetson nano, it can start compiling other aarch64 packages until more capable compute is secured
oh I see, so have the GPU definitions be part of the core | 22:56:23 | |
| While a Jetson can indeed build packages for We appreciate the proposition, but until we get access to something resembling a serious build capacity for this platform, we won't spend time supporting it. | 23:00:50 | |
| I see, so at the minimum targeting the $5k budget hardware range with Ampere Altra or an Apple M Series Mac (probably also need Asahi Linux). Is there a donation target/pool for this goal? | 23:59:21 | |
| 21 May 2026 | ||
| * I see, so at the minimum targeting the $5k budget hardware range with Ampere Altra or a used/refurbished Apple M Series Mac (probably also need Asahi Linux). Is there a donation target/pool for this goal? | 00:11:20 | |
| * I see, so at the minimum targeting the $5k budget hardware range with Ampere Altra Dev Box/Server or a used/refurbished Apple M Series Mac (probably also need Asahi Linux). Is there a donation target/pool for this goal? | 00:14:07 | |
| Thanks once again for proposing to help. Yes, Ampere altra is a solution. Mac + Asahi less so as we want to avoid hacking stuff around more than necessary. We will be sharing a detailed list of what systems we are looking to get for our CI. However, the budget range is more enterprise-scale than individual-donation scale. As mentionned before, we're strenghtening our collaboration with companies which are interested to help us on this front. I hope to be able to update you on our infra soon. | 07:45:54 | |
| /me goes on to dream about an Asahi cluster for reproducibility studies | 12:36:34 | |
| got it. is there anything I can help with? Sounds like there's an ongoing effort to refactor I have a lot of machines with Nvidia GPUS that I use nix & nixOS with, so I would still like to help improve the system https://prayag.bhakar.org/000-00-0000/apollo-server.jpg | 14:39:34 | |
Hello, I find that the binary cache doesn't have one for cudaPackages.libnvshmem on my system, which resorts to building from source which I don't have the resources to. Hydra says that the package is there, so I'm not sure on that. Exact name of the package is/nix/store/4nzqdwc370may1ilz6zyy34ym16jlqvn-cuda12.9-libnvshmem-3.6.5-0.drv | 15:14:03 | |
| Does Asahi work on M2? We could get a Mac Pro that nobody needs for relatively cheap ;) | 15:23:44 | |
| I'd rather use it to build/test packages on darwin | 16:23:08 | |
staging-next was merged very recently into master, so our Hydra instance is probably catching up at this moment. | 16:23:39 | |
Download image.png | 17:37:08 | |
| seems like folks have been able to compile Linux on Mac with a few patchs https://seiya.me/blog/building-linux-on-macos-natively | 17:37:41 | |