!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

290 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda57 Servers

Load older messages


SenderMessageTime
20 Sep 2025
@connorbaker:matrix.orgconnor (he/him)Posted a reply back21:45:34
21 Sep 2025
@tomberek:matrix.orgtomberekHey, through the GitHub Open Source security program we can get a significant amount of Azure credits. Would the CUDA team be able to effectively use that? Would that make a difference in the maintenance burden?14:10:32
@ss:someonex.netSomeoneSerge (back on matrix)I'm afraid that expert-hours are 146% the scarcest resource right now (as always?)15:48:56
@ss:someonex.netSomeoneSerge (back on matrix) connor (he/him) (UTC-7): repeatedly mentioned he's been experimenting with building nixos images for Azure 15:49:27
@ss:someonex.netSomeoneSerge (back on matrix) If we could have ephemeral azure instances with a mutable nix store that would be amazing 15:50:19
22 Sep 2025
@ss:someonex.netSomeoneSerge (back on matrix) Judging by Gaétan Lepage's emoji reaction, he's already imagining them nixpkgs-reviews going brrrrrrrr. Gaétan Lepage would you have the availability for this one? 08:01:20
@glepage:matrix.orgGaétan LepageTo work on the Azure stuff?09:16:35
@glepage:matrix.orgGaétan Lepage I have gained access to a beefy builder since then, so I'm less constrained than before on the x86_64-linux side of things. 09:17:08
@matthewcroughan:defenestrate.itmatthewcroughan SomeoneSerge (back on matrix): can you bump or have you bumped privately https://github.com/NixOS/nixpkgs/blob/nixos-unstable/pkgs/development/python-modules/opensfm/default.nix#L144 ? 20:19:03
@matthewcroughan:defenestrate.itmatthewcroughanI need it for https://github.com/NixOS/nixpkgs/pull/44200320:19:15
@matthewcroughan:defenestrate.itmatthewcroughanbecause none of the SfM tools work properly in nixpkgs anymore20:19:27
@matthewcroughan:defenestrate.itmatthewcroughanSfM tools are tools that take a dir of image and tag it with inferred data like rotation, coordinates etc, to pipe into gaussian splat utils like brush20:19:52
@matthewcroughan:defenestrate.itmatthewcroughanhttps://github.com/NixOS/nixpkgs/pull/43867220:43:43
@matthewcroughan:defenestrate.itmatthewcroughanhmm, this gets close to fixing colmap20:43:50
23 Sep 2025
@ss:someonex.netSomeoneSerge (back on matrix) matthewcroughan: hard pressed right now, maybe end of week... 00:30:01
@hexa:lossy.networkhexadid anyone here make substitions from the flox cache work yet?01:51:21
@glepage:matrix.orgGaétan LepageI had some cache hits is this is your question08:23:23
@glepage:matrix.orgGaétan Lepage FYI Zowoq has restored the cuda jobset on nix-community:
https://matrix.to/#/%21PbtOpdWBSRFbEZRLIf%3Anumtide.com/%240hueN5_QPZEhj5g4nqSa-gFgmCYe3CMKlG79bd2E-nM?via=blad.is&via=matrix.org&via=envs.net
08:24:47
@hugo:okeso.euHugo

@ss:someonex.net Thanks for your feedback on my PR adding Cuda tests.

I implemented most, but cannot test because I cannot rebuild xformers with CUDA enabled 🫤. I tried many times on multiple machines but no luck. It builds fine without Cuda though.

10:19:57
@hugo:okeso.euHugoRedacted or Malformed Event10:20:52
@hugo:okeso.euHugoScreenshot_20250923_122127.png
Download Screenshot_20250923_122127.png
10:22:03
@hugo:okeso.euHugoScreenshot_20250923_122117.png
Download Screenshot_20250923_122117.png
10:22:24
@hugo:okeso.euHugoI also find it really weird that CPU stays at 100% while the build is stuck.10:22:25
@hugo:okeso.euHugoIs that a known bug in Nix or in the build tools?10:22:55
@a-kenji:matrix.orgkenji changed their display name from a-kenji to kenji.10:42:31
@albertlarsan68:albertlarsan.frAlbert Larsannvcc spawns multiple compiler instances per invocation, and ninja spawns as many nvcc instances as the number of cores/threads, which makes the CPU overcommited (ex: you have 16 threads, ninja spawn 16 nvcc instances, and each one of the nvcc intances spawns 6 cicc instances, and each cicc instance consumes one full cpu thread. So you end up with 16*6=96 processes trying to run at the same time). The build is not stuck, it just takes a very long time to happen (because it tries to do more at the same time than your computer can handle)11:32:01
@hugo:okeso.euHugoTanks for this explanation. My impression is more that the system goes OOM and then something gets stuck and never resumes.11:44:59
@hugo:okeso.euHugoEspecially since a working build (previous release) finished in 14 minutes.12:08:09
@gregorburger:matrix.orgGregor BurgerHi Guys, quick question is there an equivalent cudaPackages.backendStdenv for clang? 12:09:25
@gregorburger:matrix.orgGregor Burger* Hi, quick question is there an equivalent cudaPackages.backendStdenv for clang? 12:11:37

Show newer messages


Back to Room ListRoom Version: 9