!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

279 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda57 Servers

Load older messages


SenderMessageTime
12 Nov 2025
@arilotter:matrix.orgAri Lotterallllllright let's try nixpkgs-review again with the new binary cache :p15:30:38
@arilotter:matrix.orgAri Lotterstill building jax, but, uhhhh, we ball15:40:02
@glepage:matrix.orgGaétan LepageWhich PR?15:56:37
@arilotter:matrix.orgAri Lotterthis one https://github.com/NixOS/nixpkgs/pull/46070115:57:23
@arilotter:matrix.orgAri Lotter yep, workers keep crashing :/
] building python3.12-jax-0.8.0 (pytestCheckPhase): replacing crashed worker gw1
15:57:38
@arilotter:matrix.orgAri Lotter hm,
warning: ignoring the client-specified setting 'sandbox', because it is a restricted setting and you are not a trusted user
warning: ignoring the client-specified setting 'system', because it is a restricted setting and you are not a trusted user
would setting myself as a trusted user fix this, i wonder
16:02:06
@daniel-fahey:matrix.orgDaniel Fahey Yeah you might want to use extra-substituters (I never grok'd the difference, if I'm being honest) 16:06:51
@daniel-fahey:matrix.orgDaniel Faheynot sure if it's different with Nix on Ubuntu, but on NixOS, I have to rebuild the system before the binary cache is available. Is there a rebuild step with plain Nix?16:08:28
@daniel-fahey:matrix.orgDaniel Fahey

I reckon they might be in a private repo hinted at in https://github.com/flox/nixpkgs/pull/3#issuecomment-1276439899

But the https://github.com/flox/nixpkgs/tree/unstable is a simple fork that I'd like to see sync'd/rebased from upstream Nixpkgs more frequently.

Gaétan Lepage is this is the kind of thing that could be discussed in your CUDA Team meetings, and maybe brought up to the Steering Committee for discussion with Flox?

16:12:16
@glepage:matrix.orgGaétan Lepage Flox is managing their cache internally. As you pointed, they use an internal fork of nixpkgs that is slightly delayed from nixos-unstable (or nixos-unstable-small). It's normal that their cache is less fresh than chache.nixos-cuda.org. 16:13:41
@glepage:matrix.orgGaétan LepageThe difference is that they have the permission from Nvidia to redistribute their binaries.16:14:01
@daniel-fahey:matrix.orgDaniel Fahey😅16:14:45
@arilotter:matrix.orgAri Lotteri have it in both because i don't understand it <316:15:52
@arilotter:matrix.orgAri Lotter also - 100% sure i'm not running out of ram anymore, at only ~400gb/2tb used on the machine, but jax still has crashed workers - and not sure if it's progressing 16:17:00
@arilotter:matrix.orgAri Lottercan i pull up interactive logs for its derivation somehow?16:17:11
@arilotter:matrix.orgAri Lotteroh nice, i got a real failure16:17:50
@daniel-fahey:matrix.orgDaniel Faheyimage.png
Download image.png
16:18:16
@daniel-fahey:matrix.orgDaniel FaheyAnother option Ari, is you follow the "Help" button next to the closure link at https://hydra.nixos-cuda.org/build/8123 16:18:17
@daniel-fahey:matrix.orgDaniel Fahey Could just pipe output through tee, but normally at the tail of the the crash, it has the command to run to see the log 16:19:19
@daniel-fahey:matrix.orgDaniel FaheyI'd be interested to see them, see if they're similar to what I saw on nixbuild.net -> you said it crashed during the test stage?16:19:53
@daniel-fahey:matrix.orgDaniel Fahey*phase16:19:56
@daniel-fahey:matrix.orgDaniel Fahey I also wonder if nixpkgs-review is complicating things 16:20:33
@arilotter:matrix.orgAri Lottergrabbing logs!16:22:05
@arilotter:matrix.orgAri LotterDownload test.txt16:27:01
@arilotter:matrix.orgAri Lotterthat's a big log lol16:27:04
@arilotter:matrix.orgAri Lotter but yeah, i hit both some INTERNAL: Failed to materialize symbols and some LLVM compilation error: Cannot allocate memory 🤔 16:28:19
@arilotter:matrix.orgAri Lotter(keeping in mind i had >1600gb of ram available at all times)16:28:52
@daniel-fahey:matrix.orgDaniel FaheyMight be different than what I saw on nixbuild.net but not sure, did your new attempt crashed building the python3.13 version, so not 3.12 like yesterday The (server grade) Intel hypothesis still isn't disproven, tbf16:40:31
@arilotter:matrix.orgAri Lotter 3.12 version is stuck on building python3.12-jax-0.8.0 (pytestCheckPhase): replacing crashed worker gw1 16:41:02
@daniel-fahey:matrix.orgDaniel FaheyHave you fed it into a clanker yet?16:41:04

Show newer messages


Back to Room ListRoom Version: 9