NixOS CUDA - Public Room Timeline

	NixOS CUDA	332 Members
	CUDA packages maintenance and support in nixpkgs \| https://github.com/orgs/NixOS/projects/27/ \| https://nixos.org/manual/nixpkgs/unstable/#cuda	63 Servers

Load older messages

Sender	Message	Time
27 May 2026
hexa	in this economy?!	16:04:38
Gaétan Lepage	`nix build --impure --cores 2 --expr ' (import (builtins.getFlake "github:GaetanLepage/nixpkgs/flash-attn") { system = builtins.currentSystem; config = { allowUnfree = true; cudaSupport = true; }; }).python3Packages.flash-attn.gpuCheck '` This should do it.	16:05:28
Gaétan Lepage	* `nix build --impure --expr ' (import (builtins.getFlake "github:GaetanLepage/nixpkgs/flash-attn") { system = builtins.currentSystem; config = { allowUnfree = true; cudaSupport = true; }; }).python3Packages.flash-attn.gpuCheck '` This should do it.	16:05:59
BerriJ	In reply to @hexa:lossy.network in this economy?! It's not my private one unfortunately 😅 But I'm the admin and currently there is no workload on that thing.	16:07:57
Gaétan Lepage	I mean... If only I had nix installed... root@p4-r01-ct18:~# nvidia-smi Wed May 27 16:10:23 2026 +-----------------------------------------------------------------------------------------+ \| NVIDIA-SMI 580.126.21 Driver Version: 580.126.21 CUDA Version: 13.2 \| +-----------------------------------------+------------------------+----------------------+ \| GPU Name Persistence-M \| Bus-Id Disp.A \| Volatile Uncorr. ECC \| \| Fan Temp Perf Pwr:Usage/Cap \| Memory-Usage \| GPU-Util Compute M. \| \| \| \| MIG M. \| \|=========================================+========================+======================\| \| 0 NVIDIA GB200 On \| 00000008:01:00.0 Off \| 0 \| \| N/A 45C P0 170W / 1200W \| 0MiB / 189471MiB \| 0% Default \| \| \| \| Disabled \| +-----------------------------------------+------------------------+----------------------+ \| 1 NVIDIA GB200 On \| 00000009:01:00.0 Off \| 0 \| \| N/A 45C P0 153W / 1200W \| 0MiB / 189471MiB \| 0% Default \| \| \| \| Disabled \| +-----------------------------------------+------------------------+----------------------+ \| 2 NVIDIA GB200 On \| 00000018:01:00.0 Off \| 0 \| \| N/A 45C P0 153W / 1200W \| 0MiB / 189471MiB \| 0% Default \| \| \| \| Disabled \| +-----------------------------------------+------------------------+----------------------+ \| 3 NVIDIA GB200 On \| 00000019:01:00.0 Off \| 0 \| \| N/A 45C P0 176W / 1200W \| 0MiB / 189471MiB \| 0% Default \| \| \| \| Disabled \| +-----------------------------------------+------------------------+----------------------+ +-----------------------------------------------------------------------------------------+ \| Processes: \| \| GPU GI CI PID Type Process name GPU Memory \| \| ID ID Usage \| \|=========================================================================================\| \| No running processes found \| +-----------------------------------------------------------------------------------------+	16:10:33
hexa	makes you wonder who we are building cuda support for	16:12:00
Gaétan Lepage	Not for the owners of those GPUs unfortunately 🥲	16:20:08
BerriJ	In my case I'm working at a German University and the server is used by a team of around 9 researchers :)	16:33:56
hexa	pretty sure Gaetan works at some French university 😆	16:37:26
Gaétan Lepage	Not anymore. (French universities don't have such fancy GPUs) 🫠	16:38:38
BerriJ	The build is running now :)	17:16:42
SomeoneSerge (matrix works sometimes)	Can't you nix in container?	18:26:52
SomeoneSerge (matrix works sometimes)	Not TUM?	18:27:57
SomeoneSerge (matrix works sometimes)	Not the OS group? I'd be hyped yo learn that somebody in academia/hpc/rse community actually uses nixpkgs cuda, because so far I've been getting the vibes that only the enterprise cares, and all these eurohpc/CSC/yada yada are completely unapproachable and dead set on their easybuild lmod workflows...	18:33:29
BerriJ	University of Duisburg-Essen, not TUM But it's really not that big of a deal. The economics faculty has its own little IT department, they bought some servers for machine learning of which our Chair was able to get one and we asked them to install nixos on that for us cause we use nixos since 2 years on all of our machines. That's essentially the full story, there is not that much support for NixOS besides me pushing it and my Boss seeing the advantages and sometimes proudly talking about our infra 😅	18:55:53
SomeoneSerge (matrix works sometimes)	Shooting in the dark but anything that could be done or reprioritized on our side to potentially help the lab's story?	19:48:31
BerriJ	Well the biggest point is the cache. Currently we obtain pytorch and other ml packages from pypi cause it has the CUDA binaries packaged directly. I we can't really risk getting cache misses and triggering a 5 hour recompilation on my colleagues machines. And setting up our own binary cache is also not trivial, we are working from home a lot and the machines are only connected to the university vpn on demand. I've read that there is this flox cache now, but I also read that this does not strictly follow nixos-unstable.	20:35:31
BerriJ	By the way the build is still running it's at the `pytestCheckPhase` of flash attention and causes a good 60gb of VRAM usage at the moment. I'll call it a day and report on the status tomorrow morning 🙂	20:37:16
	@busti:leitstelle511.net left the room.	21:16:57
28 May 2026
Gaétan Lepage	CUDA 13.3 is out: https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates/	07:16:08
Gaétan Lepage	* CUDA 13.3 is out: https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates/ PR: https://github.com/NixOS/nixpkgs/pull/525130	07:26:31
BerriJ	Its still running but it shows an error: [1/10/11 built] building python3.13-flash-attention-2.8.3 (buildPhase): [4/73] /nix/store/50fi3x00m5ksrpcmc4pbkh57h9dhl8ls-cuda12.9-cuda_nvcc-12.9.86/bin/nvcc -MD -MF /build/source/build/temp.linux-x86_64-cpython-313/csrc/flash_attn/src/flash_fwd_hdim64_bf16_causal_sm80.o.d -I/build/source/csrc/flash_attn -I/build/source/csrc/flash_attn/src -I/build/source/csrc/cutlass/include -I/nix/store/291svvk6bn3mfw0k57lp8d2plhdb1k46-python3.13-torch-2.11.0/lib/pyth[1/10/1[1/10/11 built] building python3.13-flash-attention-2.8.3 (buildPhase): [14/73] /nix/store/50fi3x00m5ksrpcmc4pbkh57h9dhl8ls-cuda12.9-cuda_nvcc-12.9.86/bin/nvcc -MD -MF /build/source/build/temp.linux-x86_64-cpython-313/csrc/flash_attn/src/flash_fwd_hdim32_bf16_sm80.o.d -I/build/source/csrc/flash_attn -I/build/source/csrc/flash_attn/src -I/build/source/csrc/cutlass/include -I/nix/store/291svvk6bn3mfw0k57lp8d2plhdb1k46-python3.13-torch-2.11.0/lib/pyt[1/10/11 built[1/10/11 built] building python3.13-flash-attention-2.8.3 (pytestCheckPhase): /nix/store/291svvk6bn3mfw0k57lp8d2plhdb1k46-python3.13-torch-2.11.0/lib/python3.13/site-packages/torch/cuda/random.py:126: AcceleratorErr[1/10/11 built] building python3.13-flash-attention-2.8.3 (pytestCheckPhase): /nix/store/291svvk6bn3mfw0k57lp8d2plhdb1k46-python3.13-torch-2.11.0/lib/python3.13/site-packages/torch/cuda/random.py:126: AcceleratorError	07:43:05
BerriJ	* Its still running but it shows an error: `[1/10/11 built] building python3.13-flash-attention-2.8.3 (pytestCheckPhase): /nix/store/291svvk6bn3mfw0k57lp8d2plhdb1k46-python3.13-torch-2.11.0/lib/python3.13/site-packages/torch/cuda/random.py:126: AcceleratorError`	07:43:34
Gaétan Lepage	Thanks for the feedback. The CUDA team is trying hard to make the community cache approved by Nvidia. Also, we're working on ramping up our infra to build, test and cache more derivations, faster and for more platforms. We have some plans to provide "cuda" nixpkgs channels that would guarantee certain packages to be cached. We'll announce things as soon as they happen, but be sure that we take those requests very seriously	09:30:44
prince213	I wonder if people actually do LLM inference with CUDA packages from Nixpkgs	10:32:45
grw00	tried to run this and earlyoom killed my desktop :D	10:35:02
prince213	But anyway I’m working on packaging SGLang and trying to learn CUDA in Nixpkgs (the hard way I suppose)	10:35:29
prince213	https://github.com/NixOS/nixpkgs/pull/525141	10:35:33
Gaétan Lepage	I warned you ;)	11:44:54
Gaétan Lepage	Haha nice! Feel free to ping me for review or if you need some help	11:45:58

Show newer messages

Back to Room ListRoom Version: 9