| 26 Jan 2025 |
hexa (UTC+1) | wandb build failure on python-updates https://hydra.lossy.network/build/2649332 | 22:26:22 |
hexa (UTC+1) | not sure anyone here cares about that, but samuela used to | 22:26:52 |
| 27 Jan 2025 |
Kevin Mittman (UTC-8) | In reply to @connorbaker:matrix.org "Kevin Mittman: I don’t know if you’re able to raise this internally, but https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#deprecated-toolchains seems like it should have a list of toolchains or something instead of being empty. Are you able to, or would you mind, seeing if it’s supposed to be empty?" Fixed (for some definition of that) | 06:08:56 |
connor (burnt/out) (UTC-8) | I'm treating that doc as meeting notes -- I started drafting an update here: https://pad.lassul.us/ptoW_p-KSPWlD7Ep2Bx66A
Did we ever have an announcement anywhere about CUDA packages being available through the nix-community cache? | 23:10:44 |
| 29 Jan 2025 |
| matthewcroughan changed their display name from matthewcroughan to matthewcroughan (already in Brussels). | 13:35:17 |
| 30 Jan 2025 |
Gaétan Lepage | Which package provides libcusparseLt.so.0 ? | 14:45:19 |
Gaétan Lepage | I tried lib.getLib cudaPackages.libcusparse but it doesn't have it. | 14:46:25 |
Gaétan Lepage | Looks like we need to wait for https://github.com/NixOS/nixpkgs/pull/306172 | 14:50:07 |
ruro | Can somebody review/merge this PR https://github.com/NixOS/nixpkgs/pull/371640 please? | 16:22:09 |
ruro | * I occasionally have some free time and I would like to spend it on improving the state of CUDA in nixpkgs/NixOS. Do you have any suggestions for which issues I should start with?
I am not a complete newbie, but I haven't contributed to nixpkgs all that much (the above-mentioned PR is my third). I guess, I could just start going down the list of eval/build failures on the nix-community CUDA builder... but that might end up just being treadmill work. | 16:47:05 |
connor (burnt/out) (UTC-8) | In reply to @glepage:matrix.org "Looks like we need to wait for https://github.com/NixOS/nixpkgs/pull/306172" I meant to close that since I’ll make a new PR for the CUDA-packages work | 16:47:34 |
Gaétan Lepage | Ok Connor. Do we have a short-term alternative to get this library ? | 16:48:28 |
connor (burnt/out) (UTC-8) | Depends on what you mean by short term :( | 16:53:34 |
connor (burnt/out) (UTC-8) | I should have everything landed by 25.05 but I suppose we’ll need it prior to that | 16:54:03 |
connor (burnt/out) (UTC-8) | I guess I can start trying to land things, but it’ll cause some breakages and I don’t have docs written yet | 16:54:43 |
Gaétan Lepage | I'm asking for pytorch (https://github.com/NixOS/nixpkgs/pull/377785). There is no emergency and we can surely wait before updating it. | 16:56:39 |
connor (burnt/out) (UTC-8) | Ugh didn’t they also remove support for CUDA 12.1 | 16:58:19 |
connor (burnt/out) (UTC-8) | Also I think they support newer architectures now (maybe Blackwell?) | 16:58:36 |
Gaétan Lepage | In reply to @connorbaker:matrix.org "Ugh didn’t they also remove support for CUDA 12.1" From the CI at least: https://github.com/pytorch/pytorch/pull/141271, https://github.com/pytorch/pytorch/pull/142177 | 16:59:51 |
connor (burnt/out) (UTC-8) | My bad I mixed up the CI removal and https://github.com/NVIDIA/TensorRT-Model-Optimizer/releases/tag/0.23.0 removing support for CUDA 11 | 17:02:04 |
| 31 Jan 2025 |
Kevin Mittman (UTC-8) | In reply to @glepage:matrix.org "Which package provides libcusparseLt.so.0 ?" It's separate https://developer.download.nvidia.com/compute/cusparselt/redist/ | 02:13:02 |
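A quick way to confirm which build output actually ships a given shared library is to search the candidate `lib` directories for the soname. The following is only a minimal bash sketch: `find_soname` is a hypothetical helper name, and the directories passed to it are whatever output paths you have already built or downloaded.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of nixpkgs): print every file named $1
# found under the remaining arguments, which are directories to search.
find_soname() {
  local soname="$1"
  shift
  local dir
  for dir in "$@"; do
    # Skip paths that do not exist instead of letting find print errors.
    [[ -d "$dir" ]] || continue
    # -L follows symlinks, since versioned sonames are often symlinks.
    find -L "$dir" -maxdepth 2 -name "$soname" -print
  done
}
```

For example, `find_soname libcusparseLt.so.0 /path/to/libcusparse/lib /path/to/cusparselt/lib` prints a path only for the directory that contains the library, and prints nothing if none of them do.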
connor (burnt/out) (UTC-8) | I am so tired
But I now have setup hooks which can catch common issues like the order of different CUDA directories in a run path
Or fail a build if NVCC’s host compiler leaks out (which can/will cause glibc/glibcxx symbol issues)
Even beyond that
I implemented utility functions for arrays and associative arrays in bash because I got tired of repeating myself in different hooks
And then when I got tired of repeating myself in tests for those functions and hooks, I made a utility derivation to make testing for expected arrays and associative arrays easier | 06:55:57 |
connor (burnt/out) (UTC-8) | It’s still a mess but it’s on this branch if anyone is curious https://github.com/ConnorBaker/cuda-packages/compare/main...fix/runpath-order-matters-and-cuda-compat-gets-clobbered | 06:56:57 |
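To illustrate the kind of run path ordering check described above, here is a minimal bash sketch. It is not the code from the linked branch: `runpath_order_ok` is a hypothetical function, and the rule it enforces (entries matching one pattern must come before entries matching another) is an assumption chosen for illustration.

```bash
#!/usr/bin/env bash
# Hypothetical check (an assumption, not the branch's actual hook):
# succeed only if no run path entry matching $2 appears after an
# entry matching $3.
#   $1: colon-separated run path
#   $2: substring of entries that must come first
#   $3: substring of entries that must come later
runpath_order_ok() {
  local runpath="$1" first_pat="$2" later_pat="$3"
  local -a entries
  local entry seen_later=0
  # Split the run path on ':' into an array.
  IFS=':' read -r -a entries <<< "$runpath"
  for entry in "${entries[@]}"; do
    if [[ "$entry" == *"$later_pat"* ]]; then
      seen_later=1
    elif [[ "$entry" == *"$first_pat"* ]] && (( seen_later )); then
      echo "run path order violation: $entry comes after $later_pat" >&2
      return 1
    fi
  done
  return 0
}
```

A hook could feed it the output of `patchelf --print-rpath` for each installed library, for example `runpath_order_ok "$(patchelf --print-rpath ./libfoo.so)" cuda_compat /run/opengl-driver/lib`, and fail the build on a non-zero exit status. The two patterns in that call are illustrative.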
SomeoneSerge (back on matrix) | Let's schedule a call to discuss how to go forward with stdenv support, setup-hooks, wrappers, config.cudaSupport, localSystem/pkgsCross, clang support, and out-of-tree override-ability of manifests and toolkit components https://crab.fit/cudapackages-ng-781527
CC connor (he/him) (UTC-7), sielicki, Samuel Ainsworth, and anyone interested | 10:49:09 |
| Jonas Chevalier changed their display name from Jonas Chevalier to Jonas Chevalier (FOSDEM). | 19:11:42 |
| SomeoneSerge (back on matrix) changed their display name from SomeoneSerge to SomeoneSerge (Bruxelles). | 19:35:00 |
| 1 Feb 2025 |
| matthewcroughan changed their display name from matthewcroughan (already in Brussels) to matthewcroughan (FOSDEM). | 09:41:01 |