!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

291 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
58 Servers

18 Mar 2025
@stick:matrix.org stick: * if you are feeling adventurous, you might as well update pkgs/development/cuda-modules/tensorrt/releases.nix to contain the latest 10.9.x.y release (both for cuda 11.x and 12.x) 20:12:16
@koutensky:matrix.nesad.fit.vutbr.cz Michal Koutenský: both report '10.3.0' for me 20:13:44
@koutensky:matrix.nesad.fit.vutbr.cz Michal Koutenský: sure, i can do that tomorrow 20:14:26
@stick:matrix.org stick: in that case it makes sense to update version on line 16, not the unpack phase 20:15:01
@stick:matrix.org stick: so the nix package has the same version as reported by wheel 20:15:18
@koutensky:matrix.nesad.fit.vutbr.cz Michal Koutenský: yeah that makes sense 20:15:26
@stick:matrix.org stick:

SomeoneSerge (UTC+U[-12,12]): I finally managed to build onnxruntime with CUDA 12.8 - see https://github.com/NixOS/nixpkgs/pull/390885

can you run the CUDA version bump test suite and let me know if we can merge the PR?

22:20:53
@stick:matrix.org stick: *

SomeoneSerge (UTC+U[-12,12]): I finally managed to build onnxruntime (afaik the only blocker) with CUDA 12.8 - see https://github.com/NixOS/nixpkgs/pull/390885

can you run the CUDA version bump test suite and let me know if we can merge the PR?

22:21:20
@stick:matrix.org stick: I am building magma/torch/vllm as we speak - but IIRC this already went OK when I tried it a few weeks ago 22:21:59
@stick:matrix.org stick: the onnxruntime fix was to turn off LTO - it went bonkers (into an infinite loop) when trying to do the final link - I looked into the Gentoo ebuilds and they also turn off LTO when linking onnxruntime with CUDA 22:24:01
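
For illustration, the LTO workaround described above could be sketched as an overlay that appends a CMake flag. `onnxruntime_ENABLE_LTO` is an upstream onnxruntime CMake option; whether the linked PR disables LTO through exactly this mechanism is an assumption, not something the chat states.

```nix
# Hypothetical overlay: turn off link-time optimization for onnxruntime.
# Assumption: the package is built with CMake and honors cmakeFlags.
final: prev: {
  onnxruntime = prev.onnxruntime.overrideAttrs (old: {
    cmakeFlags = (old.cmakeFlags or [ ]) ++ [
      # Appended last, so it takes precedence over an earlier setting
      # of the same option on the cmake command line.
      (prev.lib.cmakeBool "onnxruntime_ENABLE_LTO" false)
    ];
  });
}
```

The flag is appended rather than replacing the list, so the package's other CMake options stay untouched.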
@ss:someonex.net SomeoneSerge (back on matrix): stick: just a heads up: I've been using the university workstation for nixpkgs-review so far, but I'm no longer employed by the uni and am migrating between infrastructures; going to take time 🤷 22:24:51
@stick:matrix.org stick: so the version bump test is only running nixpkgs-review on the PR? 22:25:57
@ss:someonex.net SomeoneSerge (back on matrix): There's nixpkgs-review with cudaSupport=true, there's passthru gpuChecks, there's samuela's and Connor's out-of-tree test suites 22:27:50
@ss:someonex.net SomeoneSerge (back on matrix): Nixpkgs-review is a bit of cargo culting but it gives an idea of the size of the fallout 22:29:14
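
A minimal sketch of what "with cudaSupport=true" amounts to: nixpkgs is evaluated with a config that enables CUDA globally. The attribute paths for the three packages named in the chat are assumptions about where they live in nixpkgs.

```nix
# Sketch: evaluate nixpkgs with CUDA enabled for every package that supports it.
# Assumes <nixpkgs> resolves to the checkout under test.
let
  pkgs = import <nixpkgs> {
    config = {
      allowUnfree = true; # the CUDA toolkit is unfree
      cudaSupport = true; # switch packages to their CUDA variants
    };
  };
in
{
  # Packages mentioned in the chat; attribute names assumed.
  inherit (pkgs) magma;
  inherit (pkgs.python3Packages) torch vllm;
}
```

nixpkgs-review can be given an equivalent attribute set through its `--extra-nixpkgs-config` option, so the same two settings apply to a full PR review.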
@stick:matrix.org stick: yeah, i use it often locally - but there are many failing packages also on master unfortunately 22:30:00
@stick:matrix.org stick: * yeah, i use it often locally - but there are many not-so-important failing packages also on master unfortunately 22:30:20
@ss:someonex.net SomeoneSerge (back on matrix): Yes, it's definitely too many compute hours spent on just getting a statistic (# failures) 22:38:34
21 Mar 2025
@domenkozar:matrix.org Domen Kožar changed their profile picture. 11:40:19
23 Mar 2025
@connorbaker:matrix.org connor (he/him) changed their display name from connor (he/him) (UTC-8) to connor (he/him) (UTC-7). 18:28:18
24 Mar 2025
@connorbaker:matrix.org connor (he/him): How did I not know about nvCOMP?! https://docs.nvidia.com/cuda/nvcomp/index.html 14:37:00
@ss:someonex.net SomeoneSerge (back on matrix): Did you ever need it? 14:37:53
@msanft:matrix.org Moritz Sanft: Are you sure that removing the OpenSSL 1.1 compatibility libraries from the NVIDIA driver package is correct? see: https://github.com/NixOS/nixpkgs/blob/a13cc0bc42a89a025f545c4a17f25e6300710f39/pkgs/os-specific/linux/nvidia-x11/builder.sh#L81-L82 I ran into PyTorch 2.6 trying to open that library today, hence the question. 15:41:42
@msanft:matrix.org Moritz Sanft: * Are we sure that removing the OpenSSL 1.1 compatibility libraries from the NVIDIA driver package is correct? see: https://github.com/NixOS/nixpkgs/blob/a13cc0bc42a89a025f545c4a17f25e6300710f39/pkgs/os-specific/linux/nvidia-x11/builder.sh#L81-L82 I ran into PyTorch 2.6 trying to open that library today, hence the question. 15:41:49
@ss:someonex.net SomeoneSerge (back on matrix): Hm. The user would have to manually enable patchelfing the paths to openssl 1.1 anyway, I'd say we shouldn't include them by default because then everything goes behind permittedInsecurePackages or what's the name 15:43:20
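
For context, the opt-in mentioned above could look roughly like the following NixOS configuration fragment. The exact name-version string is an assumption; the evaluation error for an insecure package normally prints the value to use.

```nix
# Sketch: explicitly allow an insecure OpenSSL 1.1 in a NixOS configuration.
{
  nixpkgs.config.permittedInsecurePackages = [
    # Assumed name-version string; copy the exact value from the error message.
    "openssl-1.1.1w"
  ];
}
```

Shipping the compatibility libraries by default would force this opt-in on every user of the driver package, which is the concern raised in the message.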
@ss:someonex.net SomeoneSerge (back on matrix):

I ran into PyTorch 2.6 trying to open that library today, hence the question.

Can you also file a report with the logs on github?

15:43:58
@msanft:matrix.org Moritz Sanft: Sure! 15:47:58
@glepage:matrix.org Gaétan Lepage:

Hi,
I'm looking at mistral-rs and trying to make autoAddDriverRunPath and autoPatchelfHook work, but I get this at runtime:

Unable to dynamically load the "cuda" shared library - searched for library names: ["cuda", "nvcuda"]. Ensure that `LD_LIBRARY_PATH` has the correct path to the installed library. If the shared library is present on the system under a different name than one of those listed above, please open a GitHub issue.
16:30:22
@ss:someonex.net SomeoneSerge (back on matrix): They probably CDLL on the Python side? 17:26:26
@glepage:matrix.org Gaétan Lepage: It's Rust 17:43:35
@glepage:matrix.org Gaétan Lepage: But yeah, they surely dlopen 17:43:40
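
A rough sketch of how the driver runpath hook is usually wired into a package, written as an overlay. This is not a confirmed fix for the error quoted above: the `mistral-rs` attribute name is hypothetical, and the hook is assumed to be exposed at the top level as `autoAddDriverRunpath`.

```nix
# Hypothetical overlay: add the driver runpath hook to a package that
# loads libcuda at runtime via dlopen.
final: prev: {
  # Attribute name assumed for illustration.
  mistral-rs = prev.mistral-rs.overrideAttrs (old: {
    nativeBuildInputs = (old.nativeBuildInputs or [ ]) ++ [
      # Assumed hook name; it adds the driver library directory to the
      # runpath of the ELF files in the package outputs during fixup.
      final.autoAddDriverRunpath
    ];
  });
}
```

Since dlopen resolves a bare library name using the runpath of the calling object, the hook only helps if the object that performs the lookup is one of the patched files.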
