!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

286 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
57 Servers


Sender, Message, Time
24 Sep 2025
@winter:catgirl.cloudWinterdisregard16:06:29
@winter:catgirl.cloudWintercomputers are downright evil16:06:34
@winter:catgirl.cloudWinter(the library it's pointing to isn't actually the one it's loading!)16:06:50
@apyh:matrix.orgapyhis there a server for pytorch stuff specifically, or is this as close as it gets? really struggling to get torch.compile working :/16:38:53
@sporeray:matrix.orgRobbie BuxtonWhat error are you running into apyh?16:42:44
@gammieduncan:matrix.orgDuncan Gammie apyh: you'll probably get the fastest answer if you post the specific error messages here: https://discuss.pytorch.org/c/compile/41 18:30:20
@apyh:matrix.orgapyh
In reply to @sporeray:matrix.org
What error are you running into apyh?
well, torch's .compile functionality requires a bunch of stuff that isn't provided in its nix derivation - it needs gcc at runtime, it reads an /etc/passwd file to pick a cache directory, etc - so it doesn't work out of the box thru its nixpkgs stuff
18:50:26
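[Editor's sketch] The two gaps named above (a compiler at runtime and a cache directory derived from the user name) can usually be worked around from the environment before importing torch. The following is a minimal sketch, not what the nixpkgs derivation does. It assumes that TorchInductor honours `TORCHINDUCTOR_CACHE_DIR` and that Triton picks up the `CC` variable; the helper name `compile_env_overrides` and the cache directory name are hypothetical.

```python
import os
import shutil
import tempfile


def compile_env_overrides(env):
    """Return extra environment variables for torch.compile (hypothetical helper).

    `env` is any mapping shaped like os.environ; it is not modified.
    """
    overrides = {}

    # Assumption: an explicit cache directory avoids the user-name lookup
    # that falls back to /etc/passwd when no user variable is set.
    if "TORCHINDUCTOR_CACHE_DIR" not in env:
        overrides["TORCHINDUCTOR_CACHE_DIR"] = os.path.join(
            tempfile.gettempdir(), "torchinductor_cache"
        )

    # Assumption: Triton uses the compiler named by CC when it is set.
    if "CC" not in env:
        gcc = shutil.which("gcc", path=env.get("PATH", os.defpath))
        if gcc is not None:
            overrides["CC"] = gcc

    return overrides
```

Calling `os.environ.update(compile_env_overrides(os.environ))` before `import torch` applies the overrides to the current process. Whether this is enough depends on the torch and Triton versions in use.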
@apyh:matrix.orgapyhwas just wondering if there was like a torch-nix chat outside here 18:51:40
@sporeray:matrix.orgRobbie BuxtonAh I’ve recently fixed the gcc issue locally, I was planning to put a pr in upstream this week.18:58:56
@apyh:matrix.orgapyhyou will, for CUDA, also need to set TRITON_LIBCUDA_PATH - it normally tries to find it with ldconfig20:09:52
@sporeray:matrix.orgRobbie Buxton How are you providing your cuda kernel libraries, are you on NixOS or a different distribution? 20:15:01
@sporeray:matrix.orgRobbie Buxton I.e where are you getting libcuda.so from? 20:15:42
@apyh:matrix.orgapyhI'm in a docker container 😅21:17:32
@apyh:matrix.orgapyhso i just point to /lib64/libcuda.so21:17:44
@sporeray:matrix.orgRobbie Buxton Nix expects it in /run/opengl-driver/lib 21:18:49
@apyh:matrix.orgapyhah yeah I use nix-gl-host21:20:30
@apyh:matrix.orgapyhfor all that21:20:31
@sporeray:matrix.orgRobbie Buxton I’m not sure what the recommended way of doing that is these days but I symlink in all the required libraries to that path 21:20:32
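[Editor's sketch] The symlink approach described above could look roughly like this. It is a sketch under stated assumptions, not a recommended setup: the function name `link_driver_libs` and the default pattern `libcuda.so*` are hypothetical, and which libraries count as "required" depends on the driver and workload.

```python
import glob
import os


def link_driver_libs(src_dir, dst_dir, pattern="libcuda.so*"):
    """Symlink driver libraries matching `pattern` from src_dir into dst_dir.

    Existing entries in dst_dir are left alone. Returns the links created.
    """
    os.makedirs(dst_dir, exist_ok=True)
    linked = []
    for src in sorted(glob.glob(os.path.join(src_dir, pattern))):
        dst = os.path.join(dst_dir, os.path.basename(src))
        # lexists also catches dangling symlinks left by earlier runs.
        if not os.path.lexists(dst):
            os.symlink(src, dst)
            linked.append(dst)
    return linked
```

With the paths from this conversation, the call would be `link_driver_libs("/lib64", "/run/opengl-driver/lib")`, which normally needs root.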
@sporeray:matrix.orgRobbie Buxton I’m confused why triton is struggling to find cuda tho 21:21:01
@apyh:matrix.orgapyhok cool yeah then if you PR the missing gcc I'll see what else is missing in my setup :)21:21:09
@apyh:matrix.orgapyh
In reply to @sporeray:matrix.org
I’m confused why triton is struggling to find cuda tho
triton runs /sbin/ldconfig to find it
21:21:32
@apyh:matrix.orgapyhwhich doesn't exist under nixos / nix built docker images 21:21:47
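[Editor's sketch] When /sbin/ldconfig is missing, one workaround is to set TRITON_LIBCUDA_PATH yourself, as mentioned earlier in the conversation. The sketch below picks the first candidate directory that contains a libcuda. It assumes the variable takes a directory rather than a file path, which may differ between Triton versions; the helper name `triton_libcuda_override` is hypothetical.

```python
import glob
import os


def triton_libcuda_override(candidates, ldconfig="/sbin/ldconfig"):
    """Return a directory to use for TRITON_LIBCUDA_PATH, or None.

    Returns None when ldconfig exists (the normal lookup is left alone)
    or when no candidate directory contains libcuda.so*.
    """
    if os.path.exists(ldconfig):
        return None
    for directory in candidates:
        if glob.glob(os.path.join(directory, "libcuda.so*")):
            return directory
    return None
```

A possible use, with the two directories named in this conversation, is to call `triton_libcuda_override(["/run/opengl-driver/lib", "/lib64"])` before importing torch and, if the result is not None, store it in `os.environ["TRITON_LIBCUDA_PATH"]`.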
@sporeray:matrix.orgRobbie Buxton I run triton on non NixOS and haven’t had issues 21:22:07
@sporeray:matrix.orgRobbie BuxtonDo you have a minimum repro or is it literally failing on import21:22:22
@apyh:matrix.orgapyhlike, calling torch.compile() fails 21:22:49
@apyh:matrix.orgapyhsec21:22:50
@sporeray:matrix.orgRobbie Buxton Maybe we should start a private chat to avoid spamming 21:23:17
@ss:someonex.netSomeoneSerge (back on matrix)I presume it's the -bin version of tensorflow, which means it's patchelfed to use whatever versions of things we've got and inshallah it happens to work?22:43:11
@ss:someonex.netSomeoneSerge (back on matrix)Didn't we recently change triton so it doesn't retain the reference to gcc?22:44:50
