!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

291 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
57 Servers

Sender Message Time
31 Oct 2024
@glepage:matrix.orgGaétan Lepage * If you happen to have just woken up from 6 months spent in a coma, don't worry, the triton repo still hasn't got any new tag. 22:45:22
1 Nov 2024
@ss:someonex.netSomeoneSerge (back on matrix) I don't think anything short of a github action publishing automatic tags in release branches is going to work with this organization :) 07:40:32
@ss:someonex.netSomeoneSerge (back on matrix) That might not either but stands a chance (they merged stuff before) 07:40:56
@gdesforges:matrix.orgGuillaume Desforges changed their profile picture. 21:19:14
2 Nov 2024
@sielicki:matrix.orgsielicki I do not really follow the conversation on https://github.com/NixOS/nixpkgs/pull/343706 23:22:57
@sielicki:matrix.orgsielicki I don't think it actually matters in this case, but I just want to call it out in case it ever comes up again in the context of nixpkgs: to support the CUDA stubs (which let you successfully link a binary against the CUDA driver API with the CUDA toolkit alone, even though the toolkit doesn't include the full UMD blob), on normal distributions NVIDIA creates a /usr/local/cuda/lib64/stubs/libcuda.so file, which claims a SONAME with .1, but the file name on disk doesn't actually have that suffix. That means the dynamic loader will never accidentally confuse the stub with the real library. 23:23:59
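The name-matching argument in the message above can be sketched in Python. This is a deliberately simplified model, not how ld.so is implemented: the real loader also consults RPATH/RUNPATH, LD_LIBRARY_PATH and its cache, and checks that a candidate is a compatible ELF object. The function names and the `stubs`/`driver` directories are hypothetical and exist only for this illustration.

```python
import os
import tempfile


def resolve_needed(needed, search_dirs):
    """Return the first path in search_dirs whose file name equals `needed`.

    Simplified model (assumption): only exact file-name matches in an
    ordered list of directories are considered.
    """
    for directory in search_dirs:
        candidate = os.path.join(directory, needed)
        if os.path.exists(candidate):
            return candidate
    return None


def touch(path):
    # Create an empty placeholder file; no real ELF content is needed here.
    with open(path, "w"):
        pass


def demo(stub_has_versioned_name=False):
    """Build a throwaway layout and resolve the name `libcuda.so.1`."""
    with tempfile.TemporaryDirectory() as root:
        stubs = os.path.join(root, "stubs")    # hypothetical stub directory
        driver = os.path.join(root, "driver")  # hypothetical driver directory
        os.makedirs(stubs)
        os.makedirs(driver)
        touch(os.path.join(stubs, "libcuda.so"))     # stub: unversioned name only
        touch(os.path.join(driver, "libcuda.so.1"))  # real driver library
        if stub_has_versioned_name:
            # Models a stub that is also reachable under the versioned name.
            touch(os.path.join(stubs, "libcuda.so.1"))
        # The stub directory is searched first on purpose (worst case).
        found = resolve_needed("libcuda.so.1", [stubs, driver])
        return os.path.relpath(found, root)


if __name__ == "__main__":
    print(demo())
    print(demo(stub_has_versioned_name=True))
```

In this model, a stub that exists only as `libcuda.so` is never picked for a `libcuda.so.1` request, even when its directory comes first. Once a file with the versioned name also exists in the stub directory, search order decides which one is found.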
@ss:someonex.netSomeoneSerge (back on matrix) Huh 23:25:10
@sielicki:matrix.orgsielicki * but now, for some of the libraries that got relinked here, I think there are now two files that have the same name, and I'm not sure what the dynamic loader does. Edit: maybe not under NixOS, but potentially for people using Nix on Ubuntu or whatever 23:25:52
@sielicki:matrix.orgsielicki here let me find you a link, one sec 23:26:10
@sielicki:matrix.orgsielicki https://discourse.cmake.org/t/linking-against-cuda-cuda-driver-not-working-right-with-libcuda-stub-wants-libcuda-so-1-not-libcuda-so/7455/13 23:26:38
@sielicki:matrix.orgsielicki this is his first and only post on the CMake forums and it's <3'd by Rob Maynard, who works at NVIDIA 23:27:24
@sielicki:matrix.orgsielicki I just kinda don't get the motivation, it's not really clear why the FHS steam shrinking needed this 23:33:25
@sielicki:matrix.orgsielicki but who knows 23:33:40
@sielicki:matrix.orgsielickias long as i'm stuck on my computer on a saturday, qq: is there a strong reason to hold cudart at 12.4 in master, or just nobody raised a PR for it? 23:39:13
@sielicki:matrix.orgsielicki https://github.com/NixOS/nixpkgs/pull/322075 looks like he just never got back to handling review comments :\ 23:40:51
@ss:someonex.netSomeoneSerge (back on matrix)
❯ readelf -d result-stubs/lib/stubs/libcuda.so
...
 0x000000000000000e (SONAME)             Library soname: [libcuda.so.1]
...
❯ readelf -d /run/opengl-driver/lib/libcuda.so
...
 0x000000000000000e (SONAME)             Library soname: [libcuda.so.1]
23:40:52
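A small sketch for comparing SONAMEs like the two shown above, assuming the `readelf -d` text has already been captured as a string. The helper name `parse_soname` is hypothetical, and the pattern only targets the `(SONAME)` line format visible in this output.

```python
import re

# Matches a dynamic-section line such as:
#  0x000000000000000e (SONAME)             Library soname: [libcuda.so.1]
SONAME_RE = re.compile(r"\(SONAME\)\s+Library soname: \[(?P<soname>[^\]]+)\]")


def parse_soname(readelf_output):
    """Return the SONAME found in `readelf -d` text, or None if absent."""
    match = SONAME_RE.search(readelf_output)
    return match.group("soname") if match else None


if __name__ == "__main__":
    sample = " 0x000000000000000e (SONAME)             Library soname: [libcuda.so.1]"
    print(parse_soname(sample))
```

Running the helper on the output for both the stub and the driver library makes it easy to check whether the two report the same SONAME even though their file names differ.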
@ss:someonex.netSomeoneSerge (back on matrix) I think the last comment in that discourse might be misleading 23:41:13
@sielicki:matrix.orgsielicki maybe I did a poor job of summarizing it here in riot but that's what I expected to see 23:42:13
@ss:someonex.netSomeoneSerge (back on matrix) Nvidia ships stubs without the .1s which is why we have https://github.com/NixOS/nixpkgs/blob/a8ffc2295c358629bc1bda569bf8b3bbb21aa1be/pkgs/development/cuda-modules/cuda/overrides.nix#L124-L129 23:42:54
@sielicki:matrix.orgsielicki The problem I'm wondering about is what actually enforces that ld.so prefers /run/opengl-driver/lib to /usr/local/nvidia/lib64/stubs? or potentially someone's conda env or virtualenv 23:42:59
@sielicki:matrix.orgsielicki with RPATH'ing all the things, it's probably fine 23:43:31
@ss:someonex.netSomeoneSerge (back on matrix)
In reply to @sielicki:matrix.org
The problem I'm wondering about is what actually enforces that ld.so prefers /run/opengl-driver/lib to /usr/local/nvidia/lib64/stubs? or potentially someone's conda env or virtualenv
That executables from Nixpkgs use their own ld.so which ignores /usr stuff
23:43:42
@ss:someonex.netSomeoneSerge (back on matrix)
In reply to @sielicki:matrix.org
as long as i'm stuck on my computer on a saturday, qq: is there a strong reason to hold cudart at 12.4 in master, or just nobody raised a PR for it?
* Yes it's just that it's toil (and review roundtrip times aren't helping)
23:45:28
@sielicki:matrix.orgsielicki let me know if I can pick up any slack or what you guys need 23:46:03
@ss:someonex.netSomeoneSerge (back on matrix) There's lots and the linked PR is one candidate 😍 23:46:50
@sielicki:matrix.orgsielicki * I just raised an issue earlier today about some of the driver hashes missing for some of the releases, it feels to me like we really need a solid cuda json scraper to prefetch script 23:47:33
@ss:someonex.netSomeoneSerge (back on matrix)
In reply to @ss:someonex.net
Nvidia ships stubs without the .1s which is why we have https://github.com/NixOS/nixpkgs/blob/a8ffc2295c358629bc1bda569bf8b3bbb21aa1be/pkgs/development/cuda-modules/cuda/overrides.nix#L124-L129
I'm wondering what we can do to remove this hack
23:53:53
