| 31 Oct 2024 |
Gaétan Lepage | * If you happen to just wake of from 6 months spent in coma, don't worry, the triton repo still hasn't got any new tag. | 22:45:22 |
| 1 Nov 2024 |
SomeoneSerge (back on matrix) | I don't think anything short of a github action publishing automatic tags in release branches is going to work with this organization:) | 07:40:32 |
SomeoneSerge (back on matrix) | That might not either but stands a chance (they merged stuff before) | 07:40:56 |
| Guillaume Desforges changed their profile picture. | 21:19:14 |
| 2 Nov 2024 |
sielicki | I do not really follow the conversation on https://github.com/NixOS/nixpkgs/pull/343706 | 23:22:57 |
sielicki | I don't think it actually matters in this case but just want to call it out if it ever comes up again in the context of nixpkgs: to support the cuda stubs (where you can successfully link a binary against the cuda driver api with the cuda toolkit alone, despite that it doesn't include the full UMD blob), on normal distributions nvidia creates an /usr/local/cuda/lib64/stubs/libcuda.so files, which claim a SONAME with .1, but where the file on disk doesn't actually have that. That means the dynamic loader will never accidentally confuse the stub with the real library. | 23:23:59 |
SomeoneSerge (back on matrix) | Huh | 23:25:10 |
sielicki | but now, for some of the libraries that got relinked here, I think now there's two files that have the same name, and im not sure what the dynamic loader does | 23:25:29 |
sielicki | * but now, for some of the libraries that got relinked here, I think now there's two files that have the same name, and im not sure what the dynamic loader does
edit: maybe not under nixos but potentially for people doing nix on ubuntu or whatever | 23:25:52 |
sielicki | here let me find you a link, one sec | 23:26:10 |
sielicki | https://discourse.cmake.org/t/linking-against-cuda-cuda-driver-not-working-right-with-libcuda-stub-wants-libcuda-so-1-not-libcuda-so/7455/13 | 23:26:38 |
sielicki | this is his first and only post on the cmake forums and it's <3'd by rob maynard who works at nvidia | 23:27:24 |
sielicki | i just kinda don't get the motivation, not really clear why the FHS steam shrinking needed this | 23:33:25 |
sielicki | but who knows | 23:33:40 |
sielicki | as long as i'm stuck on my computer on a saturday, qq: is there a strong reason to hold cudart at 12.4 in master, or just nobody raised a PR for it? | 23:39:13 |
sielicki | https://github.com/NixOS/nixpkgs/pull/322075 looks like he just never got back to handling review comments :\ | 23:40:51 |
SomeoneSerge (back on matrix) | ❯ readelf -d result-stubs/lib/stubs/libcuda.so
...
0x000000000000000e (SONAME) Library soname: [libcuda.so.1]
...
❯ readelf -d /run/opengl-driver/lib/libcuda.so
...
0x000000000000000e (SONAME) Library soname: [libcuda.so.1]
| 23:40:52 |
SomeoneSerge (back on matrix) | I think the last comment in that discourse might be misleading | 23:41:13 |
sielicki | maybe I did a poor job of summarizing it here in riot but that's what I expected to see | 23:42:13 |
SomeoneSerge (back on matrix) | Nvidia ships stubs without the .1s which is why we have https://github.com/NixOS/nixpkgs/blob/a8ffc2295c358629bc1bda569bf8b3bbb21aa1be/pkgs/development/cuda-modules/cuda/overrides.nix#L124-L129 | 23:42:54 |
sielicki | The problem I'm wondering about is what actually enforces that ld.so prefers /run/opengl-driver/lib to /usr/local/nvidia/lib64/stubs? or potentially someone's conda env or virtualenv | 23:42:59 |
sielicki | with RPATH'ing all the things, it's probably fine | 23:43:31 |
SomeoneSerge (back on matrix) | In reply to @sielicki:matrix.org The problem I'm wondering about is what actually enforces that ld.so prefers /run/opengl-driver/lib to /usr/local/nvidia/lib64/stubs? or potentially someone's conda env or virtualenv That executables from Nixpkgs use their own ld.so which ignores /usr stuff | 23:43:42 |
SomeoneSerge (back on matrix) | In reply to @sielicki:matrix.org as long as i'm stuck on my computer on a saturday, qq: is there a strong reason to hold cudart at 12.4 in master, or just nobody raised a PR for it? Yes it's just that it's toil | 23:44:40 |
SomeoneSerge (back on matrix) | In reply to @sielicki:matrix.org as long as i'm stuck on my computer on a saturday, qq: is there a strong reason to hold cudart at 12.4 in master, or just nobody raised a PR for it? * Yes it's just that it's toil (and review roundtrip times aren't helping) | 23:45:28 |
sielicki | let me know if I can pick up any slack or what you guys need | 23:46:03 |
SomeoneSerge (back on matrix) | There's lots and the linked PR is one candidate 😍 | 23:46:50 |
sielicki | I just raised an issue earlier today about some of the driver hashes missing for some of the releases, it feels to me like we really need a solid cuda json scraper to prefetch thing | 23:47:14 |
sielicki | * I just raised an issue earlier today about some of the driver hashes missing for some of the releases, it feels to me like we really need a solid cuda json scraper to prefetch script | 23:47:33 |
SomeoneSerge (back on matrix) | In reply to @ss:someonex.net Nvidia ships stubs without the .1s which is why we have https://github.com/NixOS/nixpkgs/blob/a8ffc2295c358629bc1bda569bf8b3bbb21aa1be/pkgs/development/cuda-modules/cuda/overrides.nix#L124-L129 I'm wondering what can we do to remove this hack | 23:53:53 |