!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

290 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda57 Servers

Load older messages


SenderMessageTime
1 Aug 2024
@ss:someonex.netSomeoneSerge (back on matrix)So there's something else propagating an unwrapped (differently wrapped) gcc12 maybe15:23:40
@yorickvp:matrix.orgyorickvphow can I list all propagated inputs?15:24:08
@ss:someonex.netSomeoneSerge (back on matrix)all propagated inputs of15:24:30
@yorickvp:matrix.orgyorickvp I'm in a nix develop for the drv that produces the libraries with the wrong rpath 15:25:24
@ss:someonex.netSomeoneSerge (back on matrix) H'mm, maybe you can echo "${pkgsBuildHost[@]}" for compilers/build tools 15:26:51
@ss:someonex.netSomeoneSerge (back on matrix)But that won't tell you where it's coming from15:27:09
@ss:someonex.netSomeoneSerge (back on matrix)Just do a nix-tree --derivation or path-info why-depends15:27:31
@yorickvp:matrix.orgyorickvpseems like there's no unwrapped gcc15:48:59
@yorickvp:matrix.orgyorickvplibtorch_cuda.so also manages to link it15:49:53
@yorickvp:matrix.orgyorickvphttps://gist.github.com/yorickvP/b263b9d6d058280a3f7d4c70eff2a758 /nix/store/mbg29pcjydgss24z0v6jczjda7q4z9x6-gcc-12.3.0.drv (the offending gcc lib) only occurs as a dependency of the gcc-wrapper that has the correct lib first15:54:09
@yorickvp:matrix.orgyorickvpI'll try to repro with torch on nixos-unstable15:57:23
@yorickvp:matrix.orgyorickvpyeah, ${python3.pkgs.torchWithCuda.lib}/lib/libtorch_cuda.so links to gcc-12.4.0-lib16:16:14
@ss:someonex.netSomeoneSerge (back on matrix)Wow16:40:20
@ss:someonex.netSomeoneSerge (back on matrix)This looks like a regression16:40:27
@ss:someonex.netSomeoneSerge (back on matrix) Well the first obvious leak (the one we see in the wrapper) is https://github.com/NixOS/nixpkgs/blob/fc27807b85986bb26a8f28e590e01fae742e6b53/pkgs/build-support/cc-wrapper/default.nix#L596-L606 16:53:54
@ss:someonex.netSomeoneSerge (back on matrix) Notably, cudaPackages.saxpy works fine at that commit 16:54:12
@ss:someonex.netSomeoneSerge (back on matrix) I'm running github:NixOS/nixpkgs/c66e984bda09e7230ea7b364e677c5ba4f0d36d0#opencv4.tests.no-libstdcxx-errors now (only defined for cudaSupport = true) 16:54:41
@ss:someonex.netSomeoneSerge (back on matrix)Going to take a while16:54:45
@ss:someonex.netSomeoneSerge (back on matrix)But it might be the regression is somehow magically torch specific16:54:59
@ss:someonex.netSomeoneSerge (back on matrix) No idea why https://github.com/NixOS/nixpkgs/blame/fc27807b85986bb26a8f28e590e01fae742e6b53/pkgs/build-support/cc-wrapper/default.nix#L605-L606 uses cc_solib honestly 16:55:53
@yorickvp:matrix.orgyorickvpyou know, I blame cmake17:00:59
@yorickvp:matrix.orgyorickvp * you know, I blame cmake :)17:01:03
@yorickvp:matrix.orgyorickvplooking at 36 megabytes of cmake logs, it obviously parses it out of some gcc output (together with the correct one, which it puts first in the path). I'm not sure what it does with it after17:02:50
@ss:someonex.netSomeoneSerge (back on matrix)Waiting for opencv, but so far I'm leaning towards "maybe pytorch devs replaced some of the cmake logic with an unnecessary gcc -print-search-paths"17:06:46
@yorickvp:matrix.orgyorickvpI'm looking at https://github.com/Kitware/CMake/blob/master/Modules/CMakeParseImplicitLinkInfo.cmake17:08:01
@ss:someonex.netSomeoneSerge (back on matrix)saxpy and opencv are built using cmake too17:08:34
@ss:someonex.netSomeoneSerge (back on matrix)At least one of them has been shown to still work (whatever the cost)17:08:56
@ss:someonex.netSomeoneSerge (back on matrix)
gy skimage.transform skimage.util skimage.segmentation
python3-3.11.9-env> building '/nix/store/4rqjcjk4h2mnfwsbvcgf3igjnmpxhxwf-python3-3.11.9-env.drv'
python3-3.11.9-env> created 521 symlinks in user environment
opencv-4.9.0-libstdcxx-test> building '/nix/store/2gh11xabzlxbfgvydhcln0qbfiharw32-opencv-4.9.0-libstdcxx-test.drv'
┏━ Dependency Graph:
┃             ┌─ ✔ opencv-4.9.0 ⏱ 17m40s
┃          ┌─ ✔ python3.11-pillow-heif-0.16.0 ⏱ 2m0s
┃       ┌─ ✔ python3.11-imageio-2.34.2 ⏱ 11s
┃    ┌─ ✔ python3.11-scikit-image-0.22.0 ⏱ 1m37s
┃ ┌─ ✔ python3-3.11.9-env ⏱ 1s
┃ ✔ opencv-4.9.0-libstdcxx-test 
┣━━━ Builds         
┗━ ∑ ⏵ 0 │ ✔ 6 │ ⏸ 0 │ Finished at 17:11:37 after 21m35s
17:12:13
@ss:someonex.netSomeoneSerge (back on matrix)So ugh at least opencv4's python extension must be linking the right libstdc++17:13:11
@ss:someonex.netSomeoneSerge (back on matrix) Hmm the last torch update was almost two months ago https://github.com/NixOS/nixpkgs/pull/317576 17:14:41

Show newer messages


Back to Room ListRoom Version: 9