!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

290 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
57 Servers



Sender Message Time
29 Nov 2022
@skainswo:matrix.org Samuel Ainsworth yup, you're totally right: https://github.com/NixOS/nixpkgs/pull/200435 23:51:40
@skainswo:matrix.org Samuel Ainsworth i'm really confused why we're seeing these errors... they seem to indicate that the directory structure changed between 11.7 -> 11.8 23:52:23
1 Dec 2022
@box1:matrix.org@box1:matrix.org *

I'm trying to package dgl-cu116 (dgl with CUDA support), and it fails to find an rpath for libtorch_cuda_cpp.so and libtorch_cuda_cu.so.

After some searching, I found that those files are generated by torch when it is built with BUILD_SPLIT_CUDA=ON or BUILD_SPLIT_CUDA=1 (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). That thread says BUILD_SPLIT_CUDA is not the default because:

"there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues."

Currently, torch doesn't have an option for it. Maybe an option like mklDnnSupport would be great, so that it can be turned on for packages like dgl-cu116 that need those files. Any thoughts on this?

11:45:26
@hexa:lossy.network hexa changed their display name from hexa to hexa (22.11 now). 13:09:03
@hexa:lossy.network hexa changed their display name from hexa (22.11 now) to hexa. 14:38:53
@danielrf:matrix.org danielrf Hi, I have some recent work that might be of interest to the Nix CUDA community: jetpack-nixos (https://github.com/anduril/jetpack-nixos). See also this announcement post on the discourse: https://discourse.nixos.org/t/jetpack-nixos-nixos-module-for-nvidia-jetson-devices/23632 19:50:11
@danielrf:matrix.org danielrf The CUDA version included with jetpack is apparently not the same as just the aarch64 CUDA for servers, but I've tried to repackage the debs from NVIDIA in a way similar to cudaPackages in nixpkgs: https://github.com/anduril/jetpack-nixos/blob/master/cuda-packages.nix 19:50:23
2 Dec 2022
@skainswo:matrix.orgSamuel Ainsworth yeah IIUC the best way would be to add an option to the pytorch derivation to enable BUILD_SPLIT_CUDA. Shouldn't be too hard to accomplish 🤞 01:29:56
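Until such an option exists, an overlay along these lines might work. It is an untested sketch that assumes the nixpkgs torch derivation accepts a cudaSupport argument and that PyTorch's build forwards BUILD_* environment variables to CMake. The file name overlay.nix is made up for illustration, and whether the packaged PyTorch version honours BUILD_SPLIT_CUDA should be checked.

```nix
# overlay.nix: hypothetical overlay, not the option proposed for nixpkgs itself
final: prev: {
  python3 = prev.python3.override {
    packageOverrides = pyFinal: pyPrev: {
      torch = (pyPrev.torch.override { cudaSupport = true; }).overrideAttrs (old: {
        # Plain string attributes are exported as environment variables
        # in the build, so this sets BUILD_SPLIT_CUDA=ON for the torch build.
        BUILD_SPLIT_CUDA = "ON";
      });
    };
  };
}
```

It could be loaded with something like import <nixpkgs> { overlays = [ (import ./overlay.nix) ]; config.allowUnfree = true; }, after which python3Packages.torch would be rebuilt from source with the flag set.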
4 Dec 2022
@schnecfk:ruhr-uni-bochum.de CRTified (old handle) changed their display name from CRTified to CRTified (old handle). 14:19:44
6 Dec 2022
@schnecfk:ruhr-uni-bochum.de CRTified (old handle) changed their profile picture. 14:11:02
@connorbaker:matrix.org connor (he/him) left the room. 15:17:44
12 Dec 2022
@ss:someonex.net SomeoneSerge (back on matrix) Just noticed that the GH action that triggers the Hercules rebuilds has been broken for 20 days :-( Updated it, but this only shows that it's something to be watched 14:06:09
15 Dec 2022
@hexa:lossy.network hexa wondering if anyone here is motivated to get torchaudio source builds into nixpkgs 00:56:52
@hexa:lossy.network hexa I've gotten pretty far, but I currently fail at linking libsox 00:57:17
@hexa:lossy.network hexa https://github.com/NixOS/nixpkgs/pull/203136 00:57:18
21 Dec 2022
@jdaly:matrix.org jdaly joined the room. 17:33:43
@jdaly:matrix.org jdaly * Our project has a dependency on the MS ONNX Runtime with CUDA support. At present, the MS ONNX Runtime 1.13 release requires CUDA 11.6 and cuDNN 8.2.4. However, searching the NixOS repo, we find 'pkgs.cudaPackages_11_6.cudatoolkit' and various other versions of cuDNN (e.g. 8.3.2), but no 8.2.4 package. A few weeks back, I went looking for help on NixOS Discourse (see https://discourse.nixos.org/t/how-to-install-a-specific-version-of-cuda-and-cudnn/21725), where SergeK kindly pointed me to this forum. We're complete newbies when it comes to NixOS, so we're looking for some help getting the cuDNN 8.2.4 package added, or maybe it's already there and we just haven't unearthed it yet! Anyway, any tips or suggestions to help us get this figured out would be greatly appreciated. Thx! 17:53:59
@tpw_rules:matrix.org tpw_rules can you build stuff using a checkout of nixpkgs? 23:25:22
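For reference, building against a local checkout can be as small as the shell.nix sketched below. It assumes the checkout lives in ./nixpkgs next to the file (a made-up path) and that unfree packages are acceptable. The cudnn attribute resolves to whatever version that package set selects by default, which is not necessarily 8.2.4.

```nix
# shell.nix: minimal sketch using a local nixpkgs checkout (path is hypothetical)
let
  pkgs = import ./nixpkgs {
    # CUDA and cuDNN are unfree, so they must be allowed explicitly.
    config.allowUnfree = true;
  };
  cudaPackages = pkgs.cudaPackages_11_6;
in
pkgs.mkShell {
  packages = [
    cudaPackages.cudatoolkit
    cudaPackages.cudnn
  ];
}
```

With a checkout in place, a new cuDNN release can be added to the checkout itself and tried out through nix-shell before opening a pull request.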
22 Dec 2022
@shiroki:matrix.org Shiroki Satsuki joined the room. 17:59:25
@shiroki:matrix.org Shiroki Satsuki Hello, I'm using shell.nix on Arch Linux, and I'm trying to use python310Packages.pycuda in my Python project, but when I import pycuda.driver, Python complains that libcuda.so.1 is missing 18:04:05
@shiroki:matrix.org Shiroki Satsuki Is this package only usable on NixOS? Or do I need some setup to link my host libcuda.so.1 into shell.nix? 18:05:57
@shiroki:matrix.orgShiroki Satsuki *
> ldd /nix/store/x8gqndanr39r0zf2465xad9z0pkg0y39-python3-3.10.9-env/lib/python3.10/site-packages/pycuda/_driver.cpython-310-x86_64-linux-gnu.so
        linux-vdso.so.1 (0x00007ffff3797000)
        libboost_python310.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_python310.so.1.79.0 (0x00007fe813a75000)
        libboost_thread.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_thread.so.1.79.0 (0x00007fe813a58000)
        libcuda.so.1 => not found
        libcurand.so.10 => /nix/store/rp33ncna7sm3inj3fjlrracg27sl78x3-cudatoolkit-11.7.0/lib/libcurand.so.10 (0x00007fe80de00000)
        libstdc++.so.6 => /nix/store/wprxx5zkkk13hpj6k1v6qadjylh3vq9m-gcc-11.3.0-lib/lib/libstdc++.so.6 (0x00007fe80da00000)
        libm.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libm.so.6 (0x00007fe80dd20000)
        libgcc_s.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libgcc_s.so.1 (0x00007fe813a3c000)
        libc.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libc.so.6 (0x00007fe80d600000)
        librt.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/librt.so.1 (0x00007fe813a37000)
        libdl.so.2 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libdl.so.2 (0x00007fe813a30000)
        libpthread.so.0 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libpthread.so.0 (0x00007fe813a2b000)
        libutil.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libutil.so.1 (0x00007fe813a26000)
        /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib64/ld-linux-x86-64.so.2 (0x00007fe813cad000)
18:06:29
@tpw_rules:matrix.org tpw_rules if you are not using nixos you need nixGL: https://github.com/guibou/nixGL for cuda to work 18:11:11
@tpw_rules:matrix.org tpw_rules (and other graphics stuff) 18:11:24
@shiroki:matrix.org Shiroki Satsuki Thanks, I only knew that nixGL was needed for OpenGL and Vulkan 18:13:42
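A minimal shell.nix for the setup described above might look like the sketch below, assuming a channel-provided <nixpkgs> and that unfree packages are allowed. It only provides Python with pycuda. libcuda.so.1 belongs to the host's NVIDIA driver rather than to the CUDA toolkit, so it still has to come from outside the shell, for example via a nixGL wrapper.

```nix
# shell.nix: sketch of a pycuda environment (not the poster's actual file)
{ pkgs ? import <nixpkgs> { config.allowUnfree = true; } }:

pkgs.mkShell {
  packages = [
    # Python 3.10 interpreter with pycuda on its module path
    (pkgs.python310.withPackages (ps: [ ps.pycuda ]))
  ];
}
```

Inside the shell, the interpreter would then be started through the nixGL wrapper, with something like nixGL python -c 'import pycuda.driver'. The exact wrapper name depends on how nixGL was installed and on the driver in use.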
