| 29 Nov 2022 |
Samuel Ainsworth | yup, you're totally right: https://github.com/NixOS/nixpkgs/pull/200435 | 23:51:40 |
Samuel Ainsworth | i'm really confused why we're seeing these errors... they seem to indicate that the directory structure changed between 11.7 -> 11.8 | 23:52:23 |
| 1 Dec 2022 |
@box1:matrix.org | I'm trying to package dgl-cu116 (dgl with CUDA support), and it fails to find an rpath for libtorch_cuda_cpp.so and libtorch_cuda_cu.so.
After some searching, it turns out those files are only generated when torch is built with BUILD_SPLIT_CUDA=ON or BUILD_SPLIT_CUDA=1 (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). That thread says BUILD_SPLIT_CUDA is not the default because
"there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues."
Currently, [torch](https://github.com/NixOS/nixpkgs/blob/nixos-22.11/pkgs/development/python-modules/torch/default.nix) doesn't have an option for it. It would be great to have an option like mklDnnSupport, so that it can be turned on for packages like dgl-cu116 that need those files. Any thoughts on this?
| 11:44:06 |
danielrf | Hi, I have some recent work that might be of interest to the Nix CUDA community: jetpack-nixos (https://github.com/anduril/jetpack-nixos)
See also this announcement post on the discourse: https://discourse.nixos.org/t/jetpack-nixos-nixos-module-for-nvidia-jetson-devices/23632 | 19:50:11 |
danielrf |
The CUDA version included with jetpack is apparently not the same as just the aarch64 CUDA for servers, but I've tried to repackage the debs from NVIDIA in a way similar to cudaPackages in nixpkgs: https://github.com/anduril/jetpack-nixos/blob/master/cuda-packages.nix | 19:50:23 |
| 2 Dec 2022 |
Samuel Ainsworth | yeah IIUC the best way would be to add an option to the pytorch derivation to enable BUILD_SPLIT_CUDA. Shouldn't be too hard to accomplish 🤞 | 01:29:56 |
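Until such an option exists in the torch derivation, the idea can be tried locally with an overlay. The following is a minimal, untested sketch: the overlay and the file name `overlay.nix` are hypothetical and not part of nixpkgs, and it assumes that PyTorch's build picks up `BUILD_SPLIT_CUDA` from the environment, as the linked thread suggests, in the same way the derivation already passes its other `USE_*` settings.

```nix
# overlay.nix: hypothetical local overlay, not something that exists in nixpkgs.
final: prev: {
  python3 = prev.python3.override {
    packageOverrides = pyFinal: pyPrev: {
      # Start from a CUDA-enabled torch, then add one extra build variable.
      torch = (pyPrev.torch.override { cudaSupport = true; }).overridePythonAttrs (old: {
        # Assumption: the PyTorch build reads BUILD_SPLIT_CUDA from the
        # environment and then emits libtorch_cuda_cpp.so and libtorch_cuda_cu.so.
        BUILD_SPLIT_CUDA = "ON";
      });
    };
  };
}
```

Changing this variable forces a full rebuild of torch, and the split build may increase binary size, which is the side effect the quoted thread warns about.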
| 6 Dec 2022 |
| connor (he/him) left the room. | 15:17:44 |
| 12 Dec 2022 |
SomeoneSerge (back on matrix) | Just noticed that the GH action that triggers the Hercules rebuilds has been broken for 20 days :-( Updated it, but this only shows that it's something to be watched | 14:06:09 |
| 15 Dec 2022 |
hexa | wondering if anyone here is motivated to get torchaudio source builds into nixpkgs | 00:56:52 |
hexa | I've gotten pretty far, but I currently fail at linking libsox | 00:57:17 |
hexa | https://github.com/NixOS/nixpkgs/pull/203136 | 00:57:18 |
| 21 Dec 2022 |
| jdaly joined the room. | 17:33:43 |
jdaly | Our project depends on the MS ONNX Runtime with CUDA support. At present, the MS ONNX Runtime 1.13 release requires CUDA 11.6 and cuDNN 8.2.4. However, searching the NixOS repo, we find 'pkgs.cudaPackages_11_6.cudatoolkit' and various other versions of cuDNN (e.g. 8.3.2), but no 8.2.4 package.
A few weeks back, I went looking for help on the NixOS Discourse (see https://discourse.nixos.org/t/how-to-install-a-specific-version-of-cuda-and-cudnn/21725), where SergeK kindly pointed me to this forum.
We're complete newbies when it comes to NixOS, so we're looking for some help to get the cuDNN 8.2.4 package added, or maybe it's already there and we just haven't unearthed it yet!
Anyway, any tips or suggestions to help us get this figured out would be greatly appreciated. Thx! | 17:45:23 |
tpw_rules | can you build stuff using a checkout of nixpkgs? | 23:25:22 |
| 22 Dec 2022 |
| Shiroki Satsuki joined the room. | 17:59:25 |
Shiroki Satsuki | Hello, I'm using shell.nix on Arch Linux, and I'm trying to use python310Packages.pycuda in my Python project, but when I import pycuda.driver, Python complains that libcuda.so.1 is missing | 18:04:05 |
Shiroki Satsuki | Is this package only available in NixOS? Or do I need some setup to link my host libcuda.so.1 into shell.nix? | 18:05:57 |
Shiroki Satsuki | > ldd /nix/store/x8gqndanr39r0zf2465xad9z0pkg0y39-python3-3.10.9-env/lib/python3.10/site-packages/pycuda/_driver.cpython-310-x86_64-linux-gnu.so
linux-vdso.so.1 (0x00007ffff3797000)
libboost_python310.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_python310.so.1.79.0 (0x00007fe813a75000)
libboost_thread.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_thread.so.1.79.0 (0x00007fe813a58000)
libcuda.so.1 => not found
libcurand.so.10 => /nix/store/rp33ncna7sm3inj3fjlrracg27sl78x3-cudatoolkit-11.7.0/lib/libcurand.so.10 (0x00007fe80de00000)
libstdc++.so.6 => /nix/store/wprxx5zkkk13hpj6k1v6qadjylh3vq9m-gcc-11.3.0-lib/lib/libstdc++.so.6 (0x00007fe80da00000)
libm.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libm.so.6 (0x00007fe80dd20000)
libgcc_s.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libgcc_s.so.1 (0x00007fe813a3c000)
libc.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libc.so.6 (0x00007fe80d600000)
librt.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/librt.so.1 (0x00007fe813a37000)
libdl.so.2 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libdl.so.2 (0x00007fe813a30000)
libpthread.so.0 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libpthread.so.0 (0x00007fe813a2b000)
libutil.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libutil.so.1 (0x00007fe813a26000)
/nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib64/ld-linux-x86-64.so.2 (0x00007fe813cad000)
| 18:06:29 |
tpw_rules | if you are not using nixos you need nixGL: https://github.com/guibou/nixGL for cuda to work | 18:11:11 |
tpw_rules | (and other graphics stuff) | 18:11:24 |
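For a non-NixOS host like the Arch setup above, one way to wire nixGL into the shell is sketched below. It is untested and rests on several assumptions: that nixGL is imported straight from its main branch (pin a commit in practice), that its `default.nix` accepts a `pkgs` argument, that the `auto.nixGLDefault` attribute from the nixGL README is available, and that unfree packages have to be allowed because pycuda pulls in cudatoolkit.

```nix
# shell.nix: hypothetical sketch for a non-NixOS host; adjust to your setup.
{ pkgs ? import <nixpkgs> { config.allowUnfree = true; } }:
let
  # Assumption: unpinned fetch of nixGL's main branch, for illustration only.
  nixGL = import (fetchTarball "https://github.com/guibou/nixGL/archive/main.tar.gz") {
    inherit pkgs;
  };
in
pkgs.mkShell {
  packages = [
    # Python environment containing pycuda, as in the question above.
    (pkgs.python310.withPackages (ps: [ ps.pycuda ]))
    # Wrapper that exposes the host GPU driver libraries (including
    # libcuda.so.1) to programs started through it.
    nixGL.auto.nixGLDefault
  ];
}
```

Inside `nix-shell`, the interpreter would then be started through the wrapper, for example `nixGL python -c "import pycuda.driver"`. The auto-detection reads the host driver version, so it only works in an impure evaluation such as a plain `nix-shell`.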
Shiroki Satsuki | Thanks, I only knew that nixGL was needed for OpenGL and Vulkan | 18:13:42 |