NixOS CUDA - Public Room Timeline

	NixOS CUDA	290 Members
	CUDA packages maintenance and support in nixpkgs \| https://github.com/orgs/NixOS/projects/27/ \| https://nixos.org/manual/nixpkgs/unstable/#cuda	57 Servers

Load older messages

Sender	Message	Time
29 Nov 2022
Samuel Ainsworth	yup, you're totally right: https://github.com/NixOS/nixpkgs/pull/200435	23:51:40
Samuel Ainsworth	i'm really confused why we're seeing these errors... they seem to indicate that the directory structure changed between 11.7 -> 11.8	23:52:23
1 Dec 2022
@box1:matrix.org	I'm trying to package `dgl-cu116`(`dgl` with cuda support) and it fails to find rpath for `libtorch_cuda_cpp.s`o and `libtorch_cuda_cu.so.` After some searching, those files are generated under `torch` when it is built with `BUILD_SPLIT_CUDA=1` or `BUILD_SPLIT_CUDA=1`. (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). This link says that `BUILD_SPLIT_CUDA` is not default because there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues. Currently, `[torch](https://github.com/NixOS/nixpkgs/blob/nixos-22.11/pkgs/development/python-modules/torch/default.nix)` doesn't have an option to it. Maybe an option like `mklDnnSupport` so that it can be turned on for packages like `dgl-cuda116` that needs those files would be great. Any thought on this?	11:44:06
@box1:matrix.org	* I'm trying to package `dgl-cu116`(`dgl` with cuda support) and it fails to find rpath for `libtorch_cuda_cpp.so` and `libtorch_cuda_cu.so.` After some searching, those files are generated under `torch` when it is built with `BUILD_SPLIT_CUDA=1` or `BUILD_SPLIT_CUDA=1`. (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). This link says that `BUILD_SPLIT_CUDA` is not default because there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues. Currently, `[torch](https://github.com/NixOS/nixpkgs/blob/nixos-22.11/pkgs/development/python-modules/torch/default.nix)` doesn't have an option to it. Maybe an option like `mklDnnSupport` so that it can be turned on for packages like `dgl-cuda116` that needs those files would be great. Any thought on this?	11:44:26
@box1:matrix.org	* I'm trying to package `dgl-cu116`(`dgl` with cuda support) and it fails to find rpath for `libtorch_cuda_cpp.so` and `libtorch_cuda_cu.so.` After some searching, those files are generated under `torch` when it is built with `BUILD_SPLIT_CUDA=ON` or `BUILD_SPLIT_CUDA=1`. (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). This link says that `BUILD_SPLIT_CUDA` is not default because there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues. Currently, `[torch](https://github.com/NixOS/nixpkgs/blob/nixos-22.11/pkgs/development/python-modules/torch/default.nix)` doesn't have an option to it. Maybe an option like `mklDnnSupport` so that it can be turned on for packages like `dgl-cuda116` that needs those files would be great. Any thought on this?	11:44:48
@box1:matrix.org	* I'm trying to package `dgl-cu116`(`dgl` with cuda support) and it fails to find rpath for `libtorch_cuda_cpp.so` and `libtorch_cuda_cu.so.` After some searching, those files are generated under `torch` when it is built with `BUILD_SPLIT_CUDA=ON` or `BUILD_SPLIT_CUDA=1`. (https://discuss.pytorch.org/t/no-libtorch-cuda-cpp-so-available-when-build-pytorch-from-source/159864). This link says that `BUILD_SPLIT_CUDA` is not default because there may be other side effects (like increased binary size) that users might not be expecting, and it's only when we are compiling for many architectures where we run into these linker issues. Currently, `torch` doesn't have an option to it. Maybe an option like `mklDnnSupport` so that it can be turned on for packages like `dgl-cuda116` that needs those files would be great. Any thought on this?	11:45:26
	hexa changed their display name from hexa to hexa (22.11 now).	13:09:03
	hexa changed their display name from hexa (22.11 now) to hexa.	14:38:53
danielrf	Hi, I have some recent work that might be of interest to the Nix CUDA community: jetpack-nixos (https://github.com/anduril/jetpack-nixos) See also this announcement post on the discourse: https://discourse.nixos.org/t/jetpack-nixos-nixos-module-for-nvidia-jetson-devices/23632	19:50:11
danielrf	The CUDA version included with jetpack is apparently not the same as just the aarch64 CUDA for servers, but I've tried to repackage the debs from NVIDIA in a way similar to cudaPackages in nixpkgs: https://github.com/anduril/jetpack-nixos/blob/master/cuda-packages.nix	19:50:23
2 Dec 2022
Samuel Ainsworth	yeah IIUC the best way would be to add an option to the pytorch derivation to enable `BUILD_SPLIT_CUDA`. Shouldn't be too hard to accomplish 🤞	01:29:56
4 Dec 2022
	CRTified (old handle) changed their display name from CRTified to CRTified (old handle).	14:19:44
6 Dec 2022
	CRTified (old handle) changed their profile picture.	14:11:02
	connor (he/him) left the room.	15:17:44
12 Dec 2022
SomeoneSerge (back on matrix)	Just noticed that the GH action that triggers the Hercules rebuilds has been broken for 20 days :-( Updated it, but this only shows that it's something to be watched	14:06:09
15 Dec 2022
hexa	wondering if anyone here is motivated to get torchaudio source builds into nixpkgs	00:56:52
hexa	I've gotten pretty far, but I currently fail at linking libsox	00:57:17
hexa	https://github.com/NixOS/nixpkgs/pull/203136	00:57:18
21 Dec 2022
	jdaly joined the room.	17:33:43
jdaly	Our project has a dependency on the MS Onnxruntime w/CUDA support. At present the, MS OnnxRuntime 1.13 release requires Cuda 11.6 and cuDNN 8.2.4, however, search the NixOS repo, we find 'pkgs.cudaPackages_11_6.cudatoolkit' but various other versions of cuDNN (i.e. 8.3.2) -- but no 8.2.4 package. A few weeks back, I went looking for help on NixOS Discourse (See https://discourse.nixos.org/t/how-to-install-a-specific-version-of-cuda-and-cudnn/21725) -- where SergeK kindly pointed me to this forum. We're complete newbies when it comes to NixOS, so we're looking for some help for get the cuDNN 8.2.4 package added -- or maybe its already there but we just haven't unearthed it yet! Anyway, any tips or suggestions to help us get this figured out would be greatly appreciated. Thx!	17:45:23
jdaly	* Our project has a dependency on the MS Onnxruntime w/CUDA support. At present the, MS OnnxRuntime 1.13 release requires Cuda 11.6 and cuDNN 8.2.4, however, searching the NixOS repo, we find the 'pkgs.cudaPackages_11_6.cudatoolkit' and various other versions of cuDNN (e.g.. 8.3.2) -- but no 8.2.4 package. A few weeks back, I went looking for help on NixOS Discourse (See https://discourse.nixos.org/t/how-to-install-a-specific-version-of-cuda-and-cudnn/21725) -- where SergeK kindly pointed me to this forum. We're complete newbies when it comes to NixOS, so we're looking for some help for get the cuDNN 8.2.4 package added -- or maybe its already there but we just haven't unearthed it yet! Anyway, any tips or suggestions to help us get this figured out would be greatly appreciated. Thx!	17:53:59
tpw_rules	can you build stuff using a checkout of nixpkgs?	23:25:22
22 Dec 2022
	Shiroki Satsuki joined the room.	17:59:25
Shiroki Satsuki	Hello, I'm using `shell.nix` with Arch Linux, and I'm trying to use `python310Packages.pycuda` in my Python project, but when I `import pycuda.driver`, Python complained about missing of `libcuda.so.1`	18:04:05
Shiroki Satsuki	Is this package only available in NixOS? Or do I need some setup to link my host `libcuda.so.1` into `shell.nix`?	18:05:57
Shiroki Satsuki		18:06:21
Shiroki Satsuki	* > ldd /nix/store/x8gqndanr39r0zf2465xad9z0pkg0y39-python3-3.10.9-env/lib/python3.10/site-packages/pycuda/_driver.cpython-310-x86_64-linux-gnu.so linux-vdso.so.1 (0x00007ffff3797000) libboost_python310.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_python310.so.1.79.0 (0x00007fe813a75000) libboost_thread.so.1.79.0 => /nix/store/r7gb9lm3bisvj7njwb73acawvmkqr3sw-boost-1.79.0/lib/libboost_thread.so.1.79.0 (0x00007fe813a58000) libcuda.so.1 => not found libcurand.so.10 => /nix/store/rp33ncna7sm3inj3fjlrracg27sl78x3-cudatoolkit-11.7.0/lib/libcurand.so.10 (0x00007fe80de00000) libstdc++.so.6 => /nix/store/wprxx5zkkk13hpj6k1v6qadjylh3vq9m-gcc-11.3.0-lib/lib/libstdc++.so.6 (0x00007fe80da00000) libm.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libm.so.6 (0x00007fe80dd20000) libgcc_s.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libgcc_s.so.1 (0x00007fe813a3c000) libc.so.6 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libc.so.6 (0x00007fe80d600000) librt.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/librt.so.1 (0x00007fe813a37000) libdl.so.2 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libdl.so.2 (0x00007fe813a30000) libpthread.so.0 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libpthread.so.0 (0x00007fe813a2b000) libutil.so.1 => /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib/libutil.so.1 (0x00007fe813a26000) /nix/store/ayfr5l52xkqqjn3n4h9jfacgnchz1z7s-glibc-2.35-224/lib64/ld-linux-x86-64.so.2 (0x00007fe813cad000)	18:06:29
tpw_rules	if you are not using nixos you need nixGL: https://github.com/guibou/nixGL for cuda to work	18:11:11
tpw_rules	(and other graphics stuff)	18:11:24
Shiroki Satsuki	Thanks, I only know that `nixGL` is needed for OpenGL and Vulkan	18:13:42

Show newer messages

Back to Room ListRoom Version: 9