!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

337 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda64 Servers

Load older messages


SenderMessageTime
6 Mar 2023
@ss:someonex.netSomeoneSerge (matrix works sometimes)
.github/workflows/_linux-build.yml
161:          docker exec -t "${container_name}" sh -c '.jenkins/pytorch/build.sh'
22:45:01
@ss:someonex.netSomeoneSerge (matrix works sometimes)Yes, I think this is the right one: https://github.com/pytorch/pytorch/blob/39e8311a29b5713c8858cab73a8f713a7f3d531c/.github/workflows/_binary-build-linux.yml#L205 ...but they still take the flags from elsewhere and just propagate them22:51:02
@ss:someonex.netSomeoneSerge (matrix works sometimes)aaaaaand 0 workflows run https://github.com/pytorch/pytorch/actions/workflows/_binary-build-linux.yml22:51:55
@hexa:lossy.networkhexayeah, why would they run that 😄22:52:07
@ss:someonex.netSomeoneSerge (matrix works sometimes)https://github.com/pytorch/pytorch/actions/runs/433782356222:54:01
@ss:someonex.netSomeoneSerge (matrix works sometimes)Here https://github.com/pytorch/pytorch/actions/runs/4337823562/jobs/7574087583#step:14:30522:55:00
@ss:someonex.netSomeoneSerge (matrix works sometimes)

-DBUILD_LIBTORCH_CPU_WITH_DEBUG=0

Ok, how do we check we don't have any debug symbols in our libs?

22:55:59
@hexa:lossy.networkhexaobjdump --syms22:59:10
@ss:someonex.netSomeoneSerge (matrix works sometimes) -DUSE_NCCL=1 22:59:17
@ss:someonex.netSomeoneSerge (matrix works sometimes)hmmm, I didn't even know it can be built without cuda23:00:18
@ss:someonex.netSomeoneSerge (matrix works sometimes)
In reply to @hexa:lossy.network
objdump --syms
Seems fine
23:04:58
@hexa:lossy.networkhexaagreed23:05:05
@ss:someonex.netSomeoneSerge (matrix works sometimes) -DUSE_FBGEMM? 23:07:02
@hexa:lossy.networkhexa

FBGEMM (Facebook GEneral Matrix Multiplication) is a low-precision, high-performance matrix-matrix multiplications and convolution library

23:09:25
@ss:someonex.netSomeoneSerge (matrix works sometimes)
❯ nix log nixpkgs#python3Packages.torch
...

--   USE_EIGEN_FOR_BLAS    : ON
--   USE_FBGEMM            : ON
--     USE_FAKELOWP          : OFF
--   USE_KINETO            : ON
...
23:09:47
7 Mar 2023
@ss:someonex.netSomeoneSerge (matrix works sometimes)(how does one get an nvim diffsplit online?)00:01:28
@ss:someonex.netSomeoneSerge (matrix works sometimes) Meanwhile, two notable GLIBCXX_3.4.30 failures remain: python3Packages.jax and python3Packages.torchvision 00:39:55
@ss:someonex.netSomeoneSerge (matrix works sometimes)

Same error I saw with gpflow, something about scipy/optimize:

    from ._highs._highs_wrapper import _highs_wrapper
E   ImportError: /nix/store/205vsmbfhq1q2vhgskpqyymqvba4mscp-gcc-11.3.0-lib/lib/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by /nix/store/yi7jc5p2mlwb3j37j7gwj15bk45j6xqs-python3.10-scipy-1.9.3/lib/python3.10/site-packages/scipy/optimize/_highs/_highs_wrapper.cpython-310-x86_64-linux-gnu.so)
00:40:34
@ss:someonex.netSomeoneSerge (matrix works sometimes) * Meanwhile, two notable GLIBCXX_3.4.30 failures remain: python3Packages.jax (not jaxlib) and python3Packages.torchvision 01:21:16
@ss:someonex.netSomeoneSerge (matrix works sometimes)MKL -> down to 5s01:29:23
@ss:someonex.netSomeoneSerge (matrix works sometimes)Mystery solved, I guess01:29:38
@ss:someonex.netSomeoneSerge (matrix works sometimes) connor (he/him): you should've received an automated invite to the NixOS GH organization back when your first PR was merged. I don't see NixOS in your org list. Could you have missed it? 12:07:02
@ss:someonex.netSomeoneSerge (matrix works sometimes) *

connor (he/him): you should've received an automated invite to the NixOS GH organization back when your first PR was merged. I don't see NixOS in your org list. Could you have missed it?

EDIT: Oh! That's how it works: https://github.com/NixOS/nixpkgs/issues/50105#issuecomment-1458140917

Add yourself to maintainers/maintainer-list.nix

13:16:35
@hexa:lossy.networkhexafrom where do I get nvcc?14:30:56
@hexa:lossy.networkhexanix-locate is being entirely unhelpful14:31:10
@hexa:lossy.networkhexa I suspect somewhere below cudatoolkit? 14:31:24
@justbrowsing:matrix.orgKevin Mittman (UTC-7) cuda_nvcc ? 15:08:10
@hexa:lossy.networkhexaoh, thanks15:18:25
@hexa:lossy.networkhexa a bit redundant, below cudaPackages 15:18:36
@ss:someonex.netSomeoneSerge (matrix works sometimes)
In reply to @hexa:lossy.network
a bit redundant, below cudaPackages
Yea, but the names come from the json manifests published by nvidia. And if we strip cuda_ from cuda_nvcc, should we keep lib in libcublas? And then you learn that libcusparse and cuSPARSE are different things
16:30:18

Show newer messages


Back to Room ListRoom Version: 9