!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

211 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda42 Servers

Load older messages


SenderMessageTime
10 Aug 2024
@ss:someonex.netSomeoneSerge (utc+3)
In reply to @yorickvp:matrix.org

bisecting the following:

let
  pkgs = import ./. {
    config = {
      allowUnfree = true;
      cudaCapabilities = [ "8.6" ];
      cudaSupport = true;
    };
  };
in
{
  torchtest = (pkgs.python3.pkgs.torch.override { openai-triton = null; }).overridePythonAttrs (o: {
    disallowedReferences = [ pkgs.python3.pkgs.torch.cudaPackages.cuda_nvcc.stdenv.cc.cc.lib ];
    USE_CUDNN = 0;
    USE_KINETO = 0;
    USE_QNNPACK = 0;
    USE_PYTORCH_QNNPACK = 0;
    USE_XNNPACK = 0;
    INTERN_DISABLE_ONNX = 1;
    ONNX_ML = 0;
    USE_ITT = 0;
    USE_FLASH_ATTENTION = 0;
    USE_MEM_EFF_ATTENTION = 0;
    USE_FBGEMM = 0;
    USE_MKLDNN = 0;
  });
}
https://github.com/NixOS/nixpkgs/pull/333575#issuecomment-2278925778
01:25:55
@ss:someonex.netSomeoneSerge (utc+3) yorickvp can you summarize your findings in a github issue? 01:31:53
@ss:someonex.netSomeoneSerge (utc+3) * yorickvp can you summarize your findings in a github issue, and link th epr? 01:32:12
@ss:someonex.netSomeoneSerge (utc+3) * yorickvp can you summarize your findings in a github issue, and link the pr? 01:32:16
@ss:someonex.netSomeoneSerge (utc+3)
In reply to @glepage:matrix.org
https://github.com/mikex86/LibreCuda
Rum and coke, melting ice
01:34:23
@yorickvp:matrix.orgyorickvp
In reply to @ss:someonex.net
yorickvp can you summarize your findings in a github issue, and link the pr?
My bisect findings is that this has been happening ever since the mixed gcc was introduced
08:41:16
@yorickvp:matrix.orgyorickvpThank you for the fix :) 08:41:51
@shekhinah:she.khinah.xyzshekhinah changed their display name from shekhinah | kosmisches Zeitreisebüro to shekhinah.12:06:24
@shekhinah:she.khinah.xyzshekhinah changed their profile picture.12:06:46
@shekhinah:she.khinah.xyzshekhinah changed their profile picture.12:09:02
@shekhinah:she.khinah.xyzshekhinah changed their profile picture.12:09:26
@shekhinah:she.khinah.xyzshekhinah changed their profile picture.12:11:43
@ss:someonex.netSomeoneSerge (utc+3)
In reply to @yorickvp:matrix.org
My bisect findings is that this has been happening ever since the mixed gcc was introduced
Can you open an issue describing the pytorch situation nonetheless?
14:00:03
@glepage:matrix.orgGaétan Lepage

Hey connor (he/him) (UTC-5),
I have a quick question regarding the torch update.
The CUDA build of pytorch now fails because of the addition of a new cuda capability 9.0a.

Someone suggested the following solutions:
https://github.com/NixOS/nixpkgs/pull/329836#issuecomment-2279508588

What is your opinion ?

20:58:59
@glepage:matrix.orgGaétan Lepage Should we remove the rm -rf cmake/Modules_CUDA_fix/{upstream,FindCUDA.cmake} line in our torch derivation or should we patch the regex in the cmake derivation to properly parse 9.0a ? 21:00:00
@connorbaker:matrix.orgconnor (he/him) (UTC-7)That seems familiar23:08:17
@connorbaker:matrix.orgconnor (he/him) (UTC-7)Oh maybe check these out23:08:27
@connorbaker:matrix.orgconnor (he/him) (UTC-7)https://github.com/ConnorBaker/nixpkgs/commits/feat/python3Packages.torch/23:08:31
@connorbaker:matrix.orgconnor (he/him) (UTC-7)I know I was looking at fixes for the torch derivation in general (I guess two months ago at this point now) and that was one of the things I ran into23:09:08
@connorbaker:matrix.orgconnor (he/him) (UTC-7)My preference is always to patch; IMHO Nixpkgs is a really easy way to reliably build these tools for cutting edge compute capabilities and I’d like that to continue23:10:00
11 Aug 2024
@glepage:matrix.orgGaétan LepageOk, thank you very much for sharing your patch !08:48:12
12 Aug 2024
@ss:someonex.netSomeoneSerge (utc+3) Kevin Mittman any idea why the softdep trick might not work with the open driver? $ sudo nvidia-settings
ERROR: libEGL setup error : libEGL.so.1: cannot open shared object file: No such file or directory
17:16:38
@ss:someonex.netSomeoneSerge (utc+3) * Kevin Mittman any idea why the softdep trick might not work with the open driver? $ sudo nvidia-settings
https://github.com/NixOS/nixpkgs/issues/334180#issuecomment-2284535323
17:16:47
@ss:someonex.netSomeoneSerge (utc+3) * Kevin Mittman any idea why the softdep trick might not work with the open driver?
https://github.com/NixOS/nixpkgs/issues/334180#issuecomment-2284535323
17:16:51
@ss:someonex.netSomeoneSerge (utc+3) * Kevin Mittman any idea why the softdep trick might not work with the open driver?
https://github.com/NixOS/nixpkgs/issues/334180
17:16:54
@connorbaker:matrix.orgconnor (he/him) (UTC-7)
In reply to @glepage:matrix.org
Ok, thank you very much for sharing your patch !
Please feel free to use more than just the patch too; it might be a while before I’m able to revisit that particular work
18:07:32
@connorbaker:matrix.orgconnor (he/him) (UTC-7)I’m probably going to be focused fairly heavily on Jetson related stuff for a while18:07:46
@accipiterx:matrix.org@accipiterx:matrix.org joined the room.21:00:17
13 Aug 2024
@connorbaker:matrix.orgconnor (he/him) (UTC-7) SomeoneSerge (UTC+3): I haven't had a chance to follow super closely, but for the aarch64-builders, do we have anything for CI beside the Hetzner instance I have? I'm trying to cut down on cloud infra I pay for so I can pay for my electricity bill (which is an order of magnitude more than I had budgeted for) 05:25:13
@connorbaker:matrix.orgconnor (he/him) (UTC-7)I'm not sure what the status of CUDA packages on the NixOS-community hydra is05:25:30

Show newer messages


Back to Room ListRoom Version: 9