!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

290 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
57 Servers

21 Jun 2024
@glepage:matrix.orgGaétan Lepage *

connor (he/him) (UTC-5) could you please give the following a shot?

nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlib
20:48:48
@glepage:matrix.orgGaétan Lepage *

connor (he/him) (UTC-5) could you please give the following a shot?

nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
20:48:50
@aidalgol:matrix.orgaidalgol
In reply to @ss:someonex.net
...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression?
Sorry, I have not been using TensorRT in a while, so I'm not catching these, even though I put it in nixpkgs to begin with. :S
21:19:55
@lcw:matrix.orgLucas
In reply to @lcw:matrix.org

Does anyone have nsight_systems working?

I am using CUDA to develop programs on NixOS 24.05 and it is working great. Now I want to profile my code.
Using the following flake

{
  description = "nsight_systems";

  inputs = {
    # nixpkgs.url = "github:NixOS/nixpkgs/release-24.05";
    # nixpkgs.url = "github:NixOS/nixpkgs/nixpkgs-unstable";
    nixpkgs.url = "github:ConnorBaker/nixpkgs/feat/cudaPackages-fixed-output-derivations";
  };
  outputs = { self, nixpkgs }:
    let
      system = "x86_64-linux";
      pkgs = import nixpkgs { inherit system; config.allowUnfree = true; };
    in
    {
      devShells.${system}.default = pkgs.mkShell {
        nativeBuildInputs = [
          pkgs.cudaPackages.nsight_systems
          pkgs.cudaPackages.nsight_compute
        ];
      };
    };
}

I was able to get ncu working.

I was able to get nsight_systems working from nixpkgs.url = "github:mcwitt/nixpkgs/fix/nsight_systems";.
21:26:13
@ss:someonex.netSomeoneSerge (back on matrix)
In reply to @lcw:matrix.org

But when I try to run nsys-ui I get a dialogue box with the error message

Failed to load plugin: QuadDPlugin

Cannot load library /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: undefined symbol: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEED1Ev, version Qt_6

Some functionality will be disabled
Looks like a leftover vendored Qt library that we ought to relink
21:45:46
@lcw:matrix.orgLucas
In reply to @ss:someonex.net
Looks like a leftover vendored Qt library that we ought to relink
Oh cool. Is there an example of relinking the libraries that I can follow? Is it this https://github.com/ConnorBaker/nixpkgs/blob/9ee229fe705580b62fc9011f5d8cc78e87f85971/pkgs/development/cuda-modules/overrides/cuda/nsight_systems.nix#L102-L121 ?
22:19:56
22 Jun 2024
@search-sense:matrix.orgsearch-sense
In reply to @ss:someonex.net
...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression?

the essence of the problem is this:

       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies.
16:12:11
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
In reply to @glepage:matrix.org

connor (he/him) (UTC-5) could you please give the following a shot?

nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
I’ll try to run it later tonight if you’d still like
21:10:09
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
In reply to @search-sense:matrix.org

the essence of the problem is this:

       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies..
That should be provided by cuda_cudart.lib
21:10:44
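
As an illustration of the suggestion above, here is a minimal, untested sketch that adds the library output of cuda_cudart to the TensorRT derivation's buildInputs so that auto-patchelf can find libcudart.so.12. The attribute name `cudaPackages.tensorrt` and the use of `overrideAttrs` are assumptions about how the package is being built; if you maintain your own TensorRT expression, the same `buildInputs` line would go there instead.

```nix
# Untested sketch, assuming the in-tree cudaPackages.tensorrt is the package
# being built. lib.getLib selects the "lib" output, i.e. cuda_cudart.lib.
{ pkgs ? import <nixpkgs> { config.allowUnfree = true; } }:

pkgs.cudaPackages.tensorrt.overrideAttrs (old: {
  # auto-patchelf searches the buildInputs for the shared libraries
  # requested by the prebuilt binaries (here: libcudart.so.12).
  buildInputs = (old.buildInputs or [ ]) ++ [
    (pkgs.lib.getLib pkgs.cudaPackages.cuda_cudart)
  ];
})
```

Whether this alone satisfies every dependency of TensorRT 10.1 is not confirmed in the conversation.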
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
Sorry for the lack of movement on PRs and such; I found http://aggregate.org/DIT/PARSEK/ and spent a week and a half trying to rewrite everything to use JAX, and that’s been a trip.
21:12:08
@hexa:lossy.networkhexa
Gaétan Lepage: feel free to cull the botorch test suite
21:29:00
@hexa:lossy.networkhexa
it keeps clogging my hydra build slots a lot
21:29:11
@glepage:matrix.orgGaétan Lepage
Yes that's quite sad...
21:29:50
@glepage:matrix.orgGaétan Lepage
Btw, this PR fixes the jaxlib hash error: https://github.com/NixOS/nixpkgs/pull/321559
21:30:10
@glepage:matrix.orgGaétan Lepage
Still unclear how/why it happened
21:30:23
24 Jun 2024
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)

Gaétan Lepage: ran into an error doing the build

$ cat ~/.config/nixpkgs/config.nix 
{
  allowAliases = false;
  allowBroken = false;
  allowUnfree = true;
  checkMeta = true;
  cudaCapabilities = [ "7.5" ];
  cudaSupport = true;
}
$ nix build --impure -L github:GaetanLepage/nixpkgs/jax#python311Packages.jaxlibWithCuda

https://gist.github.com/ConnorBaker/56253dd716f7b0a30705916cff13f964

02:48:45
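
The command above relies on `--impure` so that Nixpkgs can read `~/.config/nixpkgs/config.nix`. A hedged alternative is to pin the same settings in a small flake, which keeps the evaluation pure. The sketch below is untested; the `nixpkgs.url` value is a placeholder assumption and should point at whichever branch is under test.

```nix
# Untested sketch: the config.nix settings from the message above, passed
# explicitly to Nixpkgs instead of being read from $HOME.
{
  description = "jaxlib with CUDA, config pinned in the flake";

  # Placeholder input (assumption): replace with the branch to be tested.
  inputs.nixpkgs.url = "github:NixOS/nixpkgs/nixpkgs-unstable";

  outputs = { self, nixpkgs }:
    let
      system = "x86_64-linux";
      pkgs = import nixpkgs {
        inherit system;
        config = {
          allowAliases = false;
          allowBroken = false;
          allowUnfree = true;
          checkMeta = true;
          cudaCapabilities = [ "7.5" ];
          cudaSupport = true;
        };
      };
    in
    {
      packages.${system}.default = pkgs.python311Packages.jaxlibWithCuda;
    };
}
```

With this in place, `nix build -L` in the flake's directory should evaluate the same package without `--impure`.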
@glepage:matrix.orgGaétan Lepage Thanks for taking the time connor (he/him) (UTC-5).
I actually wanted you to try my jaxlib branch.
The jax branch is for the jax update and it is basically stuck because of some important changes in the upstream packaging/build process
06:27:56
@hexa:lossy.networkhexa
In reply to @hexa:lossy.network
it keeps clogging my hydra build slots a lot
also not a fan of deepwave and qutip
10:10:47
@hexa:lossy.networkhexa

https://hj.9fs.net/emery/p/1719216291.477930

10:10:57
@hexa:lossy.networkhexa *

"Wave propagation modules for PyTorch"

10:11:12
@hexa:lossy.networkhexa

"Open-source software for simulating the dynamics of closed and open quantum systems"

10:11:27
@ss:someonex.netSomeoneSerge (back on matrix)
"Not just an infra problem"
10:14:54
@hexa:lossy.networkhexa
what are you quoting?
10:20:20
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
In reply to @glepage:matrix.org
Thanks for taking the time connor (he/him) (UTC-5).
I actually wanted you to try my jaxlib branch.
The jax branch is for the jax update and it is basically stuck because of some important changes in the upstream packaging/build process
I tried but didn’t see a jaxlib branch :/
12:51:28
@glepage:matrix.orgGaétan Lepage
Oh indeed, I deleted it after the PR was merged.
13:09:03
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
Sorry I didn’t get to it sooner, my bad :(
13:19:45
@connorbaker:matrix.orgconnor (burnt/out) (UTC-8)
Gaétan Lepage: you mentioned you had a wrapper (script?) around Nixpkgs-review which ran it with different configurations. Any chance you can share it? I’ve been wanting to have some way to have Nixpkgs-review generate an expression for everything it would build, parameterized by multiple Nixpkgs configs.
13:21:40
@glepage:matrix.orgGaétan Lepage
In reply to @connorbaker:matrix.org
Sorry I didn’t get to it sooner, my bad :(
No worries!
13:29:52
@glepage:matrix.orgGaétan Lepage
In reply to @connorbaker:matrix.org
Gaétan Lepage: you mentioned you had a wrapper (script?) around Nixpkgs-review which ran it with different configurations. Any chance you can share it? I’ve been wanting to have some way to have Nixpkgs-review generate an expression for everything it would build, parameterized by multiple Nixpkgs configs.
Yes, I have a script which starts nixpkgs-review for several architectures at once.
My long-term plan is to contribute upstream so the tool can run for several architectures at once and post a single message on the PR
13:31:20
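
The idea of one expression parameterized by several Nixpkgs configs can be sketched in plain Nix. This is neither the script mentioned above nor a nixpkgs-review feature; the names `configs`, `attrPaths`, and `forConfig` are hypothetical, and the example is untested.

```nix
# Hypothetical sketch: build the same attribute paths under several Nixpkgs
# configs, producing one attribute per (config, package) pair.
{ nixpkgs ? <nixpkgs>, system ? "x86_64-linux" }:
let
  lib = import (nixpkgs + "/lib");

  # Assumed example configs; extend as needed.
  configs = {
    cpu = { allowUnfree = true; };
    cuda = {
      allowUnfree = true;
      cudaSupport = true;
      cudaCapabilities = [ "7.5" ];
    };
  };

  # Attribute paths to build under every config (illustrative choice).
  attrPaths = [ [ "python311Packages" "jaxlib" ] ];

  # Import Nixpkgs once per config and look up each attribute path in it.
  forConfig = cfgName: config:
    let
      pkgs = import nixpkgs { inherit system config; };
    in
    map (path: {
      name = "${cfgName}-${lib.concatStringsSep "-" path}";
      value = lib.getAttrFromPath path pkgs;
    }) attrPaths;
in
lib.listToAttrs (lib.concatLists (lib.mapAttrsToList forConfig configs))
```

Saved as, say, `multi-config.nix`, running `nix-build multi-config.nix` would build every pair, and `-A cuda-python311Packages-jaxlib` would select a single one.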

Room Version: 9