!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

311 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda
60 Servers

21 Jun 2024
@search-sense:matrix.orgsearch-sense

Is anyone interested in adding the latest tensorrt-10.1.0 to NixOS?

searching for dependencies of /nix/store/gknr686xg6ggafkdfy5323bc7f1m5yf7-tensorrt-10.1.0.27-lib/lib/stubs/libnvinfer_vc_plugin.so
    libstdc++.so.6 -> found: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib
    libgcc_s.so.1 -> found: /nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib
setting RPATH to: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib:/nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib:$ORIGIN
auto-patchelf: 1 dependencies could not be satisfied
error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
auto-patchelf failed to find all the required dependencies.
Add the missing dependencies to --libs or use `--ignore-missing="foo.so.1 bar.so etc.so"`.
error: builder for '/nix/store/7rqkwg91vnk5d3p4vaym0z0pskkmj4r8-tensorrt-10.1.0.27.drv' failed with exit code 1;
       last 10 log lines:
       >     libgcc_s.so.1 -> found: /nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib
       > setting RPATH to: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib:/nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib:$ORIGIN
       > searching for dependencies of /nix/store/gknr686xg6ggafkdfy5323bc7f1m5yf7-tensorrt-10.1.0.27-lib/lib/stubs/libnvinfer_vc_plugin.so
       >     libstdc++.so.6 -> found: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib
       >     libgcc_s.so.1 -> found: /nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib
       > setting RPATH to: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib:/nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib:$ORIGIN
       > auto-patchelf: 1 dependencies could not be satisfied
       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies.
       > Add the missing dependencies to --libs or use `--ignore-missing="foo.so.1 bar.so etc.so"`.
       For full logs, run 'nix log /nix/store/7rqkwg91vnk5d3p4vaym0z0pskkmj4r8-tensorrt-10.1.0.27.drv'.
07:59:22
@search-sense:matrix.orgsearch-sense

export NIXPKGS_ALLOW_UNFREE=1 && nix-build -A cudaPackages.tensorrt

       > setting RPATH to: /nix/store/bn7pnigb0f8874m6riiw6dngsmdyic1g-gcc-13.3.0-lib/lib:/nix/store/pd8xxiyn2xi21fgg9qm7r0qghsk8715k-gcc-13.3.0-libgcc/lib:$ORIGIN
       > auto-patchelf: 1 dependencies could not be satisfied
       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies.
       > Add the missing dependencies to --libs or use `--ignore-missing="foo.so.1 bar.so etc.so"`.

11:03:14
@ss:someonex.netSomeoneSerge (matrix works sometimes)
In reply to @search-sense:matrix.org

Hello, NixOS community, I want to install python311Packages.tensorrt

TensorRT> command, and try building this derivation again.
TensorRT> $ nix-store --add-fixed sha256 TensorRT-8.6.1.6.Linux.x86_64-gnu.cuda-11.8.tar.gz
TensorRT> ***
error: builder for '/nix/store/140c5c8lpa30r3jrxxbw74631831prrw-TensorRT-8.6.1.6.Linux.x86_64-gnu.cuda-11.8.tar.gz.drv' failed with exit code 1;

but the cuda is 12.2 on my system, is it compatible?

> nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Tue_Aug_15_22:02:13_PDT_2023
Cuda compilation tools, release 12.2, V12.2.140
Build cuda_12.2.r12.2/compiler.33191640_0
You can use cudaPackages.overrideScope to plug in the trt release compatible with your cuda, but I also think trt was originally introduced in Nixpkgs with logic to select the compatible release in each cuda package set automatically. Evidently, that must have broken
15:51:43
@ss:someonex.netSomeoneSerge (matrix works sometimes)
In reply to @ss:someonex.net
Nvidia prevents unattended downloads, of course it broke
...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression?
15:52:47
@lcw:matrix.orgLucas joined the room.
17:13:01
@lcw:matrix.orgLucas *

Does anyone have nsight_systems working?

I am using CUDA to develop programs on NixOS 24.05 and it is working great. Now I want to profile my code.
Using the following flake

{
  description = "nsight_systems";

  inputs = {
    # nixpkgs.url = "github:NixOS/nixpkgs/release-24.05";
    # nixpkgs.url = "github:NixOS/nixpkgs/nixpkgs-unstable";
    nixpkgs.url = "github:ConnorBaker/nixpkgs/feat/cudaPackages-fixed-output-derivations";
  };
  outputs = { self, nixpkgs }:
    let
      system = "x86_64-linux";
      pkgs = import nixpkgs { system = system; config.allowUnfree = true; };
    in
    {
      devShells.${system}.default = pkgs.mkShell {
        nativeBuildInputs = [
          pkgs.cudaPackages.nsight_systems
          pkgs.cudaPackages.nsight_compute
        ];
      };
    };
}

I was able to get ncu working.

19:47:39
@lcw:matrix.orgLucas

But when I try to run nsys-ui I get a dialogue box with the error message

Failed to load plugin: QuadDPlugin

Cannot load library /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: undefined symbol: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEED1Ev, version Qt_6

Some functionality will be disabled
19:50:21
@lcw:matrix.orgLucas
I do have to launch ncu and ncu-ui via their full paths, /nix/store/0v8ydp1hq7ynncwir4hv5hkpna629iw0-cuda12.5-nsight_compute-2024.2.0.16/nsight-compute/2024.2.0/ncu and /nix/store/0v8ydp1hq7ynncwir4hv5hkpna629iw0-cuda12.5-nsight_compute-2024.2.0.16/nsight-compute/2024.2.0/ncu-ui, respectively.
19:52:26
@glepage:matrix.orgGaétan Lepage *

connor (he/him) (UTC-5) could you please give the following a shot?

nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
20:48:50
@aidalgol:matrix.orgaidalgol
In reply to @ss:someonex.net
...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression?
Sorry, I have not been using TensorRT in a while, so I'm not catching these, even though I put it in nixpkgs to begin with. :S
21:19:55
@lcw:matrix.orgLucas
In reply to @lcw:matrix.org

I was able to get nsight_systems working by pointing the flake input at nixpkgs.url = "github:mcwitt/nixpkgs/fix/nsight_systems";
21:26:13
@ss:someonex.netSomeoneSerge (matrix works sometimes)
In reply to @lcw:matrix.org

But when I try to run nsys-ui I get a dialogue box with the error message

Failed to load plugin: QuadDPlugin

Cannot load library /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: undefined symbol: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEED1Ev, version Qt_6

Some functionality will be disabled
Looks like a leftover vendored Qt library that we ought to relink
21:45:46
@lcw:matrix.orgLucas
In reply to @ss:someonex.net
Looks like a leftover vendored Qt library that we ought to relink
Oh cool. Is there an example of relinking the libraries that I can follow? Is it this https://github.com/ConnorBaker/nixpkgs/blob/9ee229fe705580b62fc9011f5d8cc78e87f85971/pkgs/development/cuda-modules/overrides/cuda/nsight_systems.nix#L102-L121 ?
22:19:56
22 Jun 2024
@search-sense:matrix.orgsearch-sense
In reply to @ss:someonex.net
...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression?

the essence of the problem is this:

       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies.
16:12:11
@connorbaker:matrix.orgconnor (he/him)
In reply to @glepage:matrix.org

connor (he/him) (UTC-5) could you please give the following a shot?

nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
I’ll try to run it later tonight if you’d still like
21:10:09
@connorbaker:matrix.orgconnor (he/him)
In reply to @search-sense:matrix.org

the essence of the problem is this:

       > error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
       > auto-patchelf failed to find all the required dependencies.
That should be provided by cuda_cudart.lib
21:10:44
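
A minimal sketch of what that suggestion could look like, assuming the failing derivation is exposed as `cudaPackages.tensorrt` in the tree being built and that auto-patchelf resolves libraries from `buildInputs`. The override is illustrative only, not the in-tree fix, and the TensorRT tarball still has to be added to the store by hand:

```nix
# Hypothetical override: make the CUDA runtime visible to auto-patchelf
# so that libcudart.so.12 can be resolved for bin/trtexec.
{ pkgs ? import <nixpkgs> { config.allowUnfree = true; } }:

pkgs.cudaPackages.tensorrt.overrideAttrs (old: {
  buildInputs = (old.buildInputs or [ ]) ++ [
    # Assumption: the `lib` output of cuda_cudart ships libcudart.so.12.
    (pkgs.lib.getLib pkgs.cudaPackages.cuda_cudart)
  ];
})
```

Saved as a file (say `tensorrt-cudart.nix`, a made-up name), it could be built with `nix-build tensorrt-cudart.nix`.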
@connorbaker:matrix.orgconnor (he/him)
Sorry for the lack of movement on PRs and such; I found http://aggregate.org/DIT/PARSEK/ and spent a week and a half trying to rewrite everything to use JAX and that’s been a trip.
21:12:08
@hexa:lossy.networkhexa
Gaétan Lepage: feel free to cull the botorch test suite
21:29:00
@hexa:lossy.networkhexa
it keeps clogging my hydra build slots a lot
21:29:11
@glepage:matrix.orgGaétan Lepage
Yes that's quite sad...
21:29:50
@glepage:matrix.orgGaétan Lepage
Btw, this PR fixes the jaxlib hash error: https://github.com/NixOS/nixpkgs/pull/321559
21:30:10
@glepage:matrix.orgGaétan Lepage
Still unclear how/why it happened
21:30:23
24 Jun 2024
@connorbaker:matrix.orgconnor (he/him)

Gaétan Lepage: ran into an error doing the build

$ cat ~/.config/nixpkgs/config.nix 
{
  allowAliases = false;
  allowBroken = false;
  allowUnfree = true;
  checkMeta = true;
  cudaCapabilities = [ "7.5" ];
  cudaSupport = true;
}
$ nix build --impure -L github:GaetanLepage/nixpkgs/jax#python311Packages.jaxlibWithCuda

https://gist.github.com/ConnorBaker/56253dd716f7b0a30705916cff13f964

02:48:45
@glepage:matrix.orgGaétan Lepage
Thanks for taking the time, connor (he/him) (UTC-5).
I actually wanted you to try my jaxlib branch.
The jax branch is for the jax update, and it is basically stuck because of some important changes in the upstream packaging/build process.
06:27:56
