| 21 Jun 2024 |
Gaétan Lepage | * connor (he/him) (UTC-5) could you please give a shot at the following ?
nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlib
| 20:48:48 |
Gaétan Lepage | * connor (he/him) (UTC-5) could you please give a shot at the following ?
nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
| 20:48:50 |
aidalgol | In reply to @ss:someonex.net ...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression? Sorry, I have not been using TensorRT in a while, so I'm not catching these, even though I put it in nixpkgs to begin with. :S | 21:19:55 |
Lucas | In reply to @lcw:matrix.org
Does anyone have nsight_systems working?
I am using CUDA to develop programs on NixOS 24.05 and it is working great. Now I want to profile my code. Using the following flake
{
description = "nsight_systems";
inputs = {
# nixpkgs.url = "github:NixOS/nixpkgs/release-24.05";
# nixpkgs.url = "github:NixOS/nixpkgs/nixpkgs-unstable";
nixpkgs.url = "github:ConnorBaker/nixpkgs/feat/cudaPackages-fixed-output-derivations";
};
outputs = { self, nixpkgs }:
let
system = "x86_64-linux";
pkgs = import nixpkgs { system = system; config.allowUnfree = true; };
in
{
devShells.${system}.default = pkgs.mkShell {
nativeBuildInputs = [
pkgs.cudaPackages.nsight_systems
pkgs.cudaPackages.nsight_compute
];
};
};
}
I was able to get ncu working.
I was able to get nsight_systems working from nixpkgs.url = "github:mcwitt/nixpkgs/fix/nsight_systems";. | 21:26:13 |
SomeoneSerge (back on matrix) | In reply to @lcw:matrix.org
But when I try to run nsys-ui I get a dialogue box with the error message
Failed to load plugin: QuadDPlugin
Cannot load library /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: /nix/store/hzp2wmqbqihx4slp353ixs405ry6li4f-cuda12.5-nsight_systems-2024.2.3.38-bin/nsight-systems/2024.2.3/host-linux-x64/Plugins/QuadDPlugin/libQuadDPlugin.so: undefined symbol: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEED1Ev, version Qt_6
Some functionality will be disabled
Looks likw a lwftover vendored qt library that we oughtta relink | 21:45:46 |
Lucas | In reply to @ss:someonex.net Looks likw a lwftover vendored qt library that we oughtta relink Oh cool. Is there an example of relinking the libraries that I can follow? Is it this https://github.com/ConnorBaker/nixpkgs/blob/9ee229fe705580b62fc9011f5d8cc78e87f85971/pkgs/development/cuda-modules/overrides/cuda/nsight_systems.nix#L102-L121 ? | 22:19:56 |
| 22 Jun 2024 |
search-sense | In reply to @ss:someonex.net ...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression? the essence of the problem is this:
> error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
> auto-patchelf failed to find all the required dependencies.```
| 16:12:11 |
search-sense | In reply to @ss:someonex.net ...primarily because of^^^ and because no one seems to be actively using Nixpkgs' in-tree trt expression? * the essence of the problem is this:
> error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
> auto-patchelf failed to find all the required dependencies..
```
| 16:12:24 |
connor (burnt/out) (UTC-8) | In reply to @glepage:matrix.org
connor (he/him) (UTC-5) could you please give a shot at the following ?
nix build github:GaetanLepage/nixpkgs/jaxlib#python311Packages.jaxlibWithCuda
I’ll try to run it later tonight if you’d still like | 21:10:09 |
connor (burnt/out) (UTC-8) | In reply to @search-sense:matrix.org
the essence of the problem is this:
> error: auto-patchelf could not satisfy dependency libcudart.so.12 wanted by /nix/store/799sv915xqi5b8n14hdkbbp6h06rrjz7-tensorrt-10.1.0.27-bin/bin/trtexec
> auto-patchelf failed to find all the required dependencies..
```
That should be provided by cuda_cudart.lib | 21:10:44 |
connor (burnt/out) (UTC-8) | Sorry for the lack of movements on PRs and such; I found http://aggregate.org/DIT/PARSEK/ and spent a week and a half trying to rewrite everything to use JAX and that’s been a trip. | 21:12:08 |
hexa | Gaétan Lepage: feel free to cull the botorch test suite | 21:29:00 |
hexa | it keeps clogging my hydra build slots alot | 21:29:11 |
Gaétan Lepage | Yes that's quite sad... | 21:29:50 |
Gaétan Lepage | Btw, this PR fixes the jaxlib hash error: https://github.com/NixOS/nixpkgs/pull/321559 | 21:30:10 |
Gaétan Lepage | Still unclear how/why it happened | 21:30:23 |
| 24 Jun 2024 |
connor (burnt/out) (UTC-8) | Gaétan Lepage: ran into an error doing the build
$ cat ~/.config/nixpkgs/config.nix
{
allowAliases = false;
allowBroken = false;
allowUnfree = true;
checkMeta = true;
cudaCapabilities = [ "7.5" ];
cudaSupport = true;
}
$ nix build --impure -L github:GaetanLepage/nixpkgs/jax#python311Packages.jaxlibWithCuda
https://gist.github.com/ConnorBaker/56253dd716f7b0a30705916cff13f964
| 02:48:45 |
Gaétan Lepage | Thanks for taking the time connor (he/him) (UTC-5).
I actually wanted you to try my jaxlib branch.
The jax branch is for the jax update and it is basically stuck becaus of some important changes in the upstream packaging/build process | 06:27:56 |
hexa | In reply to @hexa:lossy.network it keeps clogging my hydra build slots alot also not a fan of deepwave and qutip | 10:10:47 |
hexa |
https://hj.9fs.net/emery/p/1719216291.477930
| 10:10:57 |
hexa | *
"Wave propagation modules for PyTorch"
| 10:11:12 |
hexa |
"Open-source software for simulating the dynamics of closed and open quantum systems"
| 10:11:27 |
SomeoneSerge (back on matrix) | "Not just an infra problem" | 10:14:54 |
hexa | what are you quoting? | 10:20:20 |
connor (burnt/out) (UTC-8) | In reply to @glepage:matrix.org Thanks for taking the time connor (he/him) (UTC-5).
I actually wanted you to try my jaxlib branch.
The jax branch is for the jax update and it is basically stuck becaus of some important changes in the upstream packaging/build process I tried but didn’t see a jaxlib branch :/ | 12:51:28 |
Gaétan Lepage | Oh indeed, I deleted it after the PR was merged. | 13:09:03 |
connor (burnt/out) (UTC-8) | Sorry I didn’t get to it sooner, my bad :( | 13:19:45 |
connor (burnt/out) (UTC-8) | Gaétan Lepage: you mentioned you had a wrapper (script?) around Nixpkgs-review which ran it with different configurations. Any chance you can share it? I’ve been wanting to have some way to have Nixpkgs-review generate an expression for everything it would build, parameterized by multiple Nixpkgs configs. | 13:21:40 |
Gaétan Lepage | In reply to @connorbaker:matrix.org Sorry I didn’t get to it sooner, my bad :( No worry ! | 13:29:52 |
Gaétan Lepage | In reply to @connorbaker:matrix.org Gaétan Lepage: you mentioned you had a wrapper (script?) around Nixpkgs-review which ran it with different configurations. Any chance you can share it? I’ve been wanting to have some way to have Nixpkgs-review generate an expression for everything it would build, parameterized by multiple Nixpkgs configs. Yes, I have a script which start nixpkgs-review for several architectures at once.
My plan long term is to contribute to upstream for being able to run the tool for several architectures at once and have a single message posted on the PR | 13:31:20 |