| 14 Oct 2024 |
SomeoneSerge (back on matrix) |
error: downloading 'https://github.com/abseil/abseil-cpp/archive/refs/tags/20230125.3.tar.gz' failed
lol
| 20:19:08 |
SomeoneSerge (back on matrix) | In reply to @ss:someonex.net Yes but also the hydra history is all green 𤷠Maybe that just came in from staging | 20:19:30 |
| 15 Oct 2024 |
connor (burnt/out) (UTC-8) | In reply to @glepage:matrix.org I can't get onnx to build...
Here are the logs in case someone know what is happening: https://paste.glepage.com/upload/eel-falcon-sloth Onnx's CMake isn't detecting at least one dependency, so it tries to download them all in order, starting with abseil. Since there's no networking in the sandbox, it fails. | 00:06:48 |
connor (burnt/out) (UTC-8) | I'm currently working on Onnx packaging for a thing, and you can see what I've got going on here: https://github.com/ConnorBaker/cuda-packages/blob/main/cudaPackages-common/onnx.nix
(It's a combination C++/Python install so it's gnarly. But better than having two separate derivations with libraries built with different flags, I guess.) | 00:09:04 |
GaƩtan Lepage | Ok interesting, thanks for sharing | 05:46:57 |
GaƩtan Lepage | Is your plan to upstream this to nixpkgs ? | 05:47:13 |
GaƩtan Lepage | [triton update]
triton-llvm fails during the test phase.
Logs: https://paste.glepage.com/upload/fish-jaguar-pig | 08:48:05 |
| atagen joined the room. | 11:38:21 |
SomeoneSerge (back on matrix) | In reply to @glepage:matrix.org [triton update]
triton-llvm fails during the test phase.
Logs: https://paste.glepage.com/upload/fish-jaguar-pig Can't reproduce, builds for me | 12:35:31 |
SomeoneSerge (back on matrix) | In reply to @glepage:matrix.org [triton update]
triton-llvm fails during the test phase.
Logs: https://paste.glepage.com/upload/fish-jaguar-pig * Can't reproduce, builds for me. Maybe we tried different HEADs? | 12:36:26 |
atagen | hi, what am I missing to get a cache hit? going by this hydra output torch should be in the cache (for nixpkgs 5633bcf). I have nix-community cachix set up, allowUnfree, cudaSupport,, and the package in question is providing its overlay properly with final.callPackage so it ought to be using my system packages | 12:46:24 |
atagen | https://gist.github.com/atagen/615e187e323f3ca3f5f9d40e55ce2b7c | 12:55:50 |
atagen | oof, could it be because I'm specifying python311Packages instead of python3Packages? | 12:57:30 |
atagen | ... yup, that was it | 12:58:23 |
SomeoneSerge (back on matrix) | https://github.com/NixOS/nixpkgs/blob/70f9c111b27db0d459a227e477acce62016cbf10/pkgs/top-level/release-cuda.nix#L118 | 13:04:59 |
SomeoneSerge (back on matrix) | In reply to @glepage:matrix.org [triton update]
triton-llvm fails during the test phase.
Logs: https://paste.glepage.com/upload/fish-jaguar-pig With the current HEAD and ccache off I just reached the pytest branch | 14:17:17 |
GaƩtan Lepage | In reply to @ss:someonex.net With the current HEAD and ccache off I just reached the pytest branch You mean that you were able to build it fine ? | 14:47:18 |
SomeoneSerge (back on matrix) | Yes | 14:47:27 |
SomeoneSerge (back on matrix) | Well the pytest bit fails with these 20 tests ofc but that'll come later | 14:47:41 |
GaƩtan Lepage | Ok, weird then... | 14:49:24 |
GaƩtan Lepage | Btw, I'm running a cross-system review for this triton PR. | 14:49:35 |
GaƩtan Lepage | quite a few rebuilds | 14:49:40 |
connor (burnt/out) (UTC-8) | In reply to @glepage:matrix.org Ok interesting, thanks for sharing Yep, that's the goal. My hope is to replace the current CUDA packaging stuff with what I've got there. I personally will be maintaining CUDA 11.8 for a while but mark it as end of life. Since it requires toolchains which will be removed upstream, I'll keep it out of tree. My plan is to only maintain the latest version of CUDA, but block upgrades to newer versions if some prominent packages don't build, even on master. I plan to ship the same version of most libraries that NVIDIA does with its ML containers, which means roughly a monthly release cadence. | 16:19:57 |
connor (burnt/out) (UTC-8) | Of course, all this is pending agreement with the other maintainers, but it would certainly help cut down the scope of CUDA packages and allow us to better populate the cache since there'd be really just one version supported upstream | 16:20:36 |
GaƩtan Lepage | This looks smart indeed ! | 16:55:13 |
| 16 Oct 2024 |
GaƩtan Lepage | As the onnx failure was blocking me elsewhere, I went and fixed it myself.
Any review is welcome :)
https://github.com/NixOS/nixpkgs/pull/348985 | 09:07:07 |
hexa | :: (nixbld1) ā /nix/store/svw8b4655f6w413xz23jjg6yn4b1d9p0-python3.12-tensordict-0.5.0
UID PID PPID STIME TIME COMMAND
30001 4207 4170 15:14 00:00:00 bash -e /nix/store/v6x3cs394jgqfbi0a42pam708flxaphh-default-builder.sh
30001 4737 4207 15:15 00:02:09 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 4781 4737 15:15 00:00:03 [pt_main_thread] <defunct>
30001 28942 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 28943 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 28984 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 29021 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29073 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 29098 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 29144 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 29184 4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001 29246 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29264 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29304 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29344 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29384 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29463 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29464 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29512 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29540 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29590 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29631 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29664 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29736 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29750 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29821 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29824 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29901 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29905 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29986 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 29989 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30044 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30069 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30110 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30150 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30214 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30231 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30273 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30311 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30390 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30398 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30460 4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001 30471 4737 15:17 00:00:00 [pt_main_thread] <defunct>
| 17:06:17 |
hexa | tensordict has been a pain in the last python-updates cycle | 17:06:26 |
hexa | and still is | 17:06:30 |
GaƩtan Lepage | Ah really ? Lately I had no issues on master to build it. It takes some time but it always succeed.
Have you bumped its version ?
Do you want me to have a look ? | 17:21:59 |