!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda

Sender Message Time
15 Oct 2024
@atagen:imagisphe.re atagen ... yup, that was it 12:58:23
@ss:someonex.net SomeoneSerge (utc+3) https://github.com/NixOS/nixpkgs/blob/70f9c111b27db0d459a227e477acce62016cbf10/pkgs/top-level/release-cuda.nix#L118 13:04:59
@ss:someonex.net SomeoneSerge (utc+3)
In reply to @glepage:matrix.org
[triton update]
triton-llvm fails during the test phase.
Logs: https://paste.glepage.com/upload/fish-jaguar-pig
With the current HEAD and ccache off I just reached the pytest branch
14:17:17
@glepage:matrix.org Gaétan Lepage
In reply to @ss:someonex.net
With the current HEAD and ccache off I just reached the pytest branch
You mean that you were able to build it fine?
14:47:18
@ss:someonex.net SomeoneSerge (utc+3) Yes 14:47:27
@ss:someonex.net SomeoneSerge (utc+3) Well the pytest bit fails with these 20 tests ofc but that'll come later 14:47:41
@glepage:matrix.org Gaétan Lepage Ok, weird then... 14:49:24
@glepage:matrix.org Gaétan Lepage Btw, I'm running a cross-system review for this triton PR. 14:49:35
@glepage:matrix.org Gaétan Lepage quite a few rebuilds 14:49:40
@connorbaker:matrix.org connor (he/him) (UTC-7)
In reply to @glepage:matrix.org
Ok interesting, thanks for sharing
Yep, that's the goal. My hope is to replace the current CUDA packaging stuff with what I've got there.
I personally will be maintaining CUDA 11.8 for a while but mark it as end of life. Since it requires toolchains which will be removed upstream, I'll keep it out of tree.
My plan is to only maintain the latest version of CUDA, but block upgrades to newer versions if some prominent packages don't build, even on master.
I plan to ship the same version of most libraries that NVIDIA does with its ML containers, which means roughly a monthly release cadence.
16:19:57
@connorbaker:matrix.org connor (he/him) (UTC-7) Of course, all this is pending agreement with the other maintainers, but it would certainly help cut down the scope of CUDA packages and allow us to better populate the cache, since there'd really be just one version supported upstream 16:20:36
@glepage:matrix.org Gaétan Lepage This looks smart indeed! 16:55:13
16 Oct 2024
@glepage:matrix.org Gaétan Lepage As the onnx failure was blocking me elsewhere, I went and fixed it myself.
Any review is welcome :)
https://github.com/NixOS/nixpkgs/pull/348985
09:07:07
@hexa:lossy.network hexa (UTC+1)
:: (nixbld1) → /nix/store/svw8b4655f6w413xz23jjg6yn4b1d9p0-python3.12-tensordict-0.5.0
  UID     PID    PPID STIME     TIME COMMAND
30001    4207    4170 15:14 00:00:00 bash -e /nix/store/v6x3cs394jgqfbi0a42pam708flxaphh-default-builder.sh
30001    4737    4207 15:15 00:02:09 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001    4781    4737 15:15 00:00:03 [pt_main_thread] <defunct>
30001   28942    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   28943    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   28984    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   29021    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29073    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   29098    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   29144    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   29184    4737 15:17 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30001   29246    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29264    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29304    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29344    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29384    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29463    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29464    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29512    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29540    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29590    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29631    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29664    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29736    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29750    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29821    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29824    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29901    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29905    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29986    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   29989    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30044    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30069    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30110    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30150    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30214    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30231    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30273    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30311    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30390    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30398    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30460    4737 15:17 00:00:00 [pt_main_thread] <defunct>
30001   30471    4737 15:17 00:00:00 [pt_main_thread] <defunct>
17:06:17
@hexa:lossy.network hexa (UTC+1) tensordict has been a pain in the last python-updates cycle 17:06:26
@hexa:lossy.network hexa (UTC+1) and still is 17:06:30
@glepage:matrix.org Gaétan Lepage Ah, really? Lately I've had no issues building it on master. It takes some time, but it always succeeds.
Have you bumped its version?
Do you want me to have a look?
17:21:59
@hexa:lossy.network hexa (UTC+1) no failing tests 17:30:12
@hexa:lossy.network hexa (UTC+1) but they're slowing to a crawl 17:30:17
@hexa:lossy.network hexa (UTC+1) I have not touched it at all 17:30:25
@hexa:lossy.network hexa (UTC+1)
tensordict> test/smoke_test.py .                                                     [  0%]
tensordict> test/test_compile.py ...........................................ssss.... [  0%]
tensordict> ............................ssss....                                     [  0%]
tensordict> test/test_distributed.py s.............................................. [  0%]
tensordict> ..........                                                               [  0%]
tensordict> test/test_functorch.py ................................................. [  0%]
tensordict> ...............................................                          [  0%]
tensordict> test/test_fx.py ...                                                      [  0%]
tensordict> test/test_h5.py ...                                                      [  0%]
tensordict> test/test_memmap.py .................................................... [  0%]
tensordict> ........................................................................ [  1%]
tensordict> ........................................................................ [  1%]
tensordict> ........................................................................ [  1%]
tensordict> ........................................................................ [  1%]
tensordict> ........................................................................ [  2%]
tensordict> ........................................................................ [  2%]
tensordict> ........................................................................ [  2%]
tensordict> .                                                                        [  2%]
tensordict> test/test_nn.py ........................................................ [  2%]
tensordict> ........................................................................ [  3%]
tensordict> ........................................................................ [  3%]
tensordict> ........................................................................ [  3%]
tensordict> ........................................................................ [  3%]
tensordict> ..................................                                       [  3%]
tensordict> test/test_tensorclass.py .........................s..................... [  3%]
tensordict> .............s...........................                                [  4%]
tensordict> test/test_tensordict.py .......................................s........ [  4%]
tensordict> s....................................................................... [  4%]
tensordict> ..s..........s...................................s...................... [  4%]
tensordict> ........................................................................ [  4%]
tensordict> .....................s...............s.................................. [  5%]
tensordict> ........................................................................ [  5%]
tensordict> ........................................................................ [  5%]
tensordict> ........................................................................ [  5%]
tensordict> ........................................................................ [  6%]
tensordict> ................ssssssssssssssssssssssssssssssssssssssssssssssssssssssss [  6%]
tensordict> ssssssss................................................................ [  6%]
tensordict> ................................................................ssssssss [  6%]
tensordict> ssssssss................................................................ [  7%]
tensordict> ........................................................................ [  7%]
tensordict> ........................................................................ [  7%]
tensordict> .............................s...............s...............s.......... [  7%]
tensordict> .....s.................................................................. [  7%]
tensordict> ........................................................................ [  8%]
tensordict> ........................................................................ [  8%]
tensordict> ........................................................................ [  8%]
tensordict> ........................................................................ [  8%]
tensordict> ........................................................................ [  9%]
tensordict> ........................................................................ [  9%]
tensordict> ........................................................................ [  9%]
tensordict> ........................................................................ [  9%]
tensordict> .......................................................s.s.............. [ 10%]
tensordict> ........................................................................ [ 10%]
tensordict> ........................................................................ [ 10%]
tensordict> ........................................................................ [ 10%]
tensordict> ......................................................................ss [ 10%]
tensordict> s....ss.......sss....ss................................................. [ 11%]
tensordict> ........................s............................................... [ 11%]
tensordict> ........................................................................ [ 11%]
tensordict> ........................................................................ [ 11%]
tensordict> ........................................................................ [ 12%]
tensordict> ........................................................................ [ 12%]
tensordict> ........................................................................ [ 12%]
tensordict> ................................................................ssssssss [ 12%]
tensordict> ssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssss [ 13%]
tensordict> ssssssssssssssss........................................................ [ 13%]
tensordict> ........................................................................ [ 13%]
tensordict> ........................................................................ [ 13%]
tensordict> ..............................................................ss........ [ 13%]
tensordict> ........................................................................ [ 14%]
tensordict> ........................................................................ [ 14%]
tensordict> ........................................................................ [ 14%]
tensordict> ........................................................................ [ 14%]
tensordict> ........................................................................ [ 15%]
tensordict> ........................................................................ [ 15%]
tensordict> ........................................................................ [ 15%]
tensordict> ........................................................................ [ 15%]
tensordict> ........................................................................ [ 16%]
tensordict> ........................................................................ [ 16%]
tensordict> ........................................................................ [ 16%]
tensordict> ........................................................................ [ 16%]
tensordict> ........................................................................ [ 16%]
tensordict> ........................................................................ [ 17%]
tensordict> ........................................................................ [ 17%]
tensordict> ........................................................................ [ 17%]
tensordict> ........................................................................ [ 17%]
tensordict> ................s....................................................... [ 18%]
tensordict> .......................................................s.s.............. [ 18%]
tensordict> ........................................................................ [ 18%]
tensordict> ................................................................ssssssss [ 18%]
tensordict> ........................................................................ [ 19%]
tensordict> ...........................s............................................ [ 19%]
tensordict> ........................................................................ [ 19%]
tensordict> ........................................................................ [ 19%]
tensordict> ........................................................................ [ 20%]
tensordict> ........................................................................ [ 20%]
tensordict> ........................................................................ [ 20%]
tensordict> ........................................................................ [ 20%]
tensordict> ........................................................................ [ 20%]
tensordict> ...................................................sss.................. [ 21%]
tensordict> ........................................................................ [ 21%]
tensordict> ........................................................................ [ 21%]
tensordict> ........................................................................ [ 21%]
tensordict> ........................................................................ [ 22%]
tensordict> ..........ss.....s..ssssssssssssssssssssssssssss........................ [ 22%]
17:30:45
@hexa:lossy.network hexa (UTC+1)
:: (nixbld10) → /nix/store/svw8b4655f6w413xz23jjg6yn4b1d9p0-python3.12-tensordict-0.5.0
  UID     PID    PPID STIME     TIME COMMAND
30010  432250  432213 17:17 00:00:00 bash -e /nix/store/v6x3cs394jgqfbi0a42pam708flxaphh-default-builder.sh
30010  433687  432250 17:17 00:02:07 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30010  434047  433687 17:18 00:00:03 [pt_main_thread] <defunct>
30010  464302  433687 17:20 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30010  464342  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464382  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464422  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464462  433687 17:20 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30010  464502  433687 17:20 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30010  464542  433687 17:20 00:00:00 /nix/store/wfbjq35kxs6x83c3ncpfxdyl5gbhdx4h-python3-3.12.6/bin/python3.12 -m pytest -k not test_copy_onto and not test_mp and not test_functional and not test_linear and not test_seq and not test_seq_lmbda
30010  464582  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464622  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464662  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464702  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464742  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464766  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464822  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464862  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464903  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464945  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  464986  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465029  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465071  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465076  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465157  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465192  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465244  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465286  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465328  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465370  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465413  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465455  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465499  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465541  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465583  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465614  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465669  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465712  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465755  433687 17:20 00:00:00 [pt_main_thread] <defunct>
30010  465797  433687 17:20 00:00:00 [pt_main_thread] <defunct>
17:30:54
@hexa:lossy.network hexa (UTC+1) this is the current situation I'm seeing 17:30:59
@hexa:lossy.network hexa (UTC+1) I'm heading out for dinner, maybe it will complete in the next 2h 17:31:37
@hexa:lossy.network hexa (UTC+1)

┃ ⏵ python3.12-tensordict-0.5.0 (pytestCheckPhase) ⏱ 2h35m4s

19:52:24
@hexa:lossy.network hexa (UTC+1) hasn't moved one bit since 19:52:27
@ss:someonex.net SomeoneSerge (utc+3)
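Editor's note: in the listings above, one pytest parent has dozens of `[pt_main_thread] <defunct>` children, i.e. processes that have exited but have not been reaped by their parent. A minimal Python sketch for counting such entries per parent PID follows. It assumes the six-column layout pasted above (UID, PID, PPID, STIME, TIME, COMMAND); the function name `count_defunct_children` and the shortened sample are illustrative only and not part of any tool mentioned in the chat.

```python
from collections import Counter


def count_defunct_children(ps_output: str) -> Counter:
    """Count <defunct> entries per parent PID in a pasted process listing.

    Assumes whitespace-separated columns: UID PID PPID STIME TIME COMMAND.
    """
    counts = Counter()
    for line in ps_output.splitlines():
        # Split into at most six fields so COMMAND keeps its internal spaces.
        fields = line.split(None, 5)
        if len(fields) < 6 or not (fields[1].isdigit() and fields[2].isdigit()):
            continue  # skip the column header and any non-process lines
        ppid, command = int(fields[2]), fields[5]
        if command.rstrip().endswith("<defunct>"):
            counts[ppid] += 1
    return counts


# Shortened sample copied from the listing above.
sample = """\
  UID     PID    PPID STIME     TIME COMMAND
30010  432250  432213 17:17 00:00:00 bash -e /nix/store/v6x3cs394jgqfbi0a42pam708flxaphh-default-builder.sh
30010  434047  433687 17:18 00:00:03 [pt_main_thread] <defunct>
30010  464342  433687 17:20 00:00:00 [pt_main_thread] <defunct>
"""

print(count_defunct_children(sample))
```

Run against a full listing, this gives a quick way to see whether the number of unreaped children keeps growing between two snapshots or has stalled.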
In reply to @hexa:lossy.network
:: (nixbld10) → /nix/store/svw8b4655f6w413xz23jjg6yn4b1d9p0-python3.12-tensordict-0.5.0
Shall we make it pytest -n1
20:06:34
@hexa:lossy.network hexa (UTC+1) not sure yet 🙂 20:29:41
@hexa:lossy.network hexa (UTC+1) feel free to try 20:29:46
17 Oct 2024
@hexa:lossy.network hexa (UTC+1)
In reply to @ss:someonex.net
Shall we make it pytest -n1
doesn't use xdist
00:43:31
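Editor's note: the exchange above turns on the fact that `-n` is an option provided by the pytest-xdist plugin; without that plugin, pytest does not recognise the flag. A small, hedged Python sketch for checking whether the plugin is importable in a given environment is shown below. The helper name `plugin_available` is hypothetical; `xdist` is the import name of pytest-xdist.

```python
import importlib.util


def plugin_available(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


# "xdist" is the import name of the pytest-xdist plugin, which provides `-n`.
print("pytest-xdist available:", plugin_available("xdist"))
```

This only reports whether the module can be found; it says nothing about whether a package's check phase actually passes `-n` to pytest.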
