| 21 Jul 2025 |
connor (he/him) | Went ahead and merged it | 17:20:26 |
| 23 Jul 2025 |
apyh | oof the nccl version in nixpkgs is quite old now | 16:30:38 |
apyh | (quite old in the ml world, lol. only a month old) | 16:31:25 |
apyh | torchtitan needs torch 2.8, torch 2.8 requires nccl 2.27, gotta update nccl myself | 16:31:49 |
apyh | guess I'll pr to nixpkgs lol | 16:31:56 |
apyh | pr opened 😁 | 16:59:39 |
Gaétan Lepage | Can you share the link apyh? | 22:56:02 |
apyh | ah sure! https://github.com/NixOS/nixpkgs/pull/427804 | 23:00:23 |
apyh | they added a bunch of new stuff so i have to patch the shebang in a second python script. surprisingly didn't cause a build failure without it, just didn't export some of the new symbols | 23:01:02 |
Gaétan Lepage | Thanks! | 23:03:49 |
| 24 Jul 2025 |
apyh | huh. thanks for the nixpkgs-review. very strange to me that it fails to build pytorch as a result, but that the python 3.13 failure is just a bunch of .. warnings inside torch? i'll compile again locally to see.. | 14:56:24 |
apyh | Gaétan Lepage: can't repro the build failure locally for python312Packages.torchWithCuda 🤔 left a comment to that effect here: https://github.com/NixOS/nixpkgs/pull/427804#issuecomment-3114819745 | 20:26:13 |
apyh | can't repro any of the build failures in fact, only took 3.5 hours per torch to test 😭 | 23:51:03 |