NixOS CUDA | 311 Members | |
| CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda | 61 Servers |
| Sender | Message | Time |
|---|---|---|
| 19 May 2024 | ||
| An RTX 3080 is too old?! | 23:01:15 | |
In reply to @aidalgol:matrix.org: Did you build with cudaCapabilities = [ ... "8.6" ... ]? | 23:49:24 | |
| * Oh... | 23:51:03 | |
| (Did I just edit a message instead of sending a new one again?) | 23:51:45 | |
I just cloned https://github.com/ConnorBaker/nix-cuda-test.git, applied the fix described earlier, then ran nix run .#nix-cuda-test | 23:52:36 | |
In reply to @aidalgol:matrix.org: So yeah there was a question about cudaCapabilities = [ ... "8.6" ] | 23:52:41 | |
| I saw it. 👍️ | 23:52:53 | |
In reply to @aidalgol:matrix.org: Our culprit: https://github.com/ConnorBaker/nix-cuda-test/blob/182c2148e6df0932fe19f9cb7180173ee2f9cb2d/flake.nix#L66 | 23:53:08 | |
| 20 May 2024 | ||
| Ah yeah sorry I usually just have it set to my GPU to speed up compile since I use it to test PRs | 00:22:23 | |
| Sooo... while that was building, KDE crashed to the display manager, and now the GPU usage is showing a non-zero value as I expected earlier. I have no idea what could have changed. | 01:12:24 | |
| Maybe it forced the driver to reload lol | 03:46:33 | |
| connor (he/him) (UTC-5): Samuel Ainsworth: Madoura: 1 I'd like us to add a more generic alias to this room that would encompass a wider range of topics, rocm at the least. The reasoning is the same: i don't think we need a special room for cuda... Madoura: what do rocm-maintainers think about their presence on matrix? | 14:20:05 | |
| Could call the room nixos-gpu since that covers rocm too. | 14:21:23 | |
In reply to @trexd:matrix.org: Yes, gpu/coprocessors/accelerators/scicomp/ai even/anything in that direction. Well there already is nixos hpc and nixos data science but I don't see much conversation there, what has to be changed to spark conversations? There's activity in matthewcroughan's flake room, and sometimes here, but not in nixos ds. | 14:34:08 | |
| To tackle diamond dependencies (among other things), I started making https://github.com/ConnorBaker/dep-tracker Specify a flake attribute for a package and it’ll grab a copy of all the attributes on the package containing a list of dependencies (the attributes it looks for are here https://github.com/ConnorBaker/dep-tracker/blob/cd8e927c561f3f1ed5c904609654c946d85cf954/packages/dep-tracker/dep_tracker/types.py#L15). It’ll look through those arrays and populate a SQLite database with libraries it finds in those dependencies. Now, a question: besides recursing and doing the same for every dependency I find (that is, harvesting attributes and updating the database), is there an easier way to get the closure of dependencies without building the package? IIRC A different question: with that hardcoded list of attributes I inspect, is it possible I’d miss dependencies (and therefore libraries) which are present in the closure? @someoneserge you have good ideas about finding dependencies — any suggestions? Currently finding what a dependency provides is limited to listing the names of libraries present under | 17:57:11 | |
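The "recurse over every dependency" approach described above can be sketched roughly as follows. This is a minimal illustration, not dep-tracker's actual code: the attribute names in `DEP_ATTRS`, the toy `PACKAGES` mapping, and the `dep_edges` table are all assumptions made for the example, and no Nix evaluation happens here. It walks the dependency attributes breadth-first, records each edge in SQLite, and lists packages reached from more than one parent, which is where a diamond could arise.

```python
import sqlite3
from collections import deque

# Assumed attribute names for illustration; the real list is in dep-tracker's types.py.
DEP_ATTRS = ("buildInputs", "propagatedBuildInputs")

# Hypothetical stand-in for attributes harvested from evaluated packages.
PACKAGES = {
    "app": {"buildInputs": ["libA", "libB"], "propagatedBuildInputs": []},
    "libA": {"buildInputs": ["libC"], "propagatedBuildInputs": []},
    "libB": {"buildInputs": [], "propagatedBuildInputs": ["libC"]},
    "libC": {"buildInputs": [], "propagatedBuildInputs": []},
}


def dependency_closure(root, packages):
    """Breadth-first walk over the dependency attributes, expanding each package once."""
    seen = {root}
    edges = []
    queue = deque([root])
    while queue:
        name = queue.popleft()
        attrs = packages.get(name, {})
        for attr in DEP_ATTRS:
            for dep in attrs.get(attr, []):
                edges.append((name, attr, dep))
                if dep not in seen:
                    seen.add(dep)
                    queue.append(dep)
    return seen, edges


def store_edges(conn, edges):
    """Persist (parent, attribute, child) edges in a hypothetical dep_edges table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dep_edges (parent TEXT, attr TEXT, child TEXT)"
    )
    conn.executemany("INSERT INTO dep_edges VALUES (?, ?, ?)", edges)
    conn.commit()


def shared_dependencies(conn):
    """Return packages reached from more than one distinct parent."""
    rows = conn.execute(
        "SELECT child FROM dep_edges "
        "GROUP BY child HAVING COUNT(DISTINCT parent) > 1 ORDER BY child"
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    closure, edges = dependency_closure("app", PACKAGES)
    conn = sqlite3.connect(":memory:")
    store_edges(conn, edges)
    print(sorted(closure))
    print(shared_dependencies(conn))
```

Because the walk only follows the attributes named in `DEP_ATTRS`, anything that enters the closure through another route would be missed, which is the concern raised in the second question above.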
| is there some reason we've got an out of date tensorflow build from source? | 18:07:41 | |
| oh maybe the bin has cuda support now? | 18:08:17 | |
For deciding which dependencies to retain a runtime reference to | 18:11:59 | |
| Have you seen https://fzakaria.com/2023/09/11/quick-insights-using-sqlelf.html? | 18:17:01 | |
In reply to @tpw_rules:matrix.org: Because a >200 IQ is necessary to grasp this derivation 😅 | 18:19:34 | |
In reply to @glepage:matrix.org: What's the TLDR on why tensorflow is so difficult to package if you don't mind me asking? Maybe this is another example of "packaging is a hard problem" that I can add to my Nix pitch slides. | 18:27:20 | |
| Well, the [tensorflow derivation](https://github.com/NixOS/nixpkgs/blob/master/pkgs/development/python-modules/tensorflow/default.nix) is ~600 lines of hacking around the bazel build system, + doing a bunch of hacks to inject our own dependencies + CUDA stuff... All of this requires a lot of expertise (that I personally lack). It is surely one of the hardest packages that I am aware of in the python package set. | 18:36:24 | |
| Now, if you want to take this challenge and update tensorflow, please go ahead. For context, there is a stale PR for updating tensorflow to 2.14: https://github.com/NixOS/nixpkgs/pull/272838 | 18:38:02 | |
In reply to @glepage:matrix.org: I'm already full up with packaging hasktorch haha! Yeah I've had a look at the derivation before and it seems nuts. | 18:49:58 | |
In reply to @connorbaker:matrix.org: I think that's what it is, yeah. This seems to break upon resuming the machine from suspend. | 20:06:08 | |
In reply to @ss:someonex.net: I have! I saw Farid give a presentation on it at NixCon NA and that was neat; but it’s not packaged in Nixpkgs and I don’t want to do it :/ | 20:58:45 | |
| does the tensorflow-bin have cuda support? | 21:41:29 | |
| Hmm I guess so | 22:01:54 | |