Nix Data Science | 297 Members | |
| 62 Servers |
| Sender | Message | Time |
|---|---|---|
| 30 Aug 2022 | ||
| 14:56:16 | ||
| 16:51:54 | ||
| 5 Sep 2022 | ||
| 05:39:36 | ||
| 18:21:51 | ||
| 6 Sep 2022 | ||
| Nix is attractive for reproducibility and provenance tracking. However, for ML workflows one major limitation is that all data produced as part of a derivation resides in /nix/store. It would be great to be able to define flakes for large datasets and use them as inputs to other programs/flakes. For example define a flake for ImageNet and then get train and test splits as outputs of those downstream flakes. Currently such a train/test split would require duplication of the original dataset twice into /nix/store if I'm not mistaken. | 15:34:39 | |
| apologies if there is an existing solution to this kind of data handling with nix that I'm unaware of | 15:35:12 | |
| i guess the splitting could of course be done with symlinks. main question is then is there any hope of handling large datasets at rest on some storage outside /nix | 15:39:06 | |
| Nix master supports impure derivations. Impure derivations have network access, so you could write your result somewhere else. I have not tried this yet, but I imagine this going to be very useful also in CI/CD. | 17:04:49 | |
| https://github.com/NixOS/nix/pull/6227 | 17:05:19 | |
| thanks. I currently use --impure on non-NixOS with nixGL to get cuda working anyway, so that may be the ticket | 17:07:24 | |
| that's not the same though, that's impure evaluation | 17:08:40 | |
| i see that now. should've read before replying.. | 17:09:23 | |
| 7 Sep 2022 | ||
| Has anyone got a working example of
...but it keeps segfaulting later when I prompt What helps is running the | 12:43:40 | |
| 15 Sep 2022 | ||
| 12:10:46 | ||
| 12:10:49 | ||
| 16 Sep 2022 | ||
| 07:37:08 | ||
| 18:17:04 | ||
In reply to @flxai:matrix.orgTry using jupyterWith + poetry2nix with flake.nix! It's hard but fun! | 18:33:55 | |
| sepiabrown: Thanks, this sounds like a good mix. Do you have configs lieing around somewhere? | 18:38:49 | |
In reply to @flxai:matrix.org https://github.com/sepiabrown/jupyterWith_poetry2nix If you are new to NixOS, the above solution may be hard! If other advanced user could have a look at my code and have any comment for improvement, it would be appreciated! | 19:22:32 | |
| Thanks. I'll have a look the following days and try my best. Not that comfortable with Nix and its components yet | 19:28:06 | |
| 18 Sep 2022 | ||
| 17:20:55 | ||
| 24 Sep 2022 | ||
| 10:45:47 | ||
| 30 Sep 2022 | ||
| On the topic of poetry, do you guys have any nice methods for getting an editable install using pure nix? I use a flake similar to above with poetry2nix, which works great for creating a derivation for a python package, but I then need to run poetry shell from inside a devshell (through direnv) to get editable install. Poetry shell creates a virtualenv in ~/.cache/pypoetry/virtualenvs, and the module versions installed there can sometimes differ from those in the devshell. | 00:43:25 | |
| poetry lock works well of course, but for packaging in nix i'd like to do most testing with the versions specified in flake.lock rather than poetry.lock, if possible. For now i do active development in poetry shell, then run tests there and again in nix | 00:45:57 | |
| 01:55:14 | ||
| 02:40:36 | ||
| 11:22:50 | ||
| 9 Oct 2022 | ||
| 17:52:33 | ||
| 15 Oct 2022 | ||
| 00:33:53 | ||