!eWOErHSaiddIbsUNsJ:nixos.org

NixOS CUDA

290 Members
CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda57 Servers

Load older messages


SenderMessageTime
31 Aug 2022
@tpw_rules:matrix.orgtpw_rulesthe tl;dr is that almost every generation of card requires binaries that target it specifically. binaries can't be used on cards that don't match. there are pre-binary forms which can be JITed into binaries in most circumstances, but performance can suffer. distributors would compile all binaries possible, but for the user who wants it to work on just their card, not doing that can save literal hours of compilation time00:46:47
@tpw_rules:matrix.orgtpw_rules * the tl;dr is that almost every generation of card requires binaries that target it specifically. binaries can't be used on cards that don't match. there are pre-binary forms which can be JITed into binaries in most circumstances, but performance can suffer. distributors usually compile all binaries possible, but for the user who wants it to work on just their card and has to compile from source, which is everybody for nixpkgs cuda stuff, not doing that can save literal hours of compilation time00:47:09
@tpw_rules:matrix.orgtpw_rules"all binaries possible" also depends on the cuda library version and specific package capabilities00:47:46
@skainswo:matrix.orgSamuel AinsworthIIUC the tradeoff here is between user compile times and the size of cached builds, ie. not every user needs every arch but we support more than one so that users don't have to rebuild locally00:47:57
@tpw_rules:matrix.orgtpw_rulesi wonder what percentage of nixpkgs cuda users use cached builds. i think it's very low00:48:29
@skainswo:matrix.orgSamuel Ainsworththere's no established guidelines for this atm, packages set their own cuda arch's independently00:48:39
@tpw_rules:matrix.orgtpw_rulesmaybe i am wrong00:48:41
@skainswo:matrix.orgSamuel AinsworthI would actually assume it's quite high. tensorflowWithCuda is something like 48 CPU-hours to build00:49:02
@skainswo:matrix.orgSamuel Ainsworthor maybe 24... I don't remember exactly00:49:09
@skainswo:matrix.orgSamuel Ainsworthbut it's a big boy00:49:12
@tpw_rules:matrix.orgtpw_rulesyea, that's why i have a 48 core server as a remote builder and don't update my nixpkgs set except every 6 months :)00:49:41
@skainswo:matrix.orgSamuel Ainsworthhehe lucky you00:49:53
@skainswo:matrix.orgSamuel Ainsworththis was exactly why we built out the cachix cache00:50:16
@skainswo:matrix.orgSamuel Ainsworthhttps://app.cachix.org/cache/cuda-maintainers#pull00:50:34
@skainswo:matrix.orgSamuel Ainsworthand full wiki for context: https://nixos.wiki/wiki/CUDA00:50:51
@tpw_rules:matrix.orgtpw_rulesbut in any case, being able to know what you'll get and how to ask for it would be good improvements to make, even if we leave the default at "all possible" is a good thing00:50:52
@skainswo:matrix.orgSamuel Ainsworthyeah, that's fair00:51:08
@skainswo:matrix.orgSamuel Ainsworthwould be nice to get all packages aligned on how to do this in a consistent manner00:51:30
@tpw_rules:matrix.orgtpw_rulesyeah i've used that before, but the fact that it GCs old stuff spooks me a little. like i said i don't update much and not just because of build times, so being forced to upgrade or have to suffer the compile anyway in the future i don't like00:51:47
@skainswo:matrix.orgSamuel AinsworthI haven't encountered missing things due to GC myself, but I can understand the concern00:52:23
@tpw_rules:matrix.orgtpw_rulesisn't that the deal with cachix? it will only keep around the latest builds? i figured with how hongry these derivations are that time would come quickly00:52:46
@tpw_rules:matrix.orgtpw_rulesbut if it's been going for many months now and it hasn't had to GC anything i will reconsider my stance00:53:19
@skainswo:matrix.orgSamuel Ainsworth Domen Kožar has graciously offered the group some nice headroom. o/w ur right we would burn through the cache like every day 00:53:52
@skainswo:matrix.orgSamuel Ainsworthdef show cachix some love if you can!00:54:32
@tpw_rules:matrix.orgtpw_rulesi've said it before but it's still true, thanks very much for leading this. i know the road is bumpy but the value is immense00:55:00
@skainswo:matrix.orgSamuel Ainsworth Thank you so much tpw_rules ! I really appreciate that. It can be grueling at times, so I'm really happy to hear your words of encouragement 00:56:05
@skainswo:matrix.orgSamuel Ainsworthglad that our work is valuable to you as well!00:56:21
@hexa:lossy.networkhexasank a few hours into setting up stable-diffusion02:20:37
@hexa:lossy.networkhexano dice with rocm on my 5700 XT (NAVI 10)02:20:48
@hexa:lossy.networkhexaI hate gpus02:20:56

Show newer messages


Back to Room ListRoom Version: 9