NixOS CUDA | 289 Members | |
| CUDA packages maintenance and support in nixpkgs | https://github.com/orgs/NixOS/projects/27/ | https://nixos.org/manual/nixpkgs/unstable/#cuda | 57 Servers |
| Sender | Message | Time |
|---|---|---|
| 15 Feb 2025 | ||
| Well, from whose perspective? From the PoV of the community, definitely a goal. As far as selling this idea to commercial entities goes, they couldn't care less, but we should advertise it as a prerequisite, because we need a cache to make development/maintenance reasonably efficient, and it might as well be a public cache | 10:50:21 | |
| I imagine that the amount and size of builds would make cachix or other cloud storage unfeasible. If it was only a dev cache could probably get away with just serving it off the CI master, if it was a proper public cache with non-trivial amount of users probably want a dedicated machine (or more than one if you want to keep the cache around for a while). | 11:05:52 | |
| Let's fill a rack with compute and storage! | 11:07:01 | |
In reply to @zowoq:matrix.orgI guess it would be more a dev cache I guess. | 11:07:18 | |
| * I guess it would be more a dev cache. | 11:07:21 | |
| I think we're probably close to users having problems with the cachix cache expiring too quickly. Our setup doesn't allow us to selectively push to the cache, everything that goes through CI gets pushed. Would need to move these jobs to another hydra/machine but that would also avoid needing to deal with this:
| 11:13:25 | |
| One thing that SomeoneSerge (UTC+U[-12,12]) | 11:14:27 | |
| * One thing that SomeoneSerge (UTC+U[-12,12]) touched on was to encourage companies to contribute (financially) to nix-community. | 11:15:12 | |
| It'll be less initial setup if a proper public cache isn't need. | 11:15:28 | |
True. Maybe we should allocate setting up a tvix nar-bridge as a substituter as a separate task, so that public cache can still be a thing xD | 11:29:04 | |
Indeed, that's one way to get started at least | 11:32:00 | |
| But with the intention of building a scalable persistent-ish cache later | 11:32:56 | |
| Running another dedicated machine (e.g. cheapish hetzner box) with just hydra and harmonia for the cache isn't a problem and spot instances for builders wouldn't be much maintenance overhead. Scope, funding, etc. are questions that I'll leave for @zimbatm. | 12:27:05 | |
Not 100% sure, what do you mean? The problematic cuda versions are 11.0-11.3. 11.4 and later have individual redistributables. 10.x are already deprecated/removed from nixpkgs, so no need to worry about those. | 18:22:06 | |
| * Not 100% sure, what do you mean? The problematic cuda versions are
| 18:22:29 | |
In reply to @ruroruro:matrix.orgFor what it’s worth 11.x will be removed prior to 25.05 from what I remember | 19:34:06 | |
| 19:48:53 | ||
In reply to @ss:someonex.netTo this end, would public financial support for this project allow for anonymous contributions? | 19:52:24 | |
| I think OpenCollective allows anonymous donations (in the sense of hiding the source from the public, but not from the project owners) | 19:53:21 | |
In reply to @zowoq:matrix.orgAll substituter implantations today are centralized, right? It'd be neat to build one on top of IPFS, or similar, for example. In my head, no one host would control any one store path and something like Shamir secret sharing could support k of n hosts being able to sign a store path. | 19:54:44 | |
| * All substituter implementations today are centralized, right? It'd be neat to build one on top of IPFS, or similar, for example. In my head, no one host would control any one store path and something like Shamir secret sharing could support k of n hosts being able to sign a store path. | 19:55:02 | |
In reply to @ss:someonex.netThis is potentially dangerous. Is there any effort that you know of for nix-community or you guys to work toward a funding solution which obscures this information from even OpenCollective, much less the project owners? | 19:56:34 | |
| * This is potentially dangerous. Is there any effort that you know of for nix-community or you guys (CUDA dudes) to work toward a funding solution which obscures this information from even OpenCollective, much less the project owners? | 19:57:01 | |
| In what sense would this be dangerous for nix-community specifically? | 19:57:47 | |
| For the contributor, not nix-community. | 19:58:01 | |
| Although, if they spend money that can be proven to derive from criminal action then it could endanger nix-community, too. Not thinking about that, though. | 19:58:29 | |
| * Although, if they spend money that can be proven to have been derived from criminal action then it could endanger nix-community, too. Not thinking about that, though. | 19:58:46 | |
| Regarding the ipfs message. That's not so much a substituter problem (an nginx instance with a local directory is more or less a substituter) as it is a build queue/hydra problem: afaiu we have no better solution for managing keys, than to copy outputs from all builders to a dedicated signer machine, nor do we have ready tools yet for doing this kind of cross-validation you describe. IPFS is more relevant for egress/distribution, which I'd say is a secondary concern right now. In either case, this is more of a research question, whereas with the CI we want to an immediate need with available tools | 20:10:41 | |
In reply to @ss:someonex.netSorry, didn't mean to imply that this idea solves any problem that you had, here. So definitely off-topic. | 20:12:43 | |
| Just a drive-by thought. | 20:12:52 | |