!CcTBuBritXGywOEGWJ:matrix.org

NixOS Binary Cache Self-Hosting

173 Members
About how to host a very large-scale binary cache and more60 Servers

Load older messages


SenderMessageTime
23 Feb 2024
@whentze:matrix.orgWanja Hentze joined the room.12:22:33
@dritonr:matrix.orgdritonr joined the room.12:27:33
29 Feb 2024
@kip93:matrix.orgkip93 joined the room.14:33:52
1 Mar 2024
@patka_123:matrix.orgpatka joined the room.08:10:16
@fgaz:matrix.orgfgaz joined the room.09:27:14
2 Mar 2024
@nh2:matrix.orgnh2 joined the room.01:42:07
@nh2:matrix.orgnh2

raitobezarius: OK I joined. Copying my message from the other channel, for context for others:

You were looking at dedicated-hosting the binary cache regarding the AWS cost sink. I run 1 PB of CephFS clusters on Hetzner, and can set that up quite easily with NixOps. Do you want to team up on this topic?

My feeling is we could Host the 500 TB of binary cache on 6x SX134 servers (960 TB raw), with EC 4+2, which provides 580 TB HA storage. With 10 Gbit/s Internet.

For 6 * 245 = 1470 EUR/month.

To transfer out, with AWS Snowball: If we can content-deduplicate the 500 TB by factor 2x to 250 GB (using attic or bupstash as shown on https://github.com/NixOS/nixpkgs/issues/89380) the one-off cost to ship to Germany would be ~12k EUR or so.

01:46:11
@raitobezarius:matrix.orgraitobezariusYes, so the problem is that the Foundation doesn't have 1470EUR/mo01:46:27
@raitobezarius:matrix.orgraitobezariusEven if we offsetted the Snowball, it's further unclear01:46:48
@raitobezarius:matrix.orgraitobezarius(plus, discussing with Hetzner to do the operations)01:47:01
@raitobezarius:matrix.orgraitobezarius(BTW, infra people just removed NixOps — finally — :D — but I guess anything modern as deployment framework can work)01:47:28
@nh2:matrix.orgnh2 raitobezarius: what is the latest discourse post that shows the current spending on hosting and how much of that AWS gives us for free? 01:47:38
@raitobezarius:matrix.orgraitobezarius
In reply to @nh2:matrix.org
raitobezarius: what is the latest discourse post that shows the current spending on hosting and how much of that AWS gives us for free?
cache is 15K USD/mo
01:47:51
@raitobezarius:matrix.orgraitobezarius9K of it is free credit from AWS01:47:57
@raitobezarius:matrix.orgraitobezarius6K of it is out of our own pocket01:48:02
@nh2:matrix.orgnh2So I guess "the Foundation doesn't have 1470EUR/mo" is referring to "... on top of the AWS spending, vs instead of it"?01:48:48
@raitobezarius:matrix.orgraitobezariusI'd need to jump again in the finances to look at all of that01:49:37
@raitobezarius:matrix.orgraitobezariusbut I don't think we can support it on the long term that easily01:49:44
@raitobezarius:matrix.orgraitobezariuseven out of AWS spending01:49:50
@raitobezarius:matrix.orgraitobezariusthe big problem also is that running the operations to have a self hosted cache is currently difficult01:50:02
@raitobezarius:matrix.orgraitobezariusso it would have to be on the top of AWS spending to avoid disasters01:50:10
@raitobezarius:matrix.orgraitobezariusthere's a double problem that is01:50:32
@raitobezarius:matrix.orgraitobezarius(a) AWS offers a "nice" durability metric that cannot be reproduced with a Hetzner setup01:50:45
@raitobezarius:matrix.orgraitobezarius(b) Infra people may not have the bandwidth to touch this in the next months01:50:55
@raitobezarius:matrix.orgraitobezarius(c) Foundation may not have the money/energy, when it will come to it, to go for that solution01:51:11
@raitobezarius:matrix.orgraitobezarius(actually a triple problem)01:51:16
@raitobezarius:matrix.orgraitobezariusAnother solution I am personally pursuing is just to get 1PB myself and put it in a non-super-available colo but almost free colo and duplicate the data01:52:46
@nh2:matrix.orgnh2I mean for the 5k/month difference you can literally hire Ceph experts full time to run the cluster for you 01:54:25
@raitobezarius:matrix.orgraitobezariusyeah but 6K/mo not being sustainable means that even by throwing 5k/mo to hire Ceph experts full time, this won't be a viable solution, no?01:54:48
@raitobezarius:matrix.orgraitobezariusRight now, the trajectory the Foundation is taking is to launch garbage collection to reduce those 6K/mo01:55:19

Show newer messages


Back to Room ListRoom Version: 10