!CcTBuBritXGywOEGWJ:matrix.org

NixOS Binary Cache Self-Hosting

168 Members
About how to host a very large-scale binary cache and more
57 Servers

Sender | Message | Time
2 Mar 2024
@raitobezarius:matrix.org (raitobezarius) | I'm giving all those values from memory | 22:35:43
@raitobezarius:matrix.org (raitobezarius) | but then I think we can say ≥ 3K | 22:35:49
@delroth:delroth.net
In reply to @raitobezarius:matrix.org
maybe I'm wrong, I don't have access to cost explorer
I have access to cost explorer but I don't think I know how to interpret it - I'm pretty sure the $9K/month discount is applied weirdly in there
22:37:34
@raitobezarius:matrix.org (raitobezarius) | mmm, I remember we were able to isolate the bandwidth costs without the 9K/mo there but yeah | 22:37:57
@raitobezarius:matrix.org (raitobezarius) | during a meeting with zimbatm with the S3 folks | 22:38:06
@nh2:matrix.org (nh2)
In reply to @delroth:delroth.net
(AFAIK nobody has made a proper call on what kind of availability target we'd like to hit, so it's hard to know what kind of HA requirements as well as staffing we'd need)
For comparison, my startup is 3 people. We have run a Ceph cluster for 4 years, backing a public-facing product, and the Ceph cluster has had higher uptime / availability than S3 over those 4 years.
In practice, only I maintain the cluster, and spend less than 2% of my working time on it.
22:38:39
@delroth:delroth.net | out.png (attachment) | 22:42:38
@delroth:delroth.net | possibly pre-tax | 22:42:54
@delroth:delroth.net | so might need to multiply by 1.21 | 22:43:01
@raitobezarius:matrix.org (raitobezarius) | pre-tax | 22:43:01
@raitobezarius:matrix.org (raitobezarius) | tax events happen at "point in time" | 22:43:06
@delroth:delroth.net | (that's Jan 2024) | 22:43:14
@raitobezarius:matrix.org (raitobezarius) | (and cause a spike in the cost explorer IIRC) | 22:43:17
@delroth:delroth.net | so if we could spend e.g. $250/month on reducing those $3500 down to $500 or so... I think we'd have a lot of space to experiment with how to handle the cold storage | 22:55:50
@delroth:delroth.net | and I don't think it's a significant effort to get there | 22:56:03
@delroth:delroth.net | well, it's definitely a few months of planning and work | 22:56:20
@delroth:delroth.net | but it's maybe 10x less effort than a full migration away from S3? | 22:57:04
@delroth:delroth.net | also postpones solving the availability problem since you can failover | 22:57:40
@delroth:delroth.net | (assuming we keep dual-writing) | 22:57:49
@delroth:delroth.net | * (assuming we keep dual-writing, which we should) | 22:58:07
@nh2:matrix.org (nh2)
In reply to @delroth:delroth.net
so if we could spend e.g. $250/month on reducing those $3500 down to $500 or so... I think we'd have a lot of space to experiment with how to handle the cold storage
That's a good point. Make a cheaper cluster for serving, where the costs of the additional cluster (traffic + storage) are lower than the S3 traffic costs it saves.
23:02:20
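The break-even condition nh2 describes can be sketched as a small calculation. This is only an illustration: the figures ($3500/month of S3 traffic reduced to about $500, for about $250/month of extra cluster cost) are the rough examples delroth gave earlier in the conversation, they are pre-tax, and the function names are hypothetical.

```python
def net_monthly_saving(s3_before, s3_after, cluster_cost):
    """Money saved per month after paying for the extra serving cluster."""
    return (s3_before - s3_after) - cluster_cost


def is_worthwhile(s3_before, s3_after, cluster_cost):
    """True when the extra cluster costs less than the S3 traffic money it saves."""
    return cluster_cost < (s3_before - s3_after)


# Rough example figures from the discussion (USD per month, pre-tax).
print(net_monthly_saving(s3_before=3500, s3_after=500, cluster_cost=250))
print(is_worthwhile(s3_before=3500, s3_after=500, cluster_cost=250))
```

With those example numbers the serving cluster pays for itself many times over; the condition only fails if the cluster cost approaches the S3 traffic reduction it achieves.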
@delroth:delroth.net | another benefit is that we can use that new cache infra as an on-ramp into the more complex full S3 migration | 23:03:04
@delroth:delroth.net | if we build it roughly how we'd build the final stack except smaller | 23:03:26
@delroth:delroth.net | which has a lot of value in making people feel more comfortable with maintaining the infra long term, getting familiar with the tech stack, etc. | 23:03:52
3 Mar 2024
@zimbatm:numtide.com (Jonas Chevalier) | side note, but nh2: you might also be interested in joining the archivist channel with flokli and edef, where it's more about content-deduplication: #archivists:nixos.org. the volume is quite slow, but feel free to ask them questions. | 12:08:27
@zimbatm:numtide.com (Jonas Chevalier) | the main thing that is holding the dedupe back is hitting the AWS pricing, so moving most or everything to self-hosted would make things easier | 12:09:14
@zimbatm:numtide.com (Jonas Chevalier) | raitobezarius: I'm not too concerned about a sustained 2k/month cost. I know at least one company willing to sponsor 100k / year. The only thing that is missing is for the foundation to approach them with a proposal that makes sense. And I think there are more companies out there like that. | 12:11:40
@zimbatm:numtide.com (Jonas Chevalier) | we've been circling this particular issue for a while now. let's take the plunge. especially if nh2 is willing to help set things up and teach us about Ceph. we could order 3 servers and get a prototype up and running. | 12:13:49
@raitobezarius:matrix.org (raitobezarius)
In reply to @zimbatm:numtide.com
raitobezarius: I'm not too concerned about a sustained 2k/month cost. I know at least one company willing to sponsor 100k / year. The only thing that is missing is for the foundation to approach them with a proposal that makes sense. And I think there are more companies out there like that.
If we are confident in this, I agree :)
13:12:08
@delroth:delroth.net
In reply to @zimbatm:numtide.com
we've been circling this particular issue for a while now. let's take the plunge. especially if nh2 is willing to help set things up and teach us about Ceph. we could order 3 servers and get a prototype up and running.
by "let's take the plunge" and "teach us", who is "us"? Because even if we have a Ceph cluster running, without the required work to actually use it to displace AWS usage, it's just an extra liability. And imo that work is more involved than the work required to set up a Ceph cluster, if only because nobody has even charted in detail what needs to be done...
22:15:14

Room Version: 10