!CcTBuBritXGywOEGWJ:matrix.org

NixOS Binary Cache Self-Hosting

158 Members
About how to host a very large-scale binary cache and more
54 Servers



Sender Message Time
4 Mar 2024
@edef1c:matrix.org (edef): the perf requirements on the storage backends are really loose, except for serving 404s [14:50:34]
@edef1c:matrix.org (edef): * the perf requirements on the storage backends are really loose, except for serving narinfo 404s [14:50:55]
@edef1c:matrix.org (edef): but narinfo keys are only 5G of stuff, we can serve 404s basically however we want [14:51:22]
@edef1c:matrix.org (edef): any actual serving ends up being from fastly and too heavily cached for the backend's perf to matter [14:52:17]
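A minimal sketch of the narinfo 404 point above, assuming the full set of narinfo keys fits in memory on whatever host answers the requests. The class `NarinfoIndex` and its method `early_status` are hypothetical names and not part of any Nix tooling; only the `/<hash>.narinfo` path shape with a 32-character Nix base32 hash reflects the real cache layout.

```python
import re
from typing import Iterable, Optional

# Nix base32 omits the letters e, o, t and u; narinfo keys are 32 characters.
NARINFO_PATH = re.compile(r"^/([0-9a-df-np-sv-z]{32})\.narinfo$")


class NarinfoIndex:
    """Hypothetical in-memory index of known narinfo hashes."""

    def __init__(self, known_hashes: Iterable[str]) -> None:
        self._known = frozenset(known_hashes)

    def early_status(self, path: str) -> Optional[int]:
        """Return 404 if `path` is a narinfo request for an unknown hash.

        Return None for everything else, meaning the request should be
        forwarded to the storage backend (or CDN) as usual.
        """
        match = NARINFO_PATH.match(path)
        if match is None:
            return None  # not a narinfo request, nothing to short-circuit
        if match.group(1) in self._known:
            return None  # key exists, let the normal serving path handle it
        return 404  # unknown key, answer without touching the backend
```

A plain Python set is only for illustration; with roughly 5G of keys a real deployment would probably want a more compact structure, but the lookup logic would stay the same.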
@zimbatm:numtide.com (Jonas Chevalier): nh2: do you want to join the infra meeting on Thursday 18:00 GMT+1 and hash this out with us? [14:55:08]
@raitobezarius:matrix.org (raitobezarius): Isn't delroth going to be off? [14:55:46]
@raitobezarius:matrix.org (raitobezarius): I think it's good to have delroth on those discussions [14:55:57]
@zimbatm:numtide.com (Jonas Chevalier): it's fine, we already discussed this [14:57:12]
@zimbatm:numtide.com (Jonas Chevalier): we have the overall ideas but what's missing is to map some of the unknowns, like delroth said: can we make this sustainable, what a migration path looks like, ... [15:00:45]
@zimbatm:numtide.com (Jonas Chevalier): we might be able to port other things than the cache first [15:01:56]
@edef1c:matrix.org (edef): so like, my biggest proposal wrt the cache GC is that we aggregate the "deleted" data into Glacier Deep Archive, as large objects [22:12:32]
@edef1c:matrix.org (edef): that locks us in for 6 months, but if there are no other takers, i'll put down the $3k to buy myself 6 months of development time for an exit strategy from AWS for that data [22:13:19]
@edef1c:matrix.org (edef): it should dedupe quite well, but my biggest issue is simply time pressure [22:14:24]
@edef1c:matrix.org (edef): i don't intend to lose a single byte of that historical data but the stress from trying to do all of this fast is weighing on me [22:16:33]
@edef1c:matrix.org (edef): the narinfo dataset is archived in several places now, that part we have covered [22:17:52]
@raitobezarius:matrix.org (raitobezarius), in reply to @edef1c:matrix.org:
> that locks us in for 6 months, but if there are no other takers, i'll put down the $3k to buy myself 6 months of development time for an exit strategy from AWS for that data
we collected enough money to put the 3K as part of the "binary cache niceties" budget [22:18:04]
@raitobezarius:matrix.org (raitobezarius): fwiw [22:18:09]
@edef1c:matrix.org (edef): sure, that works for me, $3k is def still a meaningful cost for me [22:18:32]
@edef1c:matrix.org (edef): i just know that history would not judge me kindly if i let this data go to /dev/null [22:19:13]
5 Mar 2024
@nh2:matrix.org (nh2), in reply to @zimbatm:numtide.com:
> nh2: do you want to join the infra meeting on Thursday 18:00 GMT+1 and hash this out with us?
Unfortunately I'll be on a train at that time, so my ability to join may be reduced [02:30:40]
@nh2:matrix.org (nh2), in reply to @edef1c:matrix.org:
> so like, my biggest proposal wrt the cache GC is that we aggregate the "deleted" data into Glacier Deep Archive, as large objects
edef: What will be the cost of getting them out again, just to be sure that it won't be forbiddingly large? [02:31:47]
@edef1c:matrix.org (edef): batch restores are free, they just have 12h latency [03:32:17]
@edef1c:matrix.org (edef): restores happen to S3 reduced redundancy but we'd only need to float a small fraction of the dataset at a time [03:33:31]
@nh2:matrix.org (nh2): I see, that makes sense [03:34:59]
@edef1c:matrix.org (edef): so we can tune that for however much compute we want to throw at it in parallel [03:35:31]
@edef1c:matrix.org (edef): i can run some numbers wrt the best bang-per-buck there but not right this second [03:36:12]
@edef1c:matrix.org (edef): basically depends on what the supply curve for EC2 spot compute looks like [03:36:38]
@nh2:matrix.org (nh2): For Ceph hosting, do we know what the IOPS of cache.nixos.org are, just to see if some basic small cluster on HDDs could handle it? [03:39:18]
@edef1c:matrix.org (edef): presumably you want backend I/O, ie to the S3 bucket? [03:40:25]
@nh2:matrix.org (nh2): yes, that would be the equivalent of what would hit the disks [03:41:04]



Room Version: 10