!CcTBuBritXGywOEGWJ:matrix.org

NixOS Binary Cache Self-Hosting

173 Members
About how to host a very large-scale binary cache and more60 Servers

Load older messages


SenderMessageTime
6 Mar 2024
@edef1c:matrix.orgedefit's really not a big deal16:50:40
@raitobezarius:matrix.orgraitobezarius
In reply to @edef1c:matrix.org
keeping a disaster recovery copy is cheap
Yes but this is not what we were discussing in that context :p
16:52:25
@raitobezarius:matrix.orgraitobezariusYes, we can keep additional copies16:52:34
@raitobezarius:matrix.orgraitobezarius(so that healing is magic and what not)16:52:44
@edef1c:matrix.orgedef in general, throwing around numbers of nines is cute, but none of it means anything until you're specifying what SLOs these are for 17:01:06
@edef1c:matrix.orgedefseven nines of durability+availability doesn't mean anything by itself if we're not specifying the details17:02:14
@edef1c:matrix.orgedefavailable within what timescale? we can get a lot of nines for durability and availability within 14 days latency, we can all keep copies on tape in our basement17:03:18
@edef1c:matrix.orgedefbut the question that should be driving the SLO conversation is "what goals do these SLOs accomplish"17:05:11
@edef1c:matrix.orgedef
In reply to @whentze:matrix.org
do any cloud providers even have 7-nines SLAs for this kind of service?
Glacier, including Glacier Instant Retrieval, claims to offer 11 nines of durability
17:08:24
@edef1c:matrix.orgedefi don't have empirical data to support that but i would treat that number with some skepticism17:08:37
@raitobezarius:matrix.orgraitobezarius
In reply to @edef1c:matrix.org
i don't have empirical data to support that but i would treat that number with some skepticism
I know durability loss in AWS
17:09:43
@raitobezarius:matrix.orgraitobezarius
In reply to @edef1c:matrix.org
i don't have empirical data to support that but i would treat that number with some skepticism
* I know of durability loss stories in AWS
17:09:48
@raitobezarius:matrix.orgraitobezariusAnd it was under the SLO17:09:57
@raitobezarius:matrix.orgraitobezarius(yada yada when you know how the sausage is made you know why this happens and in which scale, etC.)17:10:25
@raitobezarius:matrix.orgraitobezarius * (yada yada when you know how the sausage is made you know why this happens and in which scale, etc.)17:10:26
@raitobezarius:matrix.orgraitobezarius But I don't think we need to convince folks of the fact that SLO is a budget and you burn into it 17:11:12
@raitobezarius:matrix.orgraitobezariusIt's not a mathematical guarantee17:11:21
@edef1c:matrix.orgedef
In reply to @edef1c:matrix.org
in general, throwing around numbers of nines is cute, but none of it means anything until you're specifying what SLOs these are for
basically i don't really want to be all "you need to be this tall to ride, please have actual irl ops experience at scale to talk at all" but if people want to participate in conversations like this and make demands about SLOs/SLAs i'd like them to at least read the SRE at Google book and learn to think about this well, it's literally free
17:11:27
@zimbatm:numtide.comJonas ChevalierI'll talk to Domen and sort this out. I think he is motivated to raise funds, which could be pretty helpful. But yeah..17:12:32
@edef1c:matrix.orgedefi would like to not lose a single bit of the cache, but i know it's statistically unlikely i can guarantee this; in expectation, we've already lost some17:12:33
@edef1c:matrix.orgedefbut i do think we can buy a lot of latitude for mistakes on the serving/hot storage stack if we have a cold copy17:13:43
@edef1c:matrix.orgedef
In reply to @raitobezarius:matrix.org
But I don't think we need to convince folks of the fact that SLO is a budget and you burn into it
so like, one of the things i'm curious about is what the actual distributions are
17:18:41
@blitz:chat.x86.lolJulian Stecklina joined the room.17:19:30
@edef1c:matrix.orgedefeg is this whole-object loss, object corruption, is this uncorrelated across objects or correlated within prefixes17:19:41
@zimbatm:numtide.comJonas ChevalierOne thing I am excited about if we end up self-hosting is that it will be easier to do experiments, like a smart narinfo database that can answer to queries. We can also introduce a log of all the new entries, to make it easier for other caches to mirror the main cache.17:21:24
@edef1c:matrix.orgedefyeah, exactly17:21:36
@edef1c:matrix.orgedefone of the things in the works is incremental updating of the narinfo dataset from S317:21:50
@whentze:matrix.orgWanja Hentzeand dedup, of course17:22:19
@zimbatm:numtide.comJonas Chevalierif we can be smart about it, it has the potential to have better latency responses on the 404s17:22:24
@zimbatm:numtide.comJonas Chevalieryeah and dedup will be easier to try, without worrying to pay 10k17:22:39

Show newer messages


Back to Room ListRoom Version: 10