NixOS ACME / LetsEncrypt | 105 Members | |
| Another day, another cert renewal | 46 Servers |
| Sender | Message | Time |
|---|---|---|
| 16 Nov 2024 | ||
| m1cr0man do you prefer I wait for your PR to be merged before opening one for this to avoid conflicts? | 18:02:56 | |
| I would yeah, if that's alright? I'd also like to give that a review when I get a moment | 18:06:34 | |
| Ok no worries | 18:07:40 | |
In reply to @thinkchaos:matrix.orgDefinitely. I used the same approach generating selfsigned first, then notify systemd to continue dependant units. Do you use the targets to sequence startup of other services, or something else? | 18:26:49 | |
I don't use the self signed certs and my services use requires = [ "acme-finished-${cert}.target" ]; | 18:29:39 | |
In reply to @m1cr0man:m1cr0man.comExactly, certmagic coordinates using some shared storage. So you'd get some rudimentary balancing via DNS RR, but no redundancy. For that you still need some IP failover setup. | 18:32:53 | |
In reply to @thinkchaos:matrix.orgI wonder if I can hack around that with certmagic, haha. I kinda don't want to run the daemon as root, but maybe a separate service can run as root, act on PathModified to check for valid certs, then fire the targets. π | 18:45:05 | |
The daemon should be ok running as acme user and group combined with something like SupplementaryGroups = lib.unique (map (c: c.group) cfg.certs) | 19:13:33 | |
Actually plain acme user should do, certs.*.user doesn't do anything anymore so the acme user can read/write all certs | 19:21:53 | |
In reply to @stephank:stephank.nlmaybe you can hook something up with user systemd | 19:39:29 | |
In reply to @stephank:stephank.nl Cool ok. This is one of those things that I don't think we will ever support in the in-tree ACME module, at least not whilst we use Lego. The main thing going around in my head is that I don't want either project to give the impression that the other is broken in some way, but if there are additional use cases and features in nixos-certmagic that security.acme does not currently offer, then users who stumble upon both options can understand when to chose one implementation over the other. I find it hard to justify ever changing from lego right now. Most people treat ACME as set-and-forget, literally. If we break people's configs, we need to have a really good reason to do so, and right now I don't think that removing a few lines of bash or improving activation time for more-than-average cert configs is worth it. That's why I'm apprehensive about all this talk of using daemons. Out of tree - sure, I can see the appeal as both an upgrade for those who can handle the reconfiguration of their system, and to cover extra use cases. | 20:32:21 | |
| ThinkChaos: thanks for the second round of reviews on my PR. I am contesting both of the latest comments though π I was wondering if you wanted to have further discussion? | 20:34:26 | |
| The path thing is just a nit, fine by me to leave as is (I'll say that in the review too) | 20:35:56 | |
That's not how I view it, for instance secrets go in /run too. To me it's for anything ephemeral. AFAIU tmpfiles is generally used for stuff that's required as long as the system is active. Which fits well here. | 20:39:23 | |
And small side effect is we don't use the lockdir var in the service so it makes dependency more hidden | 20:40:15 | |
* And small side effect is we don't use the lockdir var in the service so it makes the dependency more hidden | 20:40:23 | |
| The acme-setup.service is a requirement of all the renewal services (and is oneshot+RemainAfterExit), but systemd-tmpfiles is not. We actually had a test failure on hydra a couple of days ago because tmpfiles had not ran when lockdir was accessed. Let me see if I can find you the logs. | 20:41:10 | |
I agree with this, however /run/acme is directly related to service activation + logic implemented in systemd services. Having its lifecycle managed as a RuntimeDirectory definitely makes things easier. I will definitely add a comment to say where it's created, that's a good call that the relation is not obvious | 20:42:16 | |
In reply to @k900:0upti.meThis was the lockfiles error we saw last week. | 20:47:35 | |
Ok then RuntimeDir is ok with meI thought tmpfiles was something the activation scripts ensured ran earlier based on how it's generally used, but never confirmed that assumption. That also means lots of modules are broken π | 20:47:47 | |
| I'll reply and approve π | 20:47:58 | |
| So did I TBH, but then we had that race/permissions error, and the directory hadn't been created when acme-lockfiles ran. Maybe we could add more systemd service dependencies, but RuntimeDirectory was more appropriate IMO | 20:49:19 | |
In reply to @thinkchaos:matrix.orgThank you :D | 20:51:15 | |
I still am not a fan of the RuntimeDir + RemainAfterExit to persist the dir, and the dir being used outside the service itself, but it's better than something that's broken πHow would you feel if I just do a mkdir -p in the lock script in my PR? | 20:55:04 | |
* I still am not a fan of the RuntimeDir + RemainAfterExit to persist the dir, and the dir being used outside the service itself, but it's better than something that's broken πHow would you feel if I just do a mkdir -p in the lock script in my (future) PR? | 20:55:17 | |
* I still am not a fan of the RuntimeDir + RemainAfterExit to persist the dir, and the dir being used outside the service itself, but it's better than something that's broken π~~How would you feel if I just do a mkdir -p in the lock script in my (future) PR?~~ needs root | 20:59:33 | |
* I still am not a fan of the RuntimeDir + RemainAfterExit to persist the dir, and the dir being used outside the service itself, but it's better than something that's broken π~How would you feel if I just do a mkdir -p in the lock script in my (future) PR?~ needs root | 20:59:39 | |
* I still am not a fan of the RuntimeDir + RemainAfterExit to persist the dir, and the dir being used outside the service itself, but it's better than something that's broken πmkdir -p in the lock script in my (future) PR? | 20:59:48 | |
| 17 Nov 2024 | ||
| Given how systemd dependent we are already, I generally prefer the systemd solution if one is available. It is very well tested and has outstandingly stable behaviour. Less bash scripting means less custom code for us to maintain | 13:15:23 | |
Just got done testing an --overwrite-domains option for lego that lets us remove domainHash entirely. The delta on the module is kinda underwhelming but less code is less code | 22:50:19 | |