!vxTmkuJzhGPsMdkAOc:transformierende-gesellschaft.org

NixOS Matrix Subsystem

118 Members
Coordination and discussion about the matrix subsystem in NixOS - https://nixos.wiki/wiki/Matrix61 Servers

Load older messages


SenderMessageTime
8 Jan 2024
@hexa:lossy.networkhexayeah, pretty much the same cause as last time22:37:42
@hexa:lossy.networkhexaloving the redundant stack trace22:39:14
@hexa:lossy.networkhexatesting if depending on network-online.target does the trick22:40:02
@hexa:lossy.networkhexaok, here is another take on the situation after adding that dependency22:47:03
@hexa:lossy.networkhexamatrix-synapse.service has Restart=on-failure22:47:32
@hexa:lossy.networkhexa * matrix-synapse.service has Restart=on-failure 22:47:37
@hexa:lossy.networkhexathe workers as well22:47:48
@hexa:lossy.networkhexa so the matrix-synapse.service boots after a a few retries (because the idm is on a vpn, and the routing daemon needs to converge first....) 22:48:19
@hexa:lossy.networkhexa the issue is that the worker services are failed with dependency when matrix-synapse.service fails the first time 22:48:48
@hexa:lossy.networkhexaeven if it works on the second try - too late.22:49:25
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its) We could set RestartMode=direct on matrix-synapse.service so it doesn't go through a failure state 22:50:09
@dandellion:dodsorf.asDandellionThe options I've seen to fix that floating around online is essentially the wait-for-script we removed22:50:48
@hexa:lossy.networkhexaYeah, thought of BindsTo= and the wait script22:51:07
@hexa:lossy.networkhexa not sure about the implications of RestartMode=direct 22:51:18
@dandellion:dodsorf.asDandellionhttps://github.com/systemd/systemd/issues/131222:51:36
@dandellion:dodsorf.asDandellionso seems direct can work?22:51:45
@dandellion:dodsorf.asDandellionas long as we don't put any limits on the amount of restarts or something to that effect?22:52:25
@hexa:lossy.networkhexasounds like that was the solution22:52:28
@hexa:lossy.networkhexaanyway, that means matrix-synapse.service can never reach failed state, which is not ideal for monitoring22:52:54
@dandellion:dodsorf.asDandellionI've not seen/used it before so don't really know what it does22:53:11
@dandellion:dodsorf.asDandellion * I've never seen/used it before so don't really know what it does22:53:17
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its)
In reply to @hexa:lossy.network
anyway, that means matrix-synapse.service can never reach failed state, which is not ideal for monitoring
How so? If restart limits are hit the service should still transition to inactive/failed even with direct set
22:54:49
@hexa:lossy.networkhexaoh yeah, that is correct22:55:04
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its) Or we use the Upholds= fix mentioned in the issue Dandellion linked 22:56:19
@hexa:lossy.networkhexayeah, that sounds like exactly what we want22:57:11
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its) We could add Upholds=matrix-synapse-worker-...service to matrix-synapse.service so as soon as matrix-synapse.service is running it should start the workesr 22:57:18
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its) * We could add Upholds=matrix-synapse-worker-...service to matrix-synapse.service so as soon as matrix-synapse.service is running it should start the workers 22:57:27
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its)Not sure how stopping everything with the target would work in that case22:57:44
@hexa:lossy.networkhexa BindsTo=? 22:57:56
@sophie:catgirl.cloud⛧-440729 [sophie] (it/its) Hmm I feel like RestartMode=direct is the somewhat cleaner solution but idk 22:59:52

Show newer messages


Back to Room ListRoom Version: 4