| 18 Jul 2023 |
Mic92 | the auto nixos flake update can be monitored, while also being less complex. | 10:37:23 |
Mic92 | adis: does lemmy has some health endpoint that I can add to monitoring? | 10:38:07 |
adisbladis | We could build with prometheus support, but it's not enabled by default iirc | 10:42:55 |
Domen Kožar | Mic92: what kind of monitoring you'd like? | 10:43:02 |
Domen Kožar | like system monitoring? memory, cpu, etc? | 10:44:00 |
adisbladis | In reply to @adis:blad.is We could build with prometheus support, but it's not enabled by default iirc I'll look into this a bit later | 10:44:45 |
Mic92 | domenkozar: if the deployment succeeded. | 10:48:08 |
Mic92 | so exit code of last deployment. | 10:48:19 |
Mic92 | Either as textfile in prometheus format or there is an http exporter with that metric. | 10:48:36 |
Domen Kožar | cachix deploy activate will exit with non zero if any of deployments fail | 10:51:08 |
Mic92 | Ok. But how does this integrated with the cachix-agent? | 10:53:49 |
Domen Kožar | for example: https://github.com/cachix/cachix-ci-agents/actions/runs/5521933214/jobs/10071274310 | 10:53:50 |
Domen Kožar | oh you'd like to have this on the agent itself? | 10:54:13 |
Domen Kožar | interesting, we could add opentelemetry events for this | 10:54:35 |
Mic92 | some /metrics thing would be also enough. | 10:54:52 |
zowoq | In reply to @domenkozar:matrix.org cachix deploy activate will exit with non zero if any of deployments fail but not if using async? | 10:55:01 |
Domen Kožar | I'm happy to implement that if it unblocks you | 10:55:04 |
Mic92 | I don't think many people have infra for opentelemetry | 10:55:10 |
Domen Kožar | In reply to @zowoq:matrix.org but not if using async? not in async mode yeah | 10:55:19 |
Domen Kožar | In reply to @joerg:thalheim.io I don't think many people have infra for opentelemetry you can use prometheus exporter | 10:55:37 |
Domen Kožar | and I've merged the PR that has otl nixos service | 10:55:49 |
Domen Kožar | opened https://github.com/cachix/cachix/issues/562 | 10:56:33 |
Mic92 | Domen Kožar: https://github.com/influxdata/telegraf/tree/master/plugins/inputs/opentelemetry | 10:57:39 |
Mic92 | I guess opentelemetry also works here. | 10:57:47 |
Domen Kožar | most OSS supports it, that's why it's cool | 10:58:00 |
Domen Kožar | Mic92: so if we implement this it would unblock you to use it? | 10:58:20 |
Mic92 | zowoq: what do you think about that ^ | 10:58:30 |
Domen Kožar | (trying to prioritize workload) | 10:58:29 |