!sBfrWMVsLoSyFTCkNv:nixos.org

OfBorg

177 Members
Number of builds and evals in queue: <TBD>63 Servers

Load older messages


SenderMessageTime
29 Sep 2023
@asymmetric:matrix.dapp.org.uk@asymmetric:matrix.dapp.org.uk
In reply to @cole-h:matrix.org
My point with #2 is that (as far as I know), the coordinator doesn't know which machine took the job, so how should it determine which build machine to inform about the change? (I think the simplest depiction of how it all works, from my knowledge, is GitHub webhook -> ofborg-core (coordinator) -> AMQP server <-> ofborg-eval-X (evaluator and builder))
But the "comment in PR body" could work, if ofborg is notified about that information (it must be, in some roundabout way, because we have access to the PR number)......
ofborg-core being the php code running php? as that seems to be the thing that sits between hooks and rabbitmq
19:05:05
@asymmetric:matrix.dapp.org.uk@asymmetric:matrix.dapp.org.uk
In reply to @cole-h:matrix.org
My point with #2 is that (as far as I know), the coordinator doesn't know which machine took the job, so how should it determine which build machine to inform about the change? (I think the simplest depiction of how it all works, from my knowledge, is GitHub webhook -> ofborg-core (coordinator) -> AMQP server <-> ofborg-eval-X (evaluator and builder))
But the "comment in PR body" could work, if ofborg is notified about that information (it must be, in some roundabout way, because we have access to the PR number)......
* ofborg-core being the php code under ./php? as that seems to be the thing that sits between hooks and rabbitmq
19:05:17
@cole-h:matrix.orgcole-h Not solely that. The core machine also runs most of these binaries (that aren't build or eval related): https://github.com/NixOS/ofborg/tree/released/ofborg/src/bin 19:06:39
1 Oct 2023
@cafkafk:gitter.imcafkafk joined the room.14:39:06
@cafkafk:gitter.imcafkafkdisk full https://github.com/NixOS/nixpkgs/pull/258395/checks?check_run_id=1729277569214:39:14
@hexa:lossy.networkhexahttps://ofborg.org/prometheus/alerts hmm14:41:55
@hexa:lossy.networkhexadon't think we monitor the darwin machines in prometheus14:43:41
@lily:lily.flowersLily Foster
In reply to @hexa:lossy.network
don't think we monitor the darwin machines in prometheus
We do, but I bet they don't push that metric
14:45:15
@hexa:lossy.networkhexahttps://ofborg.org/prometheus/graph?g0.expr=node_os_info&g0.tab=1&g0.stacked=0&g0.show_exemplars=0&g0.range_input=1h14:45:36
@hexa:lossy.networkhexaI don't think we do πŸ˜„14:46:07
@lily:lily.flowersLily FosterYeah looks like they are only sending ofborg metrics14:46:19
@lily:lily.flowersLily FosterNo system metrics14:46:23
@hexa:lossy.networkhexaoh14:46:30
@hexa:lossy.networkhexawhere did you find a darwin metric?14:46:51
@lily:lily.flowersLily FosterActually the metrics are only sent from the central queue manager. So yeah they aren't sending anything14:47:18
@lily:lily.flowersLily Foster(for darwin build info)14:47:25
@lily:lily.flowersLily FosterYou're right14:47:29
@pederbs:pvv.ntnu.nopbsds joined the room.14:49:17
@cole-h:matrix.orgcole-h
In reply to @cafkafk:gitter.im
disk full https://github.com/NixOS/nixpkgs/pull/258395/checks?check_run_id=17292775692
Should be fixed now.
14:56:40
@lily:lily.flowersLily Foster Any chance we could run the prometheus agent on darwin cole-h? πŸ₯Ί 14:57:27
@cole-h:matrix.orgcole-hWe do, I guess they're just not scraped14:58:43
@cole-h:matrix.orgcole-h
curl -ss http://...:9100/metrics | head
# HELP go_gc_duration_seconds A summary of the pause duration of garbage collection cycles.
# TYPE go_gc_duration_seconds summary
go_gc_duration_seconds{quantile="0"} 3.9458e-05
go_gc_duration_seconds{quantile="0.25"} 5.4583e-05
go_gc_duration_seconds{quantile="0.5"} 6.6707e-05
go_gc_duration_seconds{quantile="0.75"} 9.1292e-05
go_gc_duration_seconds{quantile="1"} 0.000423
go_gc_duration_seconds_sum 251.971410887
go_gc_duration_seconds_count 115795
# HELP go_goroutines Number of goroutines that currently exist.
15:00:39
@cole-h:matrix.orgcole-hOK, they're scraped now.15:14:17
@hexa:lossy.networkhexathe node_filesystem_free metric that is used in the alerting rule does not exist 15:31:28
@hexa:lossy.networkhexaimage.png
Download image.png
15:31:52
@hexa:lossy.networkhexathis one15:31:53
@cole-h:matrix.orgcole-hπŸ”₯🐢πŸ”₯15:32:09
@hexa:lossy.networkhexashould probably use avail_bytes πŸ˜„ 15:32:17
@hexa:lossy.networkhexaor free_bytes15:32:34
@hexa:lossy.networkhexaheck, what is the difference?15:32:40

Show newer messages


Back to Room ListRoom Version: 6