!zghijEASpYQWYFzriI:nixos.org

Hydra

363 Members
105 Servers

Load older messages


SenderMessageTime
28 Jun 2025
@elvishjerricco:matrix.orgElvishJerricco I've been running a personal hydra for a couple years now, and a couple days it basically just stopped doing builds. I can get it to do a couple more every time I restart it, but it doesn't continue through the queue. Meanwhile in the journal I see this every 10 seconds: hydra-queue-runner[345]: checking the queue for builds... 02:41:59
@aftix:matrix.orgaftixI had something similar which happened since the disk was full03:01:48
@hexa:lossy.networkhexaoh yeah, watermark level 🙂 03:04:21
@hexa:lossy.networkhexaminimumDiskFree[Evaluator]03:04:47
@elvishjerricco:matrix.orgElvishJerriccothat has never been a problem before, and my disk is not as full as it has been in the past03:05:34
@elvishjerricco:matrix.orgElvishJerriccohow would I check if that's it?03:05:43
@hexa:lossy.networkhexadid you configure a limit?03:09:11
@elvishjerricco:matrix.orgElvishJerricconope03:09:20
@hexa:lossy.networkhexathen the feature is probably not enabled03:09:40
@hexa:lossy.networkhexa you can inspect the internal state of the queue-runner by appending /queue-runner-status to the baseur 03:11:59
@hexa:lossy.networkhexa * you can inspect the internal state of the queue-runner by appending /queue-runner-status to the base url 03:12:03
@hexa:lossy.networkhexarelevant is whether there are runnables, because only those jobs are ready to be scheduled03:12:59
@hexa:lossy.networkhexaideally consecutiveFailures is 0 for all builders03:13:17
@elvishjerricco:matrix.orgElvishJerricco
   "machineTypes" : {
      "x86_64-linux:kvm,nixos-test" : {
         "runnable" : 5,
         "running" : 0
      },
      "x86_64-linux:local" : {
         "runnable" : 0,
         "running" : 0
      }
   },

"consecutiveFailures" : 1, for the builder, presumably because I canceled a build

03:17:54
@hexa:lossy.networkhexadoes the builder have the relevant system features?03:30:49
@elvishjerricco:matrix.orgElvishJerriccoYep03:30:55
@elvishjerricco:matrix.orgElvishJerricco I also see "disabledUntil" : 1751079864,, which corresponds with the log message hydra-queue-runner[345]: will disable machine ‘ssh://builder@pyromancer’ for 71s that came after I canceled that build. 03:31:24
@elvishjerricco:matrix.orgElvishJerriccoBut that timestamp has come and gone03:31:28
@elvishjerricco:matrix.orgElvishJerriccolong gone03:31:30
@elvishjerricco:matrix.orgElvishJerricco should it still have that disabledUntil field if the time has passed? 03:31:48
@hexa:lossy.networkhexaI don't think so03:32:06
@hexa:lossy.networkhexaor yes, it can03:32:22
@hexa:lossy.networkhexah.n.o has that as well03:32:27
@elvishjerricco:matrix.orgElvishJerricco My other builder has "disabledUntil" : 0, 03:32:40
@hexa:lossy.networkhexayeah, if it never failed03:32:46
@hexa:lossy.networkhexaI think consecutiveFailures gets reset when it continues working03:33:05
@hexa:lossy.networkhexabut disabledUntil and lastFailure are sticky until queue-runner restart03:33:24
@hexa:lossy.networkhexapossibly stale ssh session?03:33:51
@hexa:lossy.networkhexakill the local nix-daemon?03:34:04
@hexa:lossy.networkhexatest the ssh connection?03:34:08

Show newer messages


Back to Room ListRoom Version: 6