!zghijEASpYQWYFzriI:nixos.org

Hydra



SenderMessageTime
28 Jun 2025
@hexa:lossy.networkhexaideally consecutiveFailures is 0 for all builders03:13:17
@elvishjerricco:matrix.orgElvishJerricco
   "machineTypes" : {
      "x86_64-linux:kvm,nixos-test" : {
         "runnable" : 5,
         "running" : 0
      },
      "x86_64-linux:local" : {
         "runnable" : 0,
         "running" : 0
      }
   },

"consecutiveFailures" : 1, for the builder, presumably because I canceled a build

03:17:54
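(Editor's sketch, not part of the conversation.) The symptom in the fragment above is a machine type with runnable steps but nothing running. A minimal Python sketch of how one might flag that from the status JSON follows. It assumes only the "machineTypes" / "runnable" / "running" keys quoted in the message; the function name `stalled_machine_types` is hypothetical, and how the JSON is obtained is left out.

```python
import json

# The "machineTypes" fragment quoted in the chat, wrapped in braces so it
# parses as a standalone JSON document.
STATUS_FRAGMENT = """
{
   "machineTypes" : {
      "x86_64-linux:kvm,nixos-test" : { "runnable" : 5, "running" : 0 },
      "x86_64-linux:local" : { "runnable" : 0, "running" : 0 }
   }
}
"""


def stalled_machine_types(status):
    """Return machine types that have runnable steps but none running.

    Hypothetical helper: it only inspects the keys seen in the chat.
    """
    stalled = []
    for name, counts in status.get("machineTypes", {}).items():
        if counts.get("runnable", 0) > 0 and counts.get("running", 0) == 0:
            stalled.append(name)
    return sorted(stalled)


status = json.loads(STATUS_FRAGMENT)
stalled = stalled_machine_types(status)
print(stalled)
```

A non-empty result only says that work is waiting; it does not say why the queue runner is not dispatching it.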
@hexa:lossy.networkhexadoes the builder have the relevant system features?03:30:49
@elvishjerricco:matrix.orgElvishJerriccoYep03:30:55
@elvishjerricco:matrix.orgElvishJerricco I also see "disabledUntil" : 1751079864, which corresponds with the log message hydra-queue-runner[345]: will disable machine ‘ssh://builder@pyromancer’ for 71s that came after I canceled that build. 03:31:24
@elvishjerricco:matrix.orgElvishJerriccoBut that timestamp has come and gone03:31:28
@elvishjerricco:matrix.orgElvishJerriccolong gone03:31:30
@elvishjerricco:matrix.orgElvishJerricco should it still have that disabledUntil field if the time has passed? 03:31:48
@hexa:lossy.networkhexaI don't think so03:32:06
@hexa:lossy.networkhexaor yes, it can03:32:22
@hexa:lossy.networkhexah.n.o has that as well03:32:27
@elvishjerricco:matrix.orgElvishJerricco My other builder has "disabledUntil" : 0, 03:32:40
@hexa:lossy.networkhexayeah, if it never failed03:32:46
@hexa:lossy.networkhexaI think consecutiveFailures gets reset when it continues working03:33:05
@hexa:lossy.networkhexabut disabledUntil and lastFailure are sticky until queue-runner restart03:33:24
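(Editor's sketch, not part of the conversation.) Since disabledUntil is described as sticky, a non-zero value alone does not mean the builder is still backed off; it has to be compared with the current time. The Python sketch below does that comparison, assuming disabledUntil is a Unix timestamp in seconds (which matches the value quoted earlier). The function name and the "now" value are hypothetical.

```python
from datetime import datetime, timezone


def describe_disabled_until(disabled_until, now):
    """Classify a disabledUntil value relative to `now` (both Unix seconds)."""
    if disabled_until == 0:
        # Matches the "if it never failed" case from the chat.
        return "never disabled"
    when = datetime.fromtimestamp(disabled_until, tz=timezone.utc).isoformat()
    if now < disabled_until:
        return f"disabled until {when}"
    # The value is in the past: a leftover from an earlier failure.
    return f"back-off expired at {when}"


# 1751079864 is the value quoted in the chat; `now` is a hypothetical
# moment 27 minutes later.
print(describe_disabled_until(1751079864, now=1751081484))
```

If the back-off has expired and builds are still not being scheduled, the stale field is probably not the cause, which is consistent with the direction the discussion takes next.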
@hexa:lossy.networkhexapossibly stale ssh session?03:33:51
@hexa:lossy.networkhexakill the local nix-daemon?03:34:04
@hexa:lossy.networkhexatest the ssh connection?03:34:08
@elvishjerricco:matrix.orgElvishJerriccowell ssh is working, it seems03:36:08
@elvishjerricco:matrix.orgElvishJerriccothe way this hydra is set up is a little stupid03:36:17
@elvishjerricco:matrix.orgElvishJerriccoit's running in a nixos container on the host that is the builder :P03:36:34
@elvishjerricco:matrix.orgElvishJerriccoI don't remember why I set it up this way; I assume the daemon-socket bind mounted from the host in the container would have let hydra use the local machine as a builder03:38:23
@elvishjerricco:matrix.orgElvishJerriccohuh, restarting the nix daemon on the host, there's three processes left in the systemd unit from the previous service instance03:40:06
@elvishjerricco:matrix.orgElvishJerriccoI think whatever's going on has something to do with failed builds. It seems like it's chewing through successful builds, but once one fails it stops scheduling builds and never starts again03:47:50
@elvishjerricco:matrix.orgElvishJerriccoI recently updated this system, which included this hydra update: https://github.com/NixOS/nixpkgs/commit/cd9bf3369b9fc4ea0a6a8d91902a41d520580cb9 Which begins with this commit: https://github.com/NixOS/hydra/commit/720db63d52ebcbda617603e7aa5b5c750cc6afec 05:05:32
@elvishjerricco:matrix.orgElvishJerricco hmmm 05:05:40
@elvishjerricco:matrix.orgElvishJerriccomayhaps this "smarter scheduling criteria" is causing my problem?05:05:58
@elvishjerricco:matrix.orgElvishJerriccowell, still seems like I have the problem after patching in a revert for that05:18:25
@elvishjerricco:matrix.orgElvishJerricco ok no, reverting hydra's src to the version I was using before does not fix the problem 05:30:01
@elvishjerricco:matrix.orgElvishJerriccoI'm at a loss here. Anybody have any ideas I can try?05:32:52
