| 7 Oct 2021 |
Vladimír Čunát | If some jobs are required for channel updates to succeed, shouldn't we make the critical job (tested here) depend on all those jobs? | 08:54:39 |
Vladimír Čunát | * This case was probably me. | 08:55:23 |
Vladimír Čunát | Well, I wonder where to reply... seeing the question on one issue and two chat channels. | 08:56:36 |
Vladimír Čunát | ☝️ Jonas Chevalier | 08:57:23 |
Jonas Chevalier | that's fine, I'm just trying to better understand what happened | 08:58:38 |
Jonas Chevalier | I'm not super familiar with Hydra and the channel-update logic | 08:59:37 |
Jonas Chevalier | Vladimír Čunát: what are typical conditions where builds get canceled? | 09:00:31 |
Vladimír Čunát | I do it in a few cases | 09:01:28 |
Vladimír Čunát |
- replaced builds: if we have newer builds of the same jobs (scheduled) and keeping the older ones doesn't seem very useful anymore. (makes sense when there are many of them, especially if combined with high load on Hydra)
| 09:02:52 |
Vladimír Čunát |
- channel waiting on bad test. Sometimes the channel would otherwise update, but it waits for all builds to finish. And sometimes there are a few tests that haven't succeeded recently and just wait for a long time-out. Cancelling those stragglers can speed up the channel update. (makes sense if the channel is currently quite old, e.g. after some period of channel blockers)
| 09:04:34 |
Jonas Chevalier | Is anybody else keeping a pulse on Hydra like you do? | 09:11:43 |
Vladimír Čunát | I'm not sure. I do watch https://status.nixos.org/ (or a script of mine with differently defined timestamps, based on committer time). If some channel is getting over three days old, I look at what's wrong; typically it's something easy to solve or speed up. | 09:14:00 |
Vladimír Čunát | I mean, there certainly are other cases of people "unblocking channels", based on what happens, but I have no real insight into that. | 09:22:08 |
Jonas Chevalier | thanks | 09:22:58 |
Jonas Chevalier | it would be useful if those user interventions were logged somehow, so that we could know who is doing what | 09:24:29 |
Vladimír Čunát | It's not the first time there was this idea: https://github.com/NixOS/hydra/issues/786 | 09:27:23 |
| 8 Oct 2021 |
Jonas Chevalier | I added some templates to the repo and am looking for feedback: https://github.com/NixOS/nixos-org-configurations/issues/new/choose
The goal is to make it easier for users to report issues with the infrastructure, and in the future request access or new resources to be deployed. | 07:33:59 |
| 10 Oct 2021 |
hexa | There seems to be a caching issue with channels.nixos.org https://github.com/NixOS/nixos-org-configurations/issues/169#issuecomment-939478669 | 14:17:48 |
sterni | this has been like this for over a week (two?) | 14:18:58 |
sterni | seems like forever | 14:19:00 |
sterni | also: has like any darwin build happened in the last week? | 14:48:22 |
K900 | It feels like some of the CDN nodes are being weird | 15:04:00 |
K900 | I've been hitting really old builds for a while but it seems OK now | 15:04:10 |
| 11 Oct 2021 |
| chvp joined the room. | 06:45:43 |
| Ryan Burns joined the room. | 07:58:59 |
Ryan Burns | In reply to @sternenseemann:systemli.org ("also: has like any darwin build happened in the last week?"): nope. Correct me if I'm wrong, but based on https://hydra.nixos.org/status every x86_64-darwin builder has been stuck on the same job for going on 7 days now | 08:02:08 |
toonn | Hmm, looks like there are 7 idle x86_64-darwin builders rn even though the queue is massive (~100k). Why might this happen? | 11:54:00 |
lukegb (he/him) | Possibly some stdenv jobs are stuck on a broken worker and blocking everything else? | 11:54:44 |