!RROtHmAaQIkiJzJZZE:nixos.org

NixOS Infrastructure

422 Members
Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real time alerts from Prometheus.132 Servers

Load older messages


SenderMessageTime
9 May 2026
@emilazy:matrix.orgemilyif there are decently logs for the cache uploads that could be accessed that would likely help narrow things down a lot17:21:22
@hexa:lossy.networkhexaraces are absolutely a possiblity17:21:47
@hexa:lossy.networkhexanote that I tried out the new queue-runner at least three times in the last two weeks17:22:00
@emilazy:matrix.orgemilythese issues have been present for years17:22:11
@hexa:lossy.networkhexagood17:22:14
@emilazy:matrix.orgemilybut getting worse in the past, say, couple months?17:22:18
@emilazy:matrix.orgemilymuch worse17:22:31
@xokdvium:matrix.orgSergei Zimmerman (xokdvium)Hm is S3 still very bad on the on queue runner?17:23:09
@emilazy:matrix.orgemily

I think it would require

  1. builder A starts building fish
  2. builder A finishes building fish
  3. builder A uploads build log and fish^doc
  4. builder B substitutes fish^doc
  5. builder B starts building fish
  6. builder B uploads fish^out
  7. builder A doesn't get to upload fish^out
17:23:56
@emilazy:matrix.orgemily seems pretty contrived to me, the entire second build has to happen between builder A starting to upload and finishing (and what would be pulling in fish^doc to begin with?) 17:24:17
@hexa:lossy.networkhexanobody changed anything about s3, hydra repo has migrated to the new queue-runner i march17:24:41
@emilazy:matrix.orgemily seems more likely for the upload of fish^out to just hard fail the first time and then a second builder later picks it up and mangles it 17:24:45
@hexa:lossy.networkhexaRedacted or Malformed Event17:24:46
@hexa:lossy.networkhexathe old queue-runner is gone from the repo17:24:52
@hexa:lossy.networkhexayeah, same reaction17:25:07
@hexa:lossy.networkhexaI suddenly had to pin hydra without any prior communication17:25:18
@emilazy:matrix.orgemilydo logs exist for an attempt to push out a given store path and whether it succeeded?17:26:11
@emilazy:matrix.orgemily or for multiple builds of a derivation that happen? like can we get data about whether fish's outputs/logs on the cache for that one derivation are actually chimerical between two separate builds? 17:26:40
@xokdvium:matrix.orgSergei Zimmerman (xokdvium)Hm maybe last-modified could be a rough approximation?17:27:33
@emilazy:matrix.orgemilywe run into this issue ~every staging cycle now it feels like from what I've seen, and it's gone from "once or twice a year" to "constantly hitting users" it seems17:27:36
@emilazy:matrix.orgemilyit also holds up cycles because it breaks downstream builds etc.17:28:03
@hexa:lossy.networkhexayeah17:34:45
@hexa:lossy.networkhexafrom may 7th 21:56 right now17:35:04
@hexa:lossy.networkhexaso less than 2 days17:35:15
@hexa:lossy.networkhexahttps://termbin.com/69iy17:36:19
@hexa:lossy.networkhexaall fish related things17:36:22
@emilazy:matrix.orgemilyyeah, not quite the retention we'd need to catch this :(17:38:25
@emilazy:matrix.orgemily we could check it next time we observe this cropping up in a staging-next cycle 17:38:35
@emilazy:matrix.orgemilydoes S3 retain something like ^? like the date a bucket entry was added/modified?17:38:49
@emilazy:matrix.orgemily
May 08 04:16:49 mimas hydra-queue-runner[1392977]: warning: unable to upload 'https://nix-cache.s3.us-east-1.amazonaws.com/log/xsvcvrzr8v1p7jpldddr8wkmaz84knpi-config.fish.drv': Timeout was reached (28) Connection timed out after 17368 milliseconds; retrying in 275 ms (attempt 1/5)
17:39:21

Show newer messages


Back to Room ListRoom Version: 6