!RROtHmAaQIkiJzJZZE:nixos.org

NixOS Infrastructure

417 Members
Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real time alerts from Prometheus.130 Servers

Load older messages


SenderMessageTime
9 May 2026
@emilazy:matrix.orgemilyI guess the queue runner would observe the output is missing and schedule another build?19:25:42
@emilazy:matrix.orgemilyright. so that could be fixed at the queue runner level? "if we have an output waiting to be uploaded, then don't spawn another build; just keep trying to upload that output"?19:26:14
@xokdvium:matrix.orgSergei Zimmerman (xokdvium)
In reply to @emilazy:matrix.org
right. so that could be fixed at the queue runner level? "if we have an output waiting to be uploaded, then don't spawn another build; just keep trying to upload that output"?
Makes sense yeah
19:26:57
@xokdvium:matrix.orgSergei Zimmerman (xokdvium)Things probably get complicated when the builder dies halfway?19:27:29
@emilazy:matrix.orgemilyas in half-way through sending its successfully-built outputs to the queue runner?19:27:57
@k900:0upti.meK900Then it should probably discard the entire build19:28:13
@emilazy:matrix.orgemilyyeah, though I think the issue is potentially that stuff happens per-output?19:28:33
@emilazy:matrix.orgemily"waiting for all outputs to be ready for upload before uploading any of them" would be good19:28:54
@k900:0upti.meK900I wonder if there's any reasonable way to do two phase commit on this19:29:59
@k900:0upti.meK900Like upload to a temporary directory and then move atomically19:30:07
@k900:0upti.meK900If S3 lets you do that19:30:15
@emilazy:matrix.orgemilyyeah, if we could have S3 expose all the narinfos atomically that would be great19:30:20
@emilazy:matrix.orgemilyI mean even just uploading all the actual outputs first and then uploading the narinfos would probably help19:30:28
@emilazy:matrix.orgemilyI don't really understand why Hydra would decide to build something it's trying to upload anyway though19:32:03
@emilazy:matrix.orgemilydoes it ever give up on retrying to upload?19:32:10
@emilazy:matrix.orgemilylike, anything that wants something in the process of uploading should block on that upload anyway, right?19:32:52
@emilazy:matrix.orgemily so perhaps the solution could be as simple as just "never stop retrying uploads"? it wouldn't handle the "builder disappears" case Sergei Zimmerman (xokdvium) mentioned but that's at least an edge case 19:33:24
@emilazy:matrix.orgemily hexa (signing key rotation when): can we get queue runner logs for musikcube 19:46:16
@emilazy:matrix.orgemilyhttps://github.com/NixOS/nixpkgs/issues/517508 is recent19:46:27
@emilazy:matrix.orgemilythough, let me check it's not a library dependency actually19:46:39
@hexa:lossy.networkhexano entries19:47:42
@hexa:lossy.networkhexa5 tries iirc19:48:24
@emilazy:matrix.orgemilyseems like we should just make that infinite?19:50:46
@emilazy:matrix.orgemilyif you give up on uploading and rebuild instead, you then still have to upload the output, so no benefit19:50:58
@emilazy:matrix.orgemily ok, it's actually ffmpeg-headless on unstable, last rebuilt 2026-04-23 19:53:53
@emilazy:matrix.orgemilyare these logs from the old or new queue runner? having trouble chasing code paths20:00:54
@hexa:lossy.networkhexaold20:01:29
@hexa:lossy.networkhexathe new runner isn't live20:01:36
@emilazy:matrix.orgemily ok, so it looks like Nix will retry up to download-attempts (even for uploads) times, unless it gets status 400–500 other than 408, 501, 505, or 511 20:18:09
@hexa:lossy.networkhexaand I assume you only got that from code and not docs20:18:42

Show newer messages


Back to Room ListRoom Version: 6