!RROtHmAaQIkiJzJZZE:nixos.org

NixOS Infrastructure

394 Members
Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real-time alerts from Prometheus.
119 Servers


Sender: Message [Time]
19 Mar 2026
@joerg:thalheim.io Mic92: aws, doesn't seem to use this interface [12:28:30]
@xokdvium:matrix.org Sergei Zimmerman (xokdvium): Well it effectively does the same thing. What aws probably does is retry on that 400 error [12:29:30]
@arianvp:matrix.org Arian (in reply to @joerg:thalheim.io: "but aws does its own pooling"): Only if you use the transfer API iirc, which we didn't [12:32:54]
@arianvp:matrix.org Arian: For S3 you also need to retry on 503 for uploads https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance-design-patterns.html [12:34:08]
@arianvp:matrix.org Arian: They use 503 for rate limit :') [12:34:30]
@joerg:thalheim.io Mic92: https://github.com/aws/aws-sdk-cpp/blob/9204e236faaa1ca6a0342dee7caf61c7cf5ad8bb/src/aws-cpp-sdk-core/source/client/CoreErrors.cpp#L90-L91 [12:35:08]
@xokdvium:matrix.org Sergei Zimmerman (xokdvium): Yup bernardo added that in https://github.com/NixOS/nix/pull/15449 [12:35:11]
@arianvp:matrix.org Arian: Sweet [12:35:38]
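As an illustration of the retry-on-503 behaviour discussed above, here is a minimal shell sketch of capped exponential backoff. It is not the code from the linked Nix PR or from the AWS SDK; the function names `backoff_delay` and `retry_on_503`, the `BACKOFF_BASE` and `BACKOFF_CAP` variables, and the default delays are all assumptions made for the example.

```shell
# Sketch only: names and defaults below are illustrative assumptions.

# backoff_delay ATTEMPT BASE CAP
# Prints min(CAP, BASE * 2^ATTEMPT) in whole seconds.
backoff_delay() {
    bd_attempt=$1
    bd_delay=$2
    bd_cap=$3
    bd_i=0
    # Double the delay once per previous attempt, stopping early at the cap.
    while [ "$bd_i" -lt "$bd_attempt" ] && [ "$bd_delay" -lt "$bd_cap" ]; do
        bd_delay=$((bd_delay * 2))
        bd_i=$((bd_i + 1))
    done
    if [ "$bd_delay" -gt "$bd_cap" ]; then
        bd_delay=$bd_cap
    fi
    echo "$bd_delay"
}

# retry_on_503 MAX_ATTEMPTS CMD [ARGS...]
# CMD must print the HTTP status of one request on stdout.
# Prints the last status seen; returns 1 if every attempt ended in 503.
retry_on_503() {
    r_max=$1
    shift
    r_n=0
    while :; do
        r_status=$("$@")
        if [ "$r_status" != "503" ]; then
            echo "$r_status"
            return 0
        fi
        r_n=$((r_n + 1))
        if [ "$r_n" -ge "$r_max" ]; then
            echo "$r_status"
            return 1
        fi
        # Wait before the next attempt: BASE, 2*BASE, 4*BASE, ... up to CAP.
        sleep "$(backoff_delay $((r_n - 1)) "${BACKOFF_BASE:-1}" "${BACKOFF_CAP:-60}")"
    done
}
```

Something like `retry_on_503 5 upload_once`, where `upload_once` is a hypothetical function that performs a single upload and prints the resulting HTTP status, would then make at most five attempts.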
@joerg:thalheim.io Mic92: where does this handle 400? [12:37:33]
@xokdvium:matrix.org Sergei Zimmerman (xokdvium): I was talking about 503, 400 is not handled there. We'd need to look at the response XML [12:38:17]
@arianvp:matrix.org Arian: Line 113 [12:38:56]
@arianvp:matrix.org Arian: Decides if an error is retryable [12:39:11]
@joerg:thalheim.io Mic92: True, but I think Sergei is right, we need to look at the XML [12:41:00]
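The point made above, that the HTTP status alone is not enough for a 400 and the response XML has to be inspected, could be sketched as follows. This is an illustration, not the logic in Nix or the AWS SDK: `s3_error_code` and `is_retryable` are hypothetical helpers, the set of retryable 5xx statuses is an assumption, and `RequestTimeout` is used only as an example of an S3 error code returned with status 400, since the log does not say which code was involved.

```shell
# Sketch only: helper names and the retryable sets are assumptions.

# s3_error_code: read an S3 error response body on stdin and print the
# text of the first <Code> element (prints nothing if there is none).
s3_error_code() {
    # grep exits 1 when nothing matches; that is not an error here.
    { grep -o '<Code>[^<]*</Code>' || true; } \
        | head -n 1 \
        | sed 's/<[^>]*>//g'
}

# is_retryable STATUS [BODY]
# Returns 0 if the response looks worth retrying, 1 otherwise.
is_retryable() {
    ir_status=$1
    ir_body=${2:-}
    case "$ir_status" in
        # Assumption: treat common transient 5xx statuses as retryable.
        500|502|503|504)
            return 0
            ;;
        400)
            # A 400 is only retried when the XML error code says so.
            ir_code=$(printf '%s\n' "$ir_body" | s3_error_code)
            case "$ir_code" in
                RequestTimeout) return 0 ;;
            esac
            return 1
            ;;
        *)
            return 1
            ;;
    esac
}
```

The design choice here is to keep the status check and the XML check separate, so that the list of retryable error codes can be extended without touching the status handling.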
@joerg:thalheim.io Mic92: deploying this now [12:44:59]
@joerg:thalheim.io Mic92: how did you come up with this number, so I can monitor? [12:49:53]
@hexa:lossy.network hexa: journalctl -u hydra-queue-runner --since "2 days ago" --grep="unable to upload" | wc -l [12:50:38]
@joerg:thalheim.io Mic92: okay, we don't have any upload errors since: Thu Mar 19 01:01:45 PM UTC 2026 [13:02:00]
@joerg:thalheim.io Mic92: Let's see [13:02:05]
@joerg:thalheim.io Mic92: journalctl -u hydra-queue-runner --since "3 hour ago" --grep="unable to upload" is also empty [13:02:13]
@joerg:thalheim.io Mic92: for monitoring: while true; do journalctl -u hydra-queue-runner --since "2026-03-19 13:01:45" --grep="unable to upload"; sleep 180; done [13:06:19]
@joerg:thalheim.io Mic92: so far only getting one 5xx error: Mar 19 13:11:55 mimas hydra-queue-runner[2800047]: warning: unable to upload 'https://nix-cache.s3.us-east-1.amazonaws.com/nar/0a8jn6fw74pdnpa10viam4wb4jmpszkfw19zrrm1h1n40zlpnlcx.nar.xz': HTTP error 503 [13:16:20]
@joerg:thalheim.io Mic92: Still only one error. [15:24:49]
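For the monitoring above, the journal output could also be tallied per HTTP status instead of being read by eye. The following is a rough sketch under the assumption that the warnings keep the `unable to upload '...': HTTP error NNN` shape seen in this log; `count_upload_errors` is a hypothetical helper name.

```shell
# Sketch only: assumes the warning format shown in the log above.

# count_upload_errors: read journal lines on stdin and print one
# "<count> <status>" line per HTTP status found in upload warnings.
count_upload_errors() {
    # grep exits 1 when nothing matches; treat that as "no errors".
    { grep 'unable to upload' || true; } \
        | sed -n 's/.*HTTP error \([0-9][0-9]*\).*/\1/p' \
        | sort \
        | uniq -c \
        | awk '{ print $1, $2 }'
}
```

It would be fed from the same command used in the chat, for example `journalctl -u hydra-queue-runner --since "2026-03-19 13:01:45" --grep="unable to upload" | count_upload_errors`.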
@joerg:thalheim.io Mic92: Backport of the fix: https://github.com/NixOS/nix/pull/15525 [15:58:59]
@joerg:thalheim.io Mic92: hexa (signing key rotation when), vcunat, Jeremy Fleischman (jfly), dgrig: https://meet.cccda.de/nix-osin-fra starts in 5 min according to my calendar. Arian already implied he can't join today. [16:56:00]
@hexa:lossy.network hexa: huh, arian implied he could demo the mdm stuff earlier today [16:57:43]
@arianvp:matrix.org Arian: Just wrote in the infra team chat. I have a monthly team meeting that overlaps :( [16:59:08]
@hexa:lossy.network hexa: ok [16:59:17]



Room Version: 6