!RROtHmAaQIkiJzJZZE:nixos.org

NixOS Infrastructure

387 Members
Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real time alerts from Prometheus.
120 Servers



Sender, Message, Time
6 Oct 2021
@vcunat:matrix.orgVladimír Čunát I assume that you correctly see cache.nixos.org. CNAMEd to dualstack.v2.shared.global.fastly.net. and it's some local problem with Fastly. (On my end I'm getting different IPs and cert seems accepted.) 21:32:11
@vcunat:matrix.orgVladimír Čunát
$ openssl s_client -servername cache.nixos.org -connect 195.201.36.118:443
[...]
Certificate chain
 0 s:CN = nextcloud.isardvdi.com
   i:C = US, O = Let's Encrypt, CN = R3
 1 s:C = US, O = Let's Encrypt, CN = R3
   i:C = US, O = Internet Security Research Group, CN = ISRG Root X1
 2 s:C = US, O = Internet Security Research Group, CN = ISRG Root X1
   i:O = Digital Signature Trust Co., CN = DST Root CA X3
21:34:01
@vcunat:matrix.orgVladimír ČunátThat's not just expired but very wrong cert.21:34:11
@vidister:entropia.devidister / fiona * Yeah, I don't know why it resolved to that IP, I can't get it to resolve that again, but it's still in the cache on my computer. dafuq?21:36:25
@vcunat:matrix.orgVladimír ČunátThe IP belongs to Hetzner, according to whois.21:37:13
@sandro:supersandro.deSandroRestart systemd-resolved maybe?21:37:20
@vcunat:matrix.orgVladimír ČunátIt really sounds like some DNS problem near your end.21:38:13
@sandro:supersandro.deSandro
In reply to @vcunat:matrix.org
The IP belongs to Hetzner, according to whois.
The sources of this data are not always up to date, and I have gotten really bogus results in the past, like 3 different results from 3 services about location, owner and so on
21:38:34
@vcunat:matrix.orgVladimír Čunát... though theoretically it is possible to be some Fastly DNS problem.21:38:40
@vidister:entropia.devidister / fionaI'm not running systemd-resolved and the machine is using 1.1.1.1/8.8.8.8. I don't know where that result came from21:39:11
@vidister:entropia.devidister / fionathis is so weird21:39:14
@vidister:entropia.devidister / fionawell, thanks for helping, I'm super confused and I'll just flush my dns cache and stop investigating here..21:40:44
@vcunat:matrix.orgVladimír ČunátPerhaps, if it seems like a one-time issue, but it's weird.21:42:27
7 Oct 2021
@zimbatm:numtide.comJonas Chevalierthere is always the cosmic ray that can flip a bit :)08:12:02
@zimbatm:numtide.comJonas Chevalier * I'm investigating channel updates. nixos-unstable stopped pushing. (investigation: https://github.com/NixOS/nixos-org-configurations/issues/180)08:17:03
@zimbatm:numtide.comJonas Chevalierdid anybody cancel builds recently? It looks like it was just caused by a bunch of cancelled builds.08:44:20
@vcunat:matrix.orgVladimír ČunátThis case was probably me.08:53:57
@vcunat:matrix.orgVladimír Čunát If some jobs are required for channel updates to succeed, shouldn't we make the critical job (tested here) depend on all those jobs? 08:54:39
@vcunat:matrix.orgVladimír Čunát Well, I wonder where to reply... seeing the question on one issue and two chat channels. 08:56:36
@vcunat:matrix.orgVladimír Čunát ☝️ Jonas Chevalier 08:57:23
@zimbatm:numtide.comJonas Chevalierthat's fine, I'm just trying to better understand what happened08:58:38
@zimbatm:numtide.comJonas ChevalierI'm not super familiar with Hydra and the channel-update logic08:59:37
@zimbatm:numtide.comJonas Chevalier Vladimír Čunát: what are typical conditions where builds get canceled? 09:00:31
@vcunat:matrix.orgVladimír Čunát I do it in a few cases 09:01:28
@vcunat:matrix.orgVladimír Čunát
  1. replaced builds: if we have newer builds of the same jobs (scheduled) and keeping the older ones doesn't seem very useful anymore. (makes sense when there are many of them, especially if combined with high load on Hydra)
09:02:52
@vcunat:matrix.orgVladimír Čunát
  2. channel waiting on bad test. Sometimes the channel would update, but it waits for all builds to finish. And sometimes there are a few tests that never succeeded recently and just wait for a long time-out. Cancelling those stragglers can speed up the channel update. (makes sense if the channel is quite old currently, e.g. after some period of channel blockers)
09:04:34
@zimbatm:numtide.comJonas ChevalierIs anybody else keeping a pulse on Hydra like you do?09:11:43


