!RROtHmAaQIkiJzJZZE:nixos.org

NixOS Infrastructure

388 Members
Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real time alerts from Prometheus.121 Servers

Load older messages


SenderMessageTime
3 Sep 2021
@vcunat:matrix.orgVladimír Čunát * It seems quite a waste, and apparently I have to restart the stuff manually as well.16:59:35
@grahamc:nixos.org@grahamc:nixos.orghttps://monitoring.nixos.org/prometheus/graph?g0.expr=up%7Binstance%3D%22546ef6b6.packethost.net%22%7D&g0.tab=0&g0.stacked=0&g0.range_input=1w17:00:40
@grahamc:nixos.org@grahamc:nixos.org taking a look at this one machine's up status over the past week indicates that indeed it booted, lived for about 25 minutes, and then terminated 17:01:13
@grahamc:nixos.org@grahamc:nixos.orgit looks like their spot market is churning indeed17:03:39
@vcunat:matrix.orgVladimír ČunátMaybe I'd prefer scheduling big-parallel builds to machines that seem likely to live longer. Assuming this is a common situation and we can tell in advance if it might happen.17:03:54
@grahamc:nixos.org@grahamc:nixos.orgthis isn't a common situation really17:04:11
@grahamc:nixos.org@grahamc:nixos.orgthe machines we get are usually very stable and long lived17:04:18
@vcunat:matrix.orgVladimír ČunátThen it's OK, I think. Jobs need some babysitting anyway.17:04:49
@grahamc:nixos.org@grahamc:nixos.orgsometimes their spot market auction process gets in to a bad state and churns and churns17:05:02
@vcunat:matrix.orgVladimír ČunátBlack Friday today?17:07:54
@baloo_:matrix.orgbaloono it's in November17:08:47
@grahamc:nixos.org@grahamc:nixos.orghttps://monitoring.nixos.org/grafana/d/wiaOmQ4nk/equinix-metal-churn?orgId=1&refresh=10s&var-instance_class=m1.xlarge.x86&var-facility=All17:20:39
@grahamc:nixos.org@grahamc:nixos.orgyou can see the churn very clearly here17:20:56
@grahamc:nixos.org@grahamc:nixos.organd one of the causes of the spot price in the top graph17:21:05
@grahamc:nixos.org@grahamc:nixos.orgams1 isn't super happy either17:23:27
@grahamc:nixos.org@grahamc:nixos.orgI'll take both out17:23:30
@grahamc:nixos.org@grahamc:nixos.orgwell now I've really stepped in it17:29:44
@grahamc:nixos.org@grahamc:nixos.orgrecreating the spot market request deleted all the existing requests (I expected this) then a bug in either the terraform provider or their API made the "create" step fail17:42:37
@grahamc:nixos.org@grahamc:nixos.orgworking with their support team18:05:02
@grahamc:nixos.org@grahamc:nixos.orglooks like we're getting hardware upgrades18:47:25
@grahamc:nixos.org@grahamc:nixos.orgI did not expect to spend half the work day on this 🙃19:05:53
4 Sep 2021
@grahamc:nixos.org@grahamc:nixos.orgokay, I finished the work to get these new classes of hardware in hydra03:01:31
@grahamc:nixos.org@grahamc:nixos.orgwent from machines with: 2x 2 x Intel Xeon E5-2650 v4 (ie: 24 cores, 2.2ghz) and 256G ram to: 1x 1 x AMD EPYC 7402P (ie: 24 cores, 2.8ghz) and 64G RAM and 1x 1 x AMD EPYC 7402P (ie: 24 cores, 2.8ghz) and 256G RAM 03:03:12
@grahamc:nixos.org@grahamc:nixos.orgthat second machine has 2x 25Gbps network connections :drool: 03:03:41
@grahamc:nixos.org@grahamc:nixos.org * went from machines with: 2x 2 x Intel Xeon E5-2650 v4 (ie: 24 cores, 2.2ghz) and 256G ram to: 1x 1 x AMD EPYC 7402P (ie: 24 cores, 2.8ghz) and 64G RAM and 1x 1 x AMD EPYC 7502P (ie: 32 cores @ 2.5Ghz) and 256G RAM 03:05:43
@lukegb:zxcvbnm.ninjalukegb (he/him)Ooh03:20:14
@lukegb:zxcvbnm.ninjalukegb (he/him) Thanks so much for your work grahamc (he/him) ❤️ 03:20:34
@grahamc:nixos.org@grahamc:nixos.org:) 03:22:29
@grahamc:nixos.org@grahamc:nixos.orgit is possible I broke the aarch64 machines while updating the channel03:43:04
@grahamc:nixos.org@grahamc:nixos.orgI'm not too stressed about that, but I'm also not likely to fix that tonight03:43:18

Show newer messages


Back to Room ListRoom Version: 6