NixOS Infrastructure - Public Room Timeline

	NixOS Infrastructure	379 Members
	Next Infra call: 2024-07-11, 18:00 CEST (UTC+2) | Infra operational issues backlog: https://github.com/orgs/NixOS/projects/52 | See #infra-alerts:nixos.org for real time alerts from Prometheus.	117 Servers

Sender	Message	Time
21 Sep 2025
hexa	anyway, these are my thoughts. enjoy	21:30:58
dgrig	ECC would be good to have, but I wonder whether those Intel 13900s are affected by this, or whether Hetzner RMA'd or patched them all fast enough	21:35:57
hexa	supposedly fixed by firmware updates a while ago	21:37:17
	Nico joined the room.	21:42:45
Tom	I think you'll end up paying for and receiving the default two 1 TB drives anyway (unless you tell them to remove those). So if you add 2x 1tb you're getting 1 TB, 1 TB, 2 TB and 2 TB. Just mentioning it in case that wasn't considered and somehow affects the plans.	21:45:14
hexa	yeah, I'm aware.	21:45:53
hexa	(re: "ECC would be good to have") ECC has been doing its thing on haumea for a few months, so we're not going to give that up	22:33:24
hexa	the primary reason we're making a move	22:33:47
hexa	`Memory controller events summary: Corrected on DIMM Label(s): 'mc#0csrow#1channel#0' location: 0:1:0:-1 errors: 4973 MCE records summary: 9168 Corrected error, no action required. errors`	22:34:20
22 Sep 2025
	@felix.schroeter:scs.ems.host changed their display name from Felix Schröter to Felix Schröter (🌄 29.09. – 05.10.).	09:55:59
Vladimír Čunát	Some Hydra metrics are gone now: https://grafana.nixos.org/d/MJw9PcAiz/hydra-jobs	16:09:52
Vladimír Čunát	I think I saw them working less than 24h ago.	16:11:08
ghpzin	`instance` label changed from `monitoring.nixos.org:9200` to `localhost:9200`, so dashboard shows nothing: https://grafana.nixos.org/goto/DdQG9uCNg?orgId=1 https://github.com/NixOS/infra/commit/0ebab0affe655b6a278843124a10637c50e6e78d#diff-7dbeda5809ed539867d6bc0ec13abd3ecfd987097db6c868533588c2e36b3d82L54	16:48:41
hexa	yeah, was fairly pointless to expose the metric externally	17:08:37
hexa	unclear why grafana shows no result, but with prometheus I see the metric	17:12:13
hexa	https://prometheus.nixos.org/query?g0.expr=hydra_steps_queued&g0.show_tree=0&g0.tab=table&g0.range_input=1h&g0.res_type=auto&g0.res_density=medium&g0.display_mode=lines&g0.show_exemplars=0	17:12:20
ghpzin	Metrics are still there, dashboard hardcodes `instance="monitoring.nixos.org:9200"`	17:15:55
Vladimír Čunát	I think I fixed the graphs now.	17:16:37
Vladimír Čunát	Just edited the dashboard.	17:17:02
sterni (he/him)	Do any (Darwin) builders use Nix 2.30 already? We've noticed an interesting problem: the new build-dir location results in longer paths, which can fairly easily cause failures because unix socket paths are apparently quite restricted on macOS. (Observed with `haskellPackages.network`.)	19:50:07
emily	IIRC this was mitigated in the latest patch release	19:50:54
emily	with possible further changes to come	19:51:07
emily	(the limits are similarly strict on Linux btw, just the sandbox helps there)	19:51:21
hexa	no, they track nixpkgs-stable	19:51:36
sterni (he/him)	https://github.com/NixOS/nix/pull/13839?	19:51:40

