!oNSIfazDqEcwhcOjSL:matrix.org

disko

372 Members
disko - declarative disk partitioning - https://github.com/nix-community/disko97 Servers

Load older messages


SenderMessageTime
30 May 2024
@lassulus:lassul.uslassulusbut nixos-anywhere usually takes care of that by running a kexec11:20:45
@waltmck:matrix.orgwaltmckwhen you initialize a new partition, you shouldn't be assuming that the disk is zeroed out11:20:47
@waltmck:matrix.orgwaltmckRight, disko runs after a kexec11:21:07
@lassulus:lassul.uslassulusah, we run wipefs before running disko11:22:06
@lassulus:lassul.uslassulushttps://github.com/nix-community/disko/blob/master/disk-deactivate/disk-deactivate.jq#L3311:22:06
@waltmck:matrix.orgwaltmckI've tried this many times across reboots and it is totally reproducible. I haven't tried manually zeroing out the disks though just because I assumed that semantically the curring contents of a disk shouldn't matter when reformatting11:22:13
@waltmck:matrix.orgwaltmck
In reply to @waltmck:matrix.org
I've tried this many times across reboots and it is totally reproducible. I haven't tried manually zeroing out the disks though just because I assumed that semantically the curring contents of a disk shouldn't matter when reformatting
current
11:22:43
@lassulus:lassul.uslassuluswell they shouldn't, but sometimes, if there is already an mdadm it can get activated at random times and that can interfere for example with other stuff11:23:28
@waltmck:matrix.orgwaltmckInteresting, I'm not really sure how that works. How does the mdadm get activated? Is the problem that writes to the virtual device might be concurrent with writes to the underlying devices?11:24:34
@lassulus:lassul.uslassulusnot sure when that exactly happens, otherwise I would have reproduced and fixed it :) but I have seen raid devices being activated later, some minutes after booting11:26:34
@lassulus:lassul.uslassulusif this happens after we run the disk-deactivate script, things get wonky11:27:05
@lassulus:lassul.uslassulusbut not sure that's even the issue here, I guess you can check if there are any lingering raid devices active11:28:02
@waltmck:matrix.orgwaltmckI could give you access to the server if it's helpful for you to debug this issue with disko11:28:08
@waltmck:matrix.orgwaltmck(independently of my issue, if you are having trouble reproducing)11:28:23
@waltmck:matrix.orgwaltmckthere's nothing on it, I could just wipe it after you're done11:29:51
@lassulus:lassul.uslassulushmm, not sure I have the time to debug that further :) also not sure if the issue would happen if disko is run again?11:31:07
@waltmck:matrix.orgwaltmckI've restarted the server a few times and the issue persists11:31:45
@waltmck:matrix.orgwaltmckI'll restart a few more times and hopefully that will fix the issue. If not, I'll let you know11:33:01
@lassulus:lassul.uslassulushmm, it also fails after the partprobe11:37:56
@lassulus:lassul.uslassulusand the zap should after the partprobe11:38:02
@lassulus:lassul.uslassulusthis could be another issue11:38:08
@waltmck:matrix.orgwaltmckyep, issue just persisted under another full reboot. Reboots wipe all of the rescue system state so the problem is either in my config file or in the disk state11:38:14
@lassulus:lassul.uslassulushttps://github.com/nix-community/disko/pull/65411:40:41
@lassulus:lassul.uslassulusah forgot to rebase11:41:23
@waltmck:matrix.orgwaltmck ahh, I will try running sgdisk --zap-all 11:41:25
@waltmck:matrix.orgwaltmckI think it worked11:43:10
@waltmck:matrix.orgwaltmck

the output for sgdisk --zap-all /dev/md/raid1 was

Warning! Disk size is smaller than the main header indicates! Loading
secondary header from the last sector of the disk! You should use 'v' to
verify disk integrity, and perhaps options on the experts' menu to repair
the disk.
Warning! One or more CRCs don't match. You should repair the disk!
Main header: OK
Backup header: OK
Main partition table: OK
Backup partition table: ERROR

****************************************************************************
Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT, but disk
verification and recovery are STRONGLY recommended.
****************************************************************************
GPT data structures destroyed! You may now partition the disk using fdisk or
other utilities.
11:43:32
@waltmck:matrix.orgwaltmckgreat that fixed everything11:46:36
@lillecarl:matrix.orglillecarl lassulus: Out of curiosity, would changing the size of partitions resolve this? Since /dev/md/raid1 header would be at a different location.... right? 12:37:49
@lassulus:lassul.uslassulusnot sure, probably12:38:46

Show newer messages


Back to Room ListRoom Version: 10