OMG: whitby is being rebooted

#210
Opened by tazjin at 2022-10-12T07·10+00

This is a tracking issue for scheduled maintenance of whitby. It has been up for quite a long time:

tazjin@whitby ~> uptime
 07:05:30  up 831 days 12:59,  1 user,  load average: 0.17, 0.31, 0.32

Rebooting whitby is tricky because the intended process includes an initrd SSH server in which we enter the disk encryption password, but we have never actually done this.

Before actually rebooting, the following checklist will be run:

We will reboot with the KVM console attached and monitor the reboot.

After rebooting, we will run this checklist:


  1. During whitby's uptime, nixpkgs has had numerous bugs that broke the writing of bootloader entries.

  1. tazjin updated the body of this issue at 2022-10-12T07·14+00
  2. tazjin updated the body of this issue at 2022-10-12T07·15+00
  3. tazjin updated the body of this issue at 2022-10-12T07·27+00
  4. tazjin updated the body of this issue at 2022-10-12T07·33+00
  5. tazjin updated the body of this issue at 2022-10-12T07·37+00
  6. Current whitby system generation is 393 after canonicalising at latest HEAD. This matches the latest entry in grub.cfg, making me think the bootloader is up-to-date.

    tazjin at 2022-10-12T07·39+00

  7. tazjin updated the body of this issue at 2022-10-12T07·40+00
  8. tazjin updated the body of this issue at 2022-10-12T07·41+00
  9. First problem: Can't get the HTML5-based KVM console to work anymore. It just shows me some green blobs, but the little thumbnail screenshot looks correct.

    Hetzner's fallback thing is a JavaWS application (of course), so I'm trying to figure out how to run that right now.

    tazjin at 2022-10-12T07·48+00

  10. Current status:

    [screenshot: KVM console]

    That's good enough for me, we're going in.

    tazjin at 2022-10-12T07·54+00

  11. Unlocking the disk over SSH worked perfectly fine.

    tazjin at 2022-10-12T08·01+00

  12. IPv4 works (I connected over it), and IPv6 also seems to have come back up normally:

    tazjin@sanduny ~> ping -6 whitby.tvl.su
    PING whitby.tvl.su(whitby.tvl.fyi (2a01:4f8:242:5b21:0:feed:edef:beef)) 56 data bytes
    64 bytes from whitby.tvl.fyi (2a01:4f8:242:5b21:0:feed:edef:beef): icmp_seq=1 ttl=52 time=23.3 ms
    

    tazjin at 2022-10-12T08·01+00

  13. tazjin updated the body of this issue at 2022-10-12T08·02+00
  14. tazjin closed this issue at 2022-10-12T08·02+00
  15. Minor problems that occurred:

    • irccat and dependent services started in a failed state; restarting them fixed it

    • panettone also started in a failed state, and restarting it fixed it

    tazjin at 2022-10-12T08·10+00
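
For reference, the bootloader check from the comment about generation 393 could be scripted roughly as follows. This is only a sketch under assumptions that are not stated in this issue: that the system profile is a symlink whose target is named like `system-393-link`, and that grub.cfg labels its menu entries with `Configuration <generation>`. The function names and the paths in the usage note are illustrative, not taken from whitby's actual setup.

```shell
#!/usr/bin/env bash
# Sketch: compare the current system generation with the entries in grub.cfg.
# Assumed (not confirmed by the issue): profile link targets look like
# "system-393-link" and grub.cfg entries contain "Configuration 393".

# Print the generation number encoded in a profile symlink's target.
current_generation() {
  local profile="$1"
  local target
  target="$(readlink "$profile")" || return 1
  target="${target##*/}"        # drop any leading directories
  target="${target#system-}"    # drop the "system-" prefix
  printf '%s\n' "${target%-link}"
}

# Succeed if the given grub.cfg has an entry for the given generation.
# -w avoids matching generation 39 against an entry for 393.
grub_has_generation() {
  local grub_cfg="$1"
  local generation="$2"
  grep -qw -- "Configuration ${generation}" "$grub_cfg"
}
```

On a NixOS machine this might be called as `grub_has_generation /boot/grub/grub.cfg "$(current_generation /nix/var/nix/profiles/system)"`, with both paths depending on how the machine is configured. A non-zero exit status would suggest the bootloader entries are stale and worth investigating before rebooting.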