summaryrefslogtreecommitdiff
path: root/tasks/machine-room.gmi
blob: 83e562b37e58160444ef3750aedffc7e9266b661 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
# Machine room tasks

## Tags

* assigned: pjotrp
* priority: medium
* type: system administration
* keywords: system administration, octopus, gateway, tux02, tux01, tux03

## Tasks

### UTHSC

* [ ] get data from summer211.uthsc.edu (access machine room)
* [ ] decommission/surplus out-racked machines (whith @arthurc)
      + see also ../issues/systems/decommission-machines.gmi
* [ ] Install tux04-tux09
* [ ] VPN access and FoUT

Network:

* [ ] use fiber optics for subnet Octopus and Tuxes

Backups & storage:

* [+] Order storage and caddies (w. Tamara)
* [_] data warehousing
* [+] run sheepdog as root: redis password error; introduce SHEEPDOG_CONF
* [ ] tux01 has unused 4TB spinning disk
* [ ] tux02 has unused 2x4TB spinning disks and 2TB nvme /dev/nvme0n1 on adapter
      https://www.cyberciti.biz/faq/upgrade-update-samsung-ssd-firmware/
      apt-get install fwupd fwupdate
      fwupdmgr get-devices
      fwupdmgr update
      The previously problematic Samsung 980 Pro was basically using the 3B2QGXA7, and now Samsung has introduced a new 5B2QGXA7 firmware to fix the problem. The problem mainly affects the 2TB version of the 980 Pro
* [ ] Check backups of etc etc.

Security:

* [?] DMZ plan
* [ ] Limit idrac access
* [ ] space server out-of-band access

### Spice

* [ ] Run GN off balg01
* [ ] Firewall out of band
* [ ] Add storage
* [ ] Convert balg02 to Guix server

### Done

* [X] Tux01 and Tux02 disk space issues
* [X] Reinstate backup drops on tux02, rabbit, &space and &epysode; reduce incoming IP
* [X] Pluto tool with Zach & Efraim
* [X] Order drives and caddies tux01 & tux02 (with @haoc)
* [X] Introduce &disk space and mdstat monitor
* [X] Machine room HDDs