path: root/tasks/machine-room.gmi
author: Pjotr Prins 2024-11-18 08:08:34 +0100
committer: Pjotr Prins 2024-11-18 08:08:39 +0100
commit: 75b93091cf87a266315f8bd8a00a0d60b3cc1069
tree: 15667030f46ab9f12b8f5d78f1a4125eaa9ac853
parent: 07d7b2e7d6e600bcf4645fa60939d7d51f6f4dcf
Tasks: update machine room tasks
Diffstat (limited to 'tasks/machine-room.gmi')
-rw-r--r-- tasks/machine-room.gmi 56
1 files changed, 32 insertions, 24 deletions
diff --git a/tasks/machine-room.gmi b/tasks/machine-room.gmi
index f6c7737..661419b 100644
--- a/tasks/machine-room.gmi
+++ b/tasks/machine-room.gmi
@@ -1,36 +1,37 @@
# Machine room tasks
-## Tags
+# Tags
* assigned: pjotrp
* priority: medium
* type: system administration
* keywords: system administration, octopus, gateway, tux02, tux01, tux03
-## Tasks
+# Tasks
-### UTHSC
+## GN
-* [ ] describe machines with Rick Stripes
-* [ ] get bacchus back on line
-* [ ] fix www.genenetwork.org and gn2.genenetwork.org https
+* [ ] Trait vectors for Johannes
+* [ ] !!Organize pluto, update Julia and add apps to GN menu Jupyter notebooks
+* [ ] !!Xusheng jumpshiny services
+* [ ] Slurm on production for GEMMA speedup
+* [ ] Embed R/qtl2 (Alex)
+* [ ] Hoot in GN2 (Andrew)
* [ ] tux02 certbot failing (manual now)
-* [ ] get data from summer211.uthsc.edu (access machine room)
-* [ ] VPN access and FoUT
* [ ] penguin2 has 32TB of space we can use on NFS/backups
-Network:
+## Octopus:
-* [ ] Octopus: wire up machines so they talk with each other over fiber
+* [ ] Fix Tux05 badblocks on /dev/sdb2 1050624 47925247 46874624 22.4G Linux filesystem
+* [ ] !!Ceph on Tuxes
+* [ ] Monitor nodes
+* [ ] Check machines so they talk with each other over fiber
-Lambda:
+## Backups & storage:
-* [ ] remote access? (with Erik)
- * [X] get BMC password
-
-Backups & storage:
-
-* [_] data warehousing
+* [ ] Create and check backups of tux04 etc etc.
+* [ ] set up zero to backup tux02 and report to redis
+* [ ] reintroduce borg-borg on zero
* [+] run sheepdog as root: redis password error; introduce SHEEPDOG_CONF
* [ ] tux01 has unused 4TB spinning disk
* [ ] tux02 has unused 2x4TB spinning disks and 2TB nvme /dev/nvme0n1 on adapter
@@ -39,22 +40,22 @@ Backups & storage:
fwupdmgr get-devices
fwupdmgr update
The previously problematic Samsung 980 Pro was basically using the 3B2QGXA7, and now Samsung has introduced a new 5B2QGXA7 firmware to fix the problem. The problem mainly affects the 2TB version of the 980 Pro
-* [ ] Check backups of etc etc.
Security:
* [ ] Limit idrac access
-* [X] space server out-of-band access
-### Spice
+## Spice
-* [ ] Run GN off balg01
* [ ] Add firewall test to sheepdog
-* [ ] Convert balg02 to Guix server
-* [ ] VM for student team
-### Done
+## Done
+* [X] describe machines with Rick Stripes
+* [X] get bacchus back on line
+* [X] fix www.genenetwork.org and gn2.genenetwork.org https
+* [-] get data from summer211.uthsc.edu (access machine room)
+* [X] VPN access and FoUT
* [X] lambda: get fiber working
* [X] lambda: add to Octopus HPC
* [X] lambda: racked up and runs
@@ -82,3 +83,10 @@ Security:
* [X] tux07 has no fiber
* [X] tux08 has no fiber
* [X] tux09 has no fiber
+### Lambda
+* [X] remote access? (with Erik)
+ * [X] get BMC password
+* [X] space server out-of-band access
+### Spice
+* [X] Run GN off balg01
+* [X] Convert balg02 to Guix server