oph:cluster:messages
Differences
This shows the differences between the selected revision and the current version of the page.
oph:cluster:messages [2025/04/18 13:15] – [2025-04-18] Maint completed diego.zuccato@unibo.it | oph:cluster:messages [2025/05/09 11:09] (current version) – [2025-05-09] diego.zuccato@unibo.it | ||
---|---|---|---|
Line 10: | Line 10: | ||
===== 2025 ===== | ===== 2025 ===== | ||
- | <WRAP center round alert 60%> | + | ==== 2025-05-09 ==== |
- | :!: PLANNED MAINTENANCE :!: | + | * slowness resolved (backup completed) |
- | On 2025-04-16 the cluster will be turned off for maintenance. It will be turned back on on 2025-04-18. | + | |
- | </ | + | ==== 2025-05-08 ==== |
+ | * generalized slowness: due to the ongoing backup, access to /home is really slow. A concurrent check of the underlying RAID volume made it even worse. The check has been paused and the backup is nearing completion, so the system should soon return to normal | ||
+ | * mtx12 is offline again due to RAM issues | ||
+ | |||
+ | ==== 2025-04-22 ==== | ||
+ | * recreated missing reservations -- please check names with '' | ||
+ | * mtx12 is (temporarily, | ||
==== 2025-04-18 ==== | ==== 2025-04-18 ==== | ||
* Maintenance (nearly) completed. Two nodes are still down (gpu01 and gpu02), and some jobs might have failed when scheduled on misbehaving nodes (bld17 and bld18, now fixed) | * Maintenance (nearly) completed. Two nodes are still down (gpu01 and gpu02), and some jobs might have failed when scheduled on misbehaving nodes (bld17 and bld18, now fixed) | ||
+ | * **15:30 Update** all the nodes are currently working: don't break' | ||
==== 2025-04-10 ==== | ==== 2025-04-10 ==== | ||
* Created a reservation to avoid having running jobs during maintenance (**2025-04-16T10: | * Created a reservation to avoid having running jobs during maintenance (**2025-04-16T10: |
oph/cluster/messages.1744982133.txt.gz · Last modified: 2025/04/18 13:15 by diego.zuccato@unibo.it