SCIAMA
High Performance Compute Cluster
Scheduled Maintenance and Outages
24th January
We have a critical hardware failure on SCIAMA which requires urgent replacement. This requires us to stop lustre. This is a reschedule of last weeks attempt. During the outage you will not be able to read/write any data to /mnt/lustre.
January/February
IS are upgrading the Cisco network switches that SCIAMA uses to connect to the outside world. This upgrade will temporarily disrupt connection to SCIAMA's login nodes and Jupyterhub server for a few minutes. Actual date to be confirmed.
5th September
We have a critical hardware failure on SCIAMA which requires urgent replacement. This requires us to stop lustre and take the users' home directories offline. We have scheduled this for Monday 5th September. During the outage you will not be able to log on to SCIAMA or read/write any data to /mnt/lustre so we suggest you hold off running any further jobs until afterwards.
25th July
SCIAMA is currently experiencing outages. Thank you for your patience while we are working to restore the system.
20th June - 25th June
We will be scheduling regular maintenance windows for SCIAMA to carry out any updates to the OS or applications, the first one is scheduled for 20th June for a week. During this period SCIAMA will be unavailable and any jobs submitted to SLURM that have not completed will be cancelled!
Imortant updates to the OS and SLURM will take a place during this maintenance window.
13th April - 15th April
SCIAMA will need to be rebooted to mount the lustre storage, updates to SLURM scheduler and users home directory. SCIAMA will not be available during this period.
24th -25th March 2022
Urgent maintenance is required to one of the SCIAMA racks. Nodes 224-247 will need to be powered down whilst this maintenance takes place.
I have begun the process of draining the nodes, this affects sciama3.q where only 24 nodes will be available, please bear this in mind when submitting your jobs.
If your job is still running on Monday it may get cancelled, apologies for any inconvenience but the work is necessary.