SCIAMA

High Performance Compute Cluster

Menu
  • Home
  • Getting Started
    • How to get an account
    • How to Log in
    • Log in from Windows
    • Using X2Go
    • Create an SSH key pair
    • SSH Fingerprints
    • Using SSH keys
    • Training Exercises
    • Slurm MPI Batch Script
  • Using SCIAMA
    • Using Jupyter Notebooks
    • Using JupyterHub
    • Using Spyder
    • Storage Policy
    • Submitting jobs
    • Using GPUs
    • Using Matlab
    • Using Conda Environments
    • SLURM for PBS Users
    • File transfers
    • GitHub/BitBucket
    • Cloud Backup
    • Reserving Compute Nodes
    • Acknowledging SCIAMA
  • Resources
    • Hardware
    • Queues
    • Software
      • Using Software Modules
      • Changing Default Modules
      • Mixing MPI and OpenMP
      • Using CONDA
  • System Status
    • Core Usage
    • Disk Usage
    • Users Logged In
    • Jobs Status
    • Jobs by Group
    • Node Status
    • SCIAMA Grafana Dashboard
    • Maintenance and Outages
  • Contact Us

Scheduled Maintenance and Outages

24th January

We have a critical hardware failure on SCIAMA which requires urgent replacement. This requires us to stop lustre. This is a reschedule of last weeks attempt. During the outage you will not be able to read/write any data to /mnt/lustre.

January/February

IS are upgrading the Cisco network switches that SCIAMA uses to connect to the outside world. This upgrade will temporarily disrupt connection to SCIAMA's login nodes and Jupyterhub server for a few minutes. Actual date to be confirmed.

5th September

We have a critical hardware failure on SCIAMA which requires urgent replacement. This requires us to stop lustre and take the users' home directories offline. We have scheduled this for Monday 5th September. During the outage you will not be able to log on to SCIAMA or read/write any data to /mnt/lustre so we suggest you hold off running any further jobs until afterwards.

25th July

SCIAMA is currently experiencing outages. Thank you for your patience while we are working to restore the system.

20th June - 25th June

We will be scheduling regular maintenance windows for SCIAMA to carry out any updates to the OS or applications, the first one is scheduled for 20th June for a week. During this period SCIAMA will be unavailable and any jobs submitted to SLURM that have not completed will be cancelled!

Imortant updates to the OS and SLURM will take a place during this maintenance window.

13th April - 15th April

SCIAMA will need to be rebooted to mount the lustre storage, updates to SLURM scheduler and users home directory. SCIAMA will not be available during this period.

24th -25th March 2022

Urgent maintenance is required to one of the SCIAMA racks. Nodes 224-247 will need to be powered down whilst this maintenance takes place.

I have begun the process of draining the nodes, this affects sciama3.q where only 24 nodes will be available, please bear this in mind when submitting your jobs.

If your job is still running on Monday it may get cancelled, apologies for any inconvenience but the work is necessary.

Copyright © 2022 ICG, University of Portsmouth