Unplanned disk array failure /storage/brno2/

18-24.3.2023 - Unplanned disk array failure /storage/brno2/

Update 03/27/2023: There is another problem, it will be fixed in a few hours. Please be patient. The disk array was returned to service in the afternoon the same day.

Update 03/24/2023: The /storage/brno2/ disk array is back in full operation. Data remains intact.

-----------

Dear user,

On Saturday afternoon (18 March) there was a HW failure of the /storage/brno2/ disk array. We are working on getting it back up and running in cooperation with the supplier. We are not yet able to say when the array will be operational. The supplier is proceeding carefully so that we do not lose the stored data.

It is not possible to log in to frontends where this array serves as /home (skirit, onyx) and the disk array cannot be accessed from elsewhere (from other frontends or nodes). OnDemand is also affected.

Running jobs that copy output back to the array fail to do this, and the data remains in the scratch on the appropriate node where it was running. To access the data on the compute nodes, use the following shortcut:

     go_to_scratch JOB_NUMBER_INCLUDING_PBS_SERVER_NAME
     FOR EXAMPLE 
     tarkil.grid.cesnet.cz$ go_to_scratch 79868.meta-pbs.metacentrum.cz

You can use other frontends (https://wiki.metacentrum.cz/wiki/Frontend) and disk arrays during the outage.

With apologies and thanks for understanding
MetaCenter Team

Ivana Křenková, Sat Mar 18 10:16:00 CET 2023