Scheduled Maintenance for SGI Altix UV, SGI Altix ICE 8400, and IBM iDataPlex HTC
SGI Altix UV, SGI Altix ICE 8400, and IBM iDataPlex HTC scheduled maintenance windows occur, as necessary, during these periods:
These maintenance windows are periods when UITS may drain the queues of running jobs and suspend access to the cluster for HPC/HTC maintenance.
The maintenance periods are monthly, and interruptions are kept as brief as possible. Before performing maintenance during any of these windows, UITS will notify users via the HPC-Announce list at least 10 days in advance. The notification will describe the nature and extent (partial or full) of the interruption to HPC/HTC services.
Batch Queues Maintenance
Before a scheduled downtime, batch queues will also be modified to hold any job that requests more wallclock time than remains before the shutdown, unless the job request specifies that the job uses checkpoint/restart.
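As an illustration of the hold rule, the short Python sketch below checks whether a job would be held. It is not the scheduler's actual logic; the function name `should_hold`, its parameters, and the sample dates are assumptions made for this example only.

```python
from datetime import datetime, timedelta


def should_hold(now, shutdown_start, requested_walltime, uses_checkpoint=False):
    """Return True if a job would be held ahead of a scheduled shutdown.

    Hypothetical helper: a job is held when it requests more wallclock
    time than remains before the shutdown, unless it declares
    checkpoint/restart.
    """
    if uses_checkpoint:
        # Checkpointing jobs are exempt from the hold.
        return False
    remaining = shutdown_start - now
    return requested_walltime > remaining


# Example values (assumed): the shutdown begins 24 hours from "now".
now = datetime(2012, 1, 10, 8, 0)
shutdown_start = datetime(2012, 1, 11, 8, 0)

long_job_held = should_hold(now, shutdown_start, timedelta(hours=30))
short_job_held = should_hold(now, shutdown_start, timedelta(hours=12))
checkpoint_job_held = should_hold(
    now, shutdown_start, timedelta(hours=30), uses_checkpoint=True
)
```

In practice, a job that does not checkpoint can avoid being held by requesting no more wallclock time than remains before the announced shutdown.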
Reasons for Scheduled Maintenance Include:
Unavoidable (emergency) downtime may occur as a result of any of the above reasons at almost any time. Such events are rare, and great effort is made to avoid them. When emergency maintenance is needed, the UITS unit responsible for the affected item will give users as much notice as possible and work to resolve the fault as quickly as possible.
Notifications and Communications to HPC Users
The following notification practices apply to HPC/HTC users for all software or hardware maintenance, hardware installation, planned outages, and unplanned outages.