All work has completed, and Summit has been returned to service.
Jan 6, 17:42 MST
The previously-announced electrical fault at the HPCF was fixed around 4:00 PM, and we have been bringing up affected Blanca and Summit nodes. Blanca has been returned to full operation and its PM reservations released. Core components of Summit have been powered on and are working, and compute nodes are being powered on now.
Jan 6, 17:09 MST
Today's planned maintenance activities have completed. However, an electrical fault occurred in the HPCF power distribution system during the Summit bringup. We have halted the Summit bringup process until a facilities electrician is able to assess and respond to the fault.
In the mean time, we intend to bring up the Blanca nodes that were affected by today's HPCF outage, as they should be unaffected by this electrical fault.
Jan 6, 15:16 MST
Summit, including Summit storage, and Blanca nodes in the HPCF, have been shut down as scheduled. This supports powering off the HPCF cooling system for regular maintenance.
We will be monitoring the facilities work throughout the day, and will restore Summit and Blanca to full production as soon as possible once the work has been completed.
Jan 6, 07:19 MST
In progress -
Scheduled maintenance is currently in progress. We will provide updates as necessary.
Jan 6, 07:00 MST
Research Computing will perform regularly-scheduled planned maintenance Wednesday, 6 January 2021. January's activities include
- HPCF tower cleaning and maintenance
Maintenance is scheduled to take place between 07:00 and 19:00, though service will be restored as soon as all activities have concluded. During the maintenance period no jobs will run on Summit resources, or Blanca resources that reside in the HPCF. This includes all "bhpc" nodes and a single Blanca GPU node. Summit storage will also be unavailable.
If you have any questions or concerns, please contact email@example.com
Dec 18, 18:31 MST