Summit GPU node outage
Incident Report for CU Boulder RC
Resolved
This incident has been resolved.
Posted Aug 11, 2022 - 18:00 MDT
Monitoring
A fix has been implemented and we are monitoring the results.
Posted Aug 11, 2022 - 17:44 MDT
Identified
The Summit GPU nodes (the "sgpu" partition) is presently out of service so that we can implement a new node image to address Slurm issues. We anticipate this work will be completed shortly.

No other Summit partitions are impacted by this outage.
Posted Aug 11, 2022 - 14:13 MDT
This incident affected: RMACC Summit.