The root cause of this outage has been identified and cleared in the upstream campus network. The elements of the RC Core infrastructure that were affected were vulnerable to this upstream outage because of an unrelated migration in progress, and would not be vulnerable once the migration has completed.
Posted 3 months ago. Sep 29, 2018 - 23:36 MDT
Summit and Blanca have been returned to service, but we will continue monitoring to see if an upstream failure might cause us further issues.
Posted 3 months ago. Sep 28, 2018 - 16:45 MDT
An unplanned outage in the RC Core infrastructure has led to RC directory services being inaccessible. To prevent potential job failures the queueing systems on Summit and Blanca have been stopped until access to the directory is restored.
Posted 3 months ago. Sep 28, 2018 - 16:14 MDT
This incident affected: Research Computing Core, RMACC Summit, and Blanca.