Resolved: Power outage in the Shared Computing Cluster

Incident Discovery Time: 06:39am on 12/18/2023 Time of Resolution: 10:15am on 12/18/2023 Services Impacted: Research Computing

Description of Impact

Two ITIC Level 4 power sag events occurred at 6:39am on Monday Dec. 18 at the MGHPCC. The power sag caused a circuit breaker to trip and power was lost to all non-UPS Shared Computing Cluster (SCC) systems.

Incident Description and Resolution

At 9:15 am power was restored to the SCC racks. RCS Systems Team successfully restored SCC operations by 10:15 am. If you continue to have issues, please contact the IT Help Center.

Previous Update

Incident Discovery Time: 08:00am on 12/18/2023 Services Impacted: Research Computing

Description of Impact

There has been an unplanned power outage in the SCC.

Current Status

IS&T teams have not yet identified the cause of the incident, but are investigating.

Additional Information

The power may be back on shortly, after circuit breakers are replaced. Next Update: 11:30am