How a power blip briefly broke GitHub’s boxes and tripped it offline

February 4th 2016

Github is relied on by thoundands of IT companies experienced an outage on 28th January 2016, resulting in those companies unable to work, therefore loss of revenue.

Whats interesting is that github’s internal communication system that engineers use also relies on the same data centre which delayed engineers communications. Surely it would have been a much better idea to at least use a separate data centre or even a different service.

In order to prevent this happening again, there should be redundant roll-over servers that can be switched on at anytime to restore the service. Policy should be re-written to include different contact procedures in the event of an emergency.