We are experiencing some difficulties with aphrodite.krystal.co.uk – the server was rebooted due to an unknown software error around 17:05hrs. It is now experiencing an incredibly high load at what is a peak time. We are currently waiting for the server to stabilise pending an investigation into the nature of the original problem.
Update 24/04 19:32 One of the hard drives in Aphrodite’s RAID array has failed. We’re now removing the affected hard drive and will boot the server as soon as it’s possible to do so safely. Please bear with us.
Update 19:48 The server is now back up, though in a “degraded” RAID mode – it only has 1 hard drive instead of the normal 2. Performance will be adversely affected until the RAID array is rebuilt. Thanks for your patience.
Update 20:33 The failed hard drive has been replaced and the RAID array is currently rebuilding. We expect this to take around 12 hours, during which time the server will be less responsive than normal as the data set is rebuilt. We therefore advise users to refrain from any heavy usage and to be aware that there is a higher than normal chance of an outage.
Update 21:41 The rebuild is at 20%
Update 00:15 The rebuild is at 80%
Update 25th April 01:00 The rebuild of the RAID array is now complete. We can’t be 100% sure that this problem was caused by a failing disk, so we will maintain our extra monitoring on this server for some time.