forum

Unexpected Downtime 2011-02-06 UTC12:21-23:27

posted
Total Posts
46
Topic Starter
peppy
Hi guys. Was a long night so I'll keep this relatively brief! Firstly, we are up and running with no data loss (not that you should expect any less). Turns out the boot drive in the main server decided to lose a few bits which were quite important to booting. Read on for more technical details. I will be taking measures to ensure a faster recovery should this happen again. Thanks for your patience :).

Please take this moment to follow/bookmark @osustatus. I use it to get information out when the osu! website is not available.

Note that replay data will not be available for another hour or two while it is checked for consistency.

---- technical stuff below ----

The problem was quite blatantly obvious when grub decided to not recognise the boot partition. Usually this would be a very simple operation -- pop in a rescue cd and run fsck on the damaged partition, then reboot. Unfortunately due to the remote location of the server, there is quite a delay in communication and actions of the remove hands. It took around 9 hours of communication to get KVM (keyboard/video/mouse access at a very low level) access to the server, and then a while after to get a CD drive with a rescue cd.

Unfortunately, they decided to give me a dated CentOS 5 rescue disk, which didn't have reiserfs repair tools loaded (note for the future: use ext3/4 for boot partitions just so they are standard). Loading an rpm is near impossible on a rescue cd due to dependencies, so I decided to try my luck at compiling the required binary on my home server and transferring it across. This worked wonders.

The actual repair process (from the point of having access to a rescue CD) took around 26 minutes, for what it's worth.

It is interesting to note that the reiser node tree was in perfect shape. Replaying the transaction log was enough to fix the problem. Kind of makes you wonder...
Lunah_old
YAY! :D
DJ Angel
finnaly...osu...is back...

thanks peppy
Hiyorin
Thanks! Been waiting patiently all day :) Knew you would pull through
MysticTechnika
x_x Yay ~
jjrocks
Thanks peppy especially because it was an all nighter you had to pull of to do it
Natteke
Yowane _ Haku
i don't understand the english but I feel lucky.
Colin Hou
Orz
OzzyOzrock
At least it's back up now. o/
taq
:D YEAH
Pereira006
YOSHA !!!!! :D
James2250
Don't fully understand all of the technical stuff listed but basically sounds like the problem was waiting for other people more than the error itself (which is always very annoying)

Thanks for all the effort you put into restoring the server peppy~
Neo@lex
Why was it down in the first place?
FabledArc
Thanks for getting the server back up. :D It's greatly appreciated. ^_^
synchroblst
Basically what everyone said. I'm glad the server's back up, at one point I thought my connection decided to screw me over because everything didn't work. xD
Topic Starter
peppy
Replays are back now. Everything should be running as per normal.
NatsumeRin

Natteke wrote:

crystalsuicune
Thanks Peppy!
Glead to see that the servers are back :3
rockmmer
THANK YOU PPY :)
TKiller

peppy wrote:

Replays are back now.
sadly, they are not
Gabi
Sweet, maps don't take 2 hours to DL anymore :)

Natteke wrote:

Topic Starter
peppy

TKiller wrote:

peppy wrote:

Replays are back now.
sadly, they are not
Care to elaborate on this?
Zekira
There are still some maps that don't let me view the replays, like http://osu.ppy.sh/b/81342 . This was 40 minutes ago, however, so I'm not sure if it works now or not.
xsrsbsns
Good work on the fixes, replays work now (they weren't 5 minutes ago, for me)..
TKiller

peppy wrote:

Care to elaborate on this?
nevermind, everything started woking suddenly, dunno the reason, I didn't even restart osu!
Nakagawa_Kanon

peppy wrote:

(note for the future: use ext3/4 for boot partitions just so they are standard)
Yup, that's essential. I've used reiserfs long time ago and found compatibility terrible. So I use ext4 for / and btrfs for data drives.

GJ, Peppy!
Wishy
I just got two top scores at two different maps and I my replay isn't available in any of them, looks like the submitting replay system isn't working (it isn't that I care about the replays, but it'd be bad if this goes unnoticed for a long time).
Miu Matsuoka
Yay! It's back~ :)

but I can't be #1 anymore
Or login with any crazy names... x)
But thanks, peppy!
celiceyy
the replays which made just now are still not available to watch :( .
Bass
Well @Neoalex, server has been shutdown due to big spam, also anyone could change nick w/o making new account, rvrn to BanchoBot, or...Saturos.
DeeN
thank u ppy :)
JhowM
why all my recent plays aren't available for replaying?
i've played a lot today so i'm wondering why it's happening, since i've played AFTER the replays came back
i can watch only as local replay (because i always save local replay), but not on the online rankings.

Edit:I've just ranked #24 on Hirano Aya (Izumi Konata) - Motteke! Sera-Fuku [Normal] (Taiko)
and it replays fine

btw i got to go sleep, bye
SapphireGhost
Phew, this outage kept making me think there was something dreadfully wrong with my internet. :P
SoulAlchemist
I don't understand all of the technical stuff but I'm still happy that the sever is back up.
Thank you ppy.
DoZzeR
Nice job peppy! Good thing that osu! is back :)
Garven
Appreciate the efforts you have to go through to keep this place running. o/
Curisu
It's great~nice job ppy ^0^/
seeker_nami
Thanks to this I've realized how chaotic life without Osu! can be... hell broke loose for several hours! x.x

You're THE man Peppy! Thanks ^___^v
Semi_Semi
thanks><
psyclone
Thanks! (^o^)/
Guy-kun
Good job with deciding to compile the binary, sounds like you had some fun there.
<:
qlum
I almost thought it was because of the forest fires in your area
TBTE

Neo@lex wrote:

Why was it down in the first place?
You Don't Read
Jelly
Yay! :)
BTW, when Bancho was shut down, there was a message that it was because there's not enough people to moderate or something [?]
Cristian
Good luck the next time ;)
Please sign in to reply.

New reply