SAUK Discussion Board

Go Back   SAUK Discussion Board > Board Management > The Garage
Join! Blogs FAQ Calendar Today's Posts Search

Notices

Reply  Post New Thread
 
Thread Tools
  #1  
Old 25th January 2012, 10:45
Occultus Occultus is offline
Administrator
 
Join Date: Apr 2006
Location: SAUK HQ
Posts: 18,330

Mood
Tired

Default SAUK Outage Jan 2012

Welcome back, everyone

As you should be aware by now, our hosting company, A Small Orange, suffered a major hardware failure across their cloud hosting platform last Friday. SAUK was unreachable as a direct result. We were told originally that they would be unable to recover any of our data. I then spent the weekend working on getting a brand new server up and running, with the intention of getting the site back online with the most recent backup that we had access to at that time (from the 14th January 2012). This came online briefly on Monday evening. Just after I got the site back online, ASO got in touch to let us know that they had been able to recover our data, and were in a position to recreate our old server environment on our new one. We have then had to wait for them to do just that, and that's why the board did not reappear until this morning.

Everything should be working as it was before the recent downtime, but there will likely be a few bits and pieces that will need tweaking. The board may have to close intermittently for a short while over the next few days if stuff needs doing. If you do happen to experience any problems using the board over the next few days or so, please post something here or send me a PM so I can take a look at your issue ASAP.

PS: Some info from ASO, for the techies...

Quote:
Hello,

As many of you are aware, the A Small Orange Cloud Hosting platform suffered a prolonged outage that began in the late afternoon (Eastern time) on Friday, January 20th. At the most basic level, this outage was due to a disk failure in the array.

The cloud storage configuration, which is made up of a 16 disk SAN array, can tolerate one drive failure and continue to operate normally. This automatic failover provides the reliability that has made cloud platforms popular. In last nightâ??s incident, however, the SAN array had two drive failures, which is beyond the tolerance fault level of the system.

Numerous attempts have been made to rebuild the array and regain access to the data, but so far those attempts have failed. Due to the fact that two drives were lost, we feel at this point that all data across the entire cloud platform is lost and is likely unrecoverable.

We are continuing to work on the storage system and with its manufacturers to attempt to recover the data, but we wanted to let you know what the situation is at this point.

We are deeply sorry for the downtime you have experienced so far. As a web hosting company, letting customers know that data has been lost and is unrecoverable is the worst thing we ever have to do. We stand behind the products we sell and, in the rare occasions when there are failures that have a potentially damaging affect on our customers, it is a difficult thing to have to share with you. We know you count on us, and we hate falling short of expectations. Your trust has helped us grow into the company we are today and we are deeply grateful.

We know you will have many questions as to why and how this happened, and we will be happy to answer whatever questions we can. Unfortunately, A Small Orange leased the cloud platformâ??s hardware from a third party datacenter for our customersâ?? use. In some cases, we may not be able to fully answer your questions until we are done investigating. The monitoring of the integrity of the hardware system and the disks were the responsibility of the data center we leased the hardware from. We are working to get answers regarding when the disk issues occurred, how they occurred, whether there was any notification of impending disaster and if that took place. While we can tell you what has happened to the system, unfortunately, we don\'t have answers yet as to why it happened. This is as frustrating for us as it likely is for you and we are sorry we do not have more information to share with you at this time.

Our lack of control over our current cloud hosting platform and our inability to monitor certain aspects of it (both due to the fact it was leased hardware), is what led us four months ago to invest significantly in the design of our own cloud platform. Many of you have gotten the emails regarding the migration that was due to begin this upcoming Monday. Our new cloud platform is wholly owned, controlled, and monitored by A Small Orange, and is designed to be fault tolerant to two disk drives vs. the current cloud set up, which is fault tolerant to only one failure. Our new cloud is currently up and working well and new customers are already being placed on it.

While we realize that there is little we can do to correct a situation as catastrophic as this, we do want to do whatever we can to help you recover from this situation. For those customers that would like to stay with A Small Orange, we will:

- provide you with 6 free months on the new cloud
- automatically upgrade you to the next highest package level at no cost
- offer you 2 GB of swap memory at no cost
- offer our R1Soft Bare Metal Backup Solution with multiple points of restore at no cost

You can find details on how to get your new cloud hosting account activated at: http://asmallorange.com/cloud-2012.php

We do realize that for some, this is a catastrophic failure that will result in your choosing to leave A Small Orange. While we hope that we can retain your trust and your business, we understand this won\'t be possible for everyone. If you choose to leave A Small Orange we will, of course, refund you any pre-paid fees as well as pay for your first six months of comparably priced hosting at a new hosting company. You can find details on this at: http://asmallorange.com/cloud-2012.php

We will be reaching out to many of you over the next several days by phone to discuss what has happened and to answer any questions you might have. Customer service continues to be our number one priority at A Small Orange and we are working hard to do whatever we can to make this situation right.

We again extend our deepest apologies.

Very truly yours,

The Team at A Small Orange
--
Douglas Hanna
A Small Orange LLC
Quote:
Early on Saturday, January 21st, we made the extremely difficult decision to let our customers know that the cloud platform we had been using had likely suffered a catastrophic and unrecoverable event that we believed had permanently destroyed our ability to bring up the system and subsequently read or access any customer data. Even though we had planned for a migration to our brand new cloud platform to start just two days later, we (and our customers) were out of luck. It was probably the worst news we could have shared with our customers that morning and it hurt.

While many of our employees, supervisors, and managers were completely focused on doing whatever we could to help our customers get over this catastrophic data loss, our top system administrators continued to work long hours with the manufacturers and engineers of the cloud platform searching for any potential solution. Any lead was worth pursuing and we were determined to do just that until we had exhausted every option available. It took well over a hundred man hours, but we are happy to report that a significant amount of data was able to be obtained from the old cloud and that we do indeed have a full backup of your previous virtual machine.

Again, to be clear - we have a full backup of your data as it existed at the time the cloud went down on Friday evening.

As many people made some changes to their cloud set up after we recreated accounts, we are reaching out individually to customers that already have new cloud accounts to ask them what they want us to do with the recovered data.

We can completely overwrite what you currently have and reproduce your data to its exact state previous to when the cloud went offline (option 1), or we can make this data available to you and you can make changes as you see fit (option 2). Completely overwriting your current virtual machine with the backup is the fastest and easiest way to get your data back, but we want to make sure you have a choice and can do what works best for you.

Please let us know as soon as possible which option you would prefer so that we can help to restore your account as quickly as possible.

Again, we sincerely apologize for the stress and inconveniences that this series of events has caused. It has been a stressful few days for everyone, but we are delighted to let you know about the progress we have made.

As always, if there is anything that we can do to help you or any further questions we can answer, please do not hesitate to let us know.
--
Jen Lepp
Director of Operations
A Small Orange
Reply With Quote
  #2  
Old 25th January 2012, 13:38
GhostOnMagneticTape GhostOnMagneticTape is offline
Member
 
Join Date: Jul 2005
Location: Source
Posts: 5,513

Mood
Mellow

Default Re: SAUK Outage Jan 2012

Many thanks for Anne's and your hard work in getting it back online quickly with no data loss?! You must run back ups quite regularly as the site went down about 15 mins after I posted my last message which is still there or is it some automated web server mirroring system you use?

Anyway kudos Mr Occultus!
Reply With Quote
  #3  
Old 25th January 2012, 16:26
VO2 VO2 is offline
Member
 
Join Date: Sep 2007
Posts: 846
Blog Entries: 2
Default Re: SAUK Outage Jan 2012

Well done you people

Reply With Quote
  #4  
Old 25th January 2012, 18:40
talisman talisman is offline
Member
 
Join Date: Sep 2005
Location: Kent, UK
Posts: 1,052
Blog Entries: 14
Default Re: SAUK Outage Jan 2012

Well done admins for getting everything working again!
Reply With Quote
  #5  
Old 25th January 2012, 19:00
courageous courageous is offline
Member
 
Join Date: Nov 2008
Location: Manchester
Posts: 138

Mood
Busy

Default Re: SAUK Outage Jan 2012

Many thanks guys. This website is a life-line for me. x
Reply With Quote
  #6  
Old 25th January 2012, 19:47
am1000 am1000 is offline
Member
 
Join Date: Oct 2009
Location: Chester
Posts: 2,036
Blog Entries: 26
Default Re: SAUK Outage Jan 2012

Thanks for getting everything back on track
Reply With Quote
  #7  
Old 25th January 2012, 19:56
MrMr MrMr is offline
Member
 
Join Date: Jun 2011
Posts: 1,103
Default Re: SAUK Outage Jan 2012

Yep, thanks to all those involved in sorting it out, must have been a palaver...but one much appreciated
Reply With Quote
  #8  
Old 25th January 2012, 23:03
T T is offline
Member
 
Join Date: Dec 2005
Location: Merseyshire
Posts: 10,797
Blog Entries: 22

Mood
Musical

Default Re: SAUK Outage Jan 2012

Cheers Dan and Anne and all the mods in doing a sterling job in trying to get SAUK back up n running kudos to you all
Reply With Quote
  #9  
Old 25th January 2012, 23:13
Fluppy Fluppy is offline
Member
 
Join Date: Nov 2011
Posts: 643
Blog Entries: 1

Mood
Sad

Default Re: SAUK Outage Jan 2012

Yush
Reply With Quote
  #10  
Old 25th January 2012, 23:44
Occultus Occultus is offline
Administrator
 
Join Date: Apr 2006
Location: SAUK HQ
Posts: 18,330

Mood
Tired

Default Re: SAUK Outage Jan 2012

No worries, guys. I'm looking forward to a decent night of sleep tonight
Reply With Quote
  #11  
Old 26th January 2012, 02:52
humourlessness humourlessness is offline
Banned
 
Join Date: Jan 2012
Location: australia
Posts: 1,108
Blog Entries: 22
Default Re: SAUK Outage Jan 2012

I cant activate my account
Reply With Quote
  #12  
Old 26th January 2012, 03:25
Detox Detox is offline
Member
 
Join Date: Apr 2010
Location: North West England
Posts: 2,980
Default Re: SAUK Outage Jan 2012

Thanks for fixing it.
Reply With Quote
  #13  
Old 26th January 2012, 07:53
humourlessness humourlessness is offline
Banned
 
Join Date: Jan 2012
Location: australia
Posts: 1,108
Blog Entries: 22
Default Re: SAUK Outage Jan 2012

Could someone send me an email again? I didn't receive the first one, therefore can't activate my account.
Reply With Quote
  #14  
Old 26th January 2012, 11:15
Occultus Occultus is offline
Administrator
 
Join Date: Apr 2006
Location: SAUK HQ
Posts: 18,330

Mood
Tired

Default Re: SAUK Outage Jan 2012

I've activated your account for you
Reply With Quote
  #15  
Old 26th January 2012, 21:49
catlover catlover is offline
Member
 
Join Date: May 2008
Location: usa
Posts: 1,211

Mood
Insomnious

Default Re: SAUK Outage Jan 2012

Thanks for all of your efforts. When I first noticed the site being down I feared that it had been closed. Glad to know that I was wrong.
Reply With Quote
  #16  
Old 27th January 2012, 03:32
humourlessness humourlessness is offline
Banned
 
Join Date: Jan 2012
Location: australia
Posts: 1,108
Blog Entries: 22
Default Re: SAUK Outage Jan 2012

Quote:
Originally Posted by Occultus
I've activated your account for you
thanks very much
Reply With Quote
  #17  
Old 11th February 2012, 15:10
Tamara Tamara is offline
Member
 
Join Date: Jan 2012
Location: Berkshire
Posts: 2,361
Default Re: SAUK Outage Jan 2012

Hello can my account be activated please I didnt get an email.
Reply With Quote
  #18  
Old 25th February 2012, 16:04
magoo magoo is offline
Member
 
Join Date: Jan 2011
Location: fife
Posts: 6

Mood
Torn

Default Re: SAUK Outage Jan 2012

well done to all involved would'nt know where to start myself. Pass the hammer i'll fix it
Reply With Quote
  #19  
Old 5th March 2012, 20:45
girlinterrupted girlinterrupted is offline
Member
 
Join Date: Nov 2006
Location: In my own head
Posts: 3,076
Blog Entries: 9
Default Re: SAUK Outage Jan 2012

Is it just me or has SAUK been up and down all day today?
Reply With Quote
  #20  
Old 5th March 2012, 20:46
Detox Detox is offline
Member
 
Join Date: Apr 2010
Location: North West England
Posts: 2,980
Default Re: SAUK Outage Jan 2012

Quote:
Originally Posted by girlinterrupted
Is it just me or has SAUK been up and down all day today?
Its been on the blink for me.
Reply With Quote
  #21  
Old 5th March 2012, 21:07
Alex_James Alex_James is offline
Member
 
Join Date: Dec 2011
Location: Gtr Manchester
Posts: 813
Default Re: SAUK Outage Jan 2012

I was lost last month when the site when down for a week, hope it's not the same problem? :\
Reply With Quote
  #22  
Old 5th March 2012, 21:12
Detox Detox is offline
Member
 
Join Date: Apr 2010
Location: North West England
Posts: 2,980
Default Re: SAUK Outage Jan 2012

Yeah its doing it to me.
SAUK is becoming self aware, it can't put up with all us unsocial people anymore!
Reply With Quote
  #23  
Old 5th March 2012, 21:17
girlinterrupted girlinterrupted is offline
Member
 
Join Date: Nov 2006
Location: In my own head
Posts: 3,076
Blog Entries: 9
Default Re: SAUK Outage Jan 2012

Quote:
Originally Posted by Detox
Yeah its doing it to me.
SAUK is becoming self aware, it can't put up with all us unsocial people anymore!
That or SA is infectious. Maybe it can`t handle crowds and is booting us out periodically in the hopes we get the message.
Reply With Quote
  #24  
Old 5th March 2012, 21:17
Detox Detox is offline
Member
 
Join Date: Apr 2010
Location: North West England
Posts: 2,980
Default Re: SAUK Outage Jan 2012

Reply With Quote
Reply


Forum Jump


All times are GMT +1. The time now is 02:43.


SAUK Award
Logo designed by abc
Powered by vBulletin
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.