<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>omelett.es &#187; DB</title>
	<atom:link href="http://omelett.es/journal/tag/db/feed/" rel="self" type="application/rss+xml" />
	<link>http://omelett.es</link>
	<description>The journal of team omelett.es</description>
	<lastBuildDate>Wed, 28 Jul 2010 16:36:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>DB Recovery &#8211; more haste, less speed</title>
		<link>http://omelett.es/journal/2009/05/db-recovery-more-haste-less-speed/</link>
		<comments>http://omelett.es/journal/2009/05/db-recovery-more-haste-less-speed/#comments</comments>
		<pubDate>Fri, 15 May 2009 07:49:40 +0000</pubDate>
		<dc:creator>Jake</dc:creator>
				<category><![CDATA[Tactile CRM]]></category>
		<category><![CDATA[DB]]></category>
		<category><![CDATA[DR]]></category>
		<category><![CDATA[Recovery]]></category>

		<guid isPermaLink="false">http://omelett.es/journal/?p=585</guid>
		<description><![CDATA[How our DR strategy worked out for real with zero data loss]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fomelett.es%2Fjournal%2F2009%2F05%2Fdb-recovery-more-haste-less-speed%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fomelett.es%2Fjournal%2F2009%2F05%2Fdb-recovery-more-haste-less-speed%2F&amp;source=omelettes&amp;style=normal&amp;service=bit.ly&amp;service_api=R_b3beb026b01fc583f65a465456cc4dd0" height="61" width="50" /><br />
			</a>
		</div>
<p>Yesterday we had one of our worst case business continuity issues &#8211; <strong>the good news, we recovered everything with NO data loss</strong>, the bad news, Tactile CRM was down for a few hours.</p>
<p>Our services are run on a load balanced web farm and backed up daily off-site and every 10 minutes on-site. Yesterday we had an issue with our main database and backups which meant that if we had had to restore from backups about 6 hours worth of users data would have been lost. Fortunately due to the excellent work of our team and hosting provider we worked through our backup and continuity process perfectly and nothing was lost. Here&#8217;s what happened:</p>
<ol>
<li>Our database server hung and on reboot said there were no drives in the RAID array present.</li>
<li>We immediately brought up our backup server and started restoring the database</li>
<li>We drove the main database server direct to the supplier for diagnostics (we thought it was suspicious that all the drives had failed &#8211; we were correct)</li>
<li>Whilst the database was being restore we got the good news we wanted &#8211; it was the backpane in the server and in 10 minutes a new one was installed, tested and verified.</li>
<li>The database server was driven back to the data centre</li>
<li>The database server was installed again and everything started working</li>
</ol>
<p>If we hadn&#8217;t taken the database off-site for diagnostics and restored from backups we&#8217;d have had the site down for about an hour less time but lost half a days worth of data. In the situation we thought the action we took was best for everyone, we hope you agree.</p>
<p><img class="alignnone size-full wp-image-586" title="db" src="http://omelett.es/wp-content/uploads/2009/05/db.jpg" alt="db" width="446" height="223" /></p>
<p>Here&#8217;s what we doing now to make sure this doesn&#8217;t happen again:</p>
<ol>
<li>Monitoring the database server more closely for the next week</li>
<li>Moving forward our plans to implement realtime failover/replication of the database</li>
<li>Building another database server to add to our cluster with less of a lag between backups</li>
</ol>
<p>If you have any questions at all about what happened, please drop us an email to support [at] tactile crm .com.</p>
<p>Photo Credit: <a href="http://www.flickr.com/photos/gi/">TheAlieness GiselaGiardino²³&#8217;s</a></p>
]]></content:encoded>
			<wfw:commentRss>http://omelett.es/journal/2009/05/db-recovery-more-haste-less-speed/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
