<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Hadoop HA Configuration</title>
	<atom:link href="http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/</link>
	<description>Hadoop and Cloudera&#039;s Products and Services</description>
	<lastBuildDate>Wed, 23 May 2012 10:20:43 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: ???????HBase (1) &#124; Brizzz.com &#8211; ???</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-15915</link>
		<dc:creator>???????HBase (1) &#124; Brizzz.com &#8211; ???</dc:creator>
		<pubDate>Wed, 20 Apr 2011 08:03:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-15915</guid>
		<description>[...] used&#160;DRBD and&#160;Heartbeat to have&#160;HA for the Hadoop NameNode, because this was the last Single Point of Failure (SPOF) in our [...]</description>
		<content:encoded><![CDATA[<p>[...] used&#160;DRBD and&#160;Heartbeat to have&#160;HA for the Hadoop NameNode, because this was the last Single Point of Failure (SPOF) in our [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nishat akhtar</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-13492</link>
		<dc:creator>nishat akhtar</dc:creator>
		<pubDate>Sun, 05 Sep 2010 19:13:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-13492</guid>
		<description>I am trying to set up Hive on my Hadoop installation. I am able to start Hive after specifying the Hadoop path, but I am unable to create any tables; every attempt fails with a metadata error. Can you please help me find a solution?</description>
		<content:encoded><![CDATA[<p>I am trying to set up Hive on my Hadoop installation. I am able to start Hive after specifying the Hadoop path, but I am unable to create any tables; every attempt fails with a metadata error. Can you please help me find a solution?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PATRICK</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-13234</link>
		<dc:creator>PATRICK</dc:creator>
		<pubDate>Sat, 21 Aug 2010 17:11:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-13234</guid>
		<description>Charlie, et al.,

Charlie - you are right, that appears to be an error in the blog. You do not want to set dfs.data.dir to a DRBD partition. I&#039;m also glad that you resolved the split-brain issue.

That said, we want to clarify our position: we neither officially support nor fully endorse this method of doing NameNode HA. DRBD is hard to get right, and you can easily corrupt your fsimage file with a subtle mistake. Furthermore, the backup NameNode will start up and stay in safe mode (read-only) until it has processed the edit log to reconstitute the up-to-date fsimage in memory. This process can take several minutes, sometimes over an hour if you let the edit log grow unchecked. You can alleviate this by having the SecondaryNameNode checkpoint more often and by backing up the checkpoints, so that you have a fairly recent image as a fallback in case of catastrophic failure.

The community is hard at work on building high availability natively into HDFS. Until then, our recommendation is to design your application around HDFS&#039;s availability semantics. This might include using Flume for reliable delivery, or having mirrored HDFS systems.</description>
		<content:encoded><![CDATA[<p>Charlie, et al.,</p>
<p>Charlie &#8211; you are right, that appears to be an error in the blog. You do not want to set dfs.data.dir to a DRBD partition. I&#8217;m also glad that you resolved the split-brain issue.</p>
<p>That said, we want to clarify our position: we neither officially support nor fully endorse this method of doing NameNode HA. DRBD is hard to get right, and you can easily corrupt your fsimage file with a subtle mistake. Furthermore, the backup NameNode will start up and stay in safe mode (read-only) until it has processed the edit log to reconstitute the up-to-date fsimage in memory. This process can take several minutes, sometimes over an hour if you let the edit log grow unchecked. You can alleviate this by having the SecondaryNameNode checkpoint more often and by backing up the checkpoints, so that you have a fairly recent image as a fallback in case of catastrophic failure.</p>
<p>The community is hard at work on building high availability natively into HDFS. Until then, our recommendation is to design your application around HDFS&#8217;s availability semantics. This might include using Flume for reliable delivery, or having mirrored HDFS systems.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Charlie</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-13218</link>
		<dc:creator>Charlie</dc:creator>
		<pubDate>Fri, 20 Aug 2010 14:46:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-13218</guid>
		<description>Scrap question number 2; it was a DRBD split-brain that stopped the syncing. :-/</description>
		<content:encoded><![CDATA[<p>Scrap question number 2; it was a DRBD split-brain that stopped the syncing. :-/</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Charlie</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-13217</link>
		<dc:creator>Charlie</dc:creator>
		<pubDate>Fri, 20 Aug 2010 13:29:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-13217</guid>
		<description>Firstly, thanks for the post; it&#039;s very helpful, but I have a couple of questions:

1. In the few configuration options you specified, you say that dfs.data.dir should be set to a directory on the DRBD partition. As I understand it, all this option does is tell the DataNodes where to store their blocks; is this right? If so, the masters/NameNodes do not use this option, and it should therefore not be set to a directory on the DRBD partition.

2. I have set this up on 4 machines for testing: 2 NameNodes with DRBD/Heartbeat and 2 nodes, one of which is also acting as the SecondaryNameNode. Everything works fine apart from one very big problem: when the NameNode is switched over from one master to the other, the data stored in HDFS is no longer there. I therefore seem to have 2 different HDFS instances; I can create directories in both, and they only appear while that particular master is running. Any ideas why this would happen?</description>
		<content:encoded><![CDATA[<p>Firstly, thanks for the post; it&#8217;s very helpful, but I have a couple of questions:</p>
<p>1. In the few configuration options you specified, you say that dfs.data.dir should be set to a directory on the DRBD partition. As I understand it, all this option does is tell the DataNodes where to store their blocks; is this right? If so, the masters/NameNodes do not use this option, and it should therefore not be set to a directory on the DRBD partition.</p>
<p>2. I have set this up on 4 machines for testing: 2 NameNodes with DRBD/Heartbeat and 2 nodes, one of which is also acting as the SecondaryNameNode. Everything works fine apart from one very big problem: when the NameNode is switched over from one master to the other, the data stored in HDFS is no longer there. I therefore seem to have 2 different HDFS instances; I can create directories in both, and they only appear while that particular master is running. Any ideas why this would happen?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hadoop/HBase automated deployment using Puppet at hstack</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-12824</link>
		<dc:creator>Hadoop/HBase automated deployment using Puppet at hstack</dc:creator>
		<pubDate>Thu, 01 Jul 2010 09:34:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-12824</guid>
		<description>[...] standalone puppet module to configure the Hadoop NameNode in High-Availability mode via DRBD, heartbeat and mon. For more details on this recipe check out the cloudera blog post on this topic. [...]</description>
		<content:encoded><![CDATA[<p>[...] standalone puppet module to configure the Hadoop NameNode in High-Availability mode via DRBD, heartbeat and mon. For more details on this recipe check out the cloudera blog post on this topic. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Carlos</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-12705</link>
		<dc:creator>Carlos</dc:creator>
		<pubDate>Wed, 09 Jun 2010 14:09:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-12705</guid>
		<description>I am confused about the picture in this post.
We are trying to set up our NameNode with HA,
so we want to use DRBD + Heartbeat to give the NameNode a failover system.
But in the picture, it looks like the backup NameNode has its own DataNode cluster. Why?
Do we need two clusters for the HA configuration?</description>
		<content:encoded><![CDATA[<p>I am confused about the picture in this post.<br />
We are trying to set up our NameNode with HA,<br />
so we want to use DRBD + Heartbeat to give the NameNode a failover system.<br />
But in the picture, it looks like the backup NameNode has its own DataNode cluster. Why?<br />
Do we need two clusters for the HA configuration?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sean</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-11914</link>
		<dc:creator>Sean</dc:creator>
		<pubDate>Tue, 18 May 2010 14:14:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-11914</guid>
		<description>Thanks for posting! Very helpful. I was able to get almost everything working. Just one somewhat minor issue when failing over: DRBD can&#039;t release its resource, resulting in what Heartbeat calls an ungraceful shutdown, which forces a reboot. Master2 comes up fine, though.</description>
		<content:encoded><![CDATA[<p>Thanks for posting! Very helpful. I was able to get almost everything working. Just one somewhat minor issue when failing over: DRBD can&#8217;t release its resource, resulting in what Heartbeat calls an ungraceful shutdown, which forces a reboot. Master2 comes up fine, though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff</title>
		<link>http://www.cloudera.com/blog/2009/07/hadoop-ha-configuration/comment-page-1/#comment-11612</link>
		<dc:creator>Jeff</dc:creator>
		<pubDate>Tue, 30 Mar 2010 17:46:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=987#comment-11612</guid>
		<description>The one thing missing from this post is the actual Hadoop config info for the NameNode and backup NameNode. The post simply says you used the Cloudera configuration tool to generate a custom config... As we all know, this tool has been unavailable for quite some time. So a little more insight into the actual hdfs/mapred configs would be the icing on the cake that this post already is.</description>
		<content:encoded><![CDATA[<p>The one thing missing from this post is the actual Hadoop config info for the NameNode and backup NameNode. The post simply says you used the Cloudera configuration tool to generate a custom config&#8230; As we all know, this tool has been unavailable for quite some time. So a little more insight into the actual hdfs/mapred configs would be the icing on the cake that this post already is.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

