<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Small Files Problem</title>
	<atom:link href="http://www.cloudera.com/blog/2009/02/the-small-files-problem/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/</link>
	<description>Hadoop and Cloudera&#039;s Products and Services</description>
	<lastBuildDate>Wed, 23 May 2012 10:20:43 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: TreeMan</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-17418</link>
		<dc:creator>TreeMan</dc:creator>
		<pubDate>Mon, 19 Mar 2012 11:30:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-17418</guid>
		<description>Beginning to understand this. A good document!</description>
		<content:encoded><![CDATA[<p>Beginning to understand this. A good document!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scribe日志收集系统介绍 &#124; 稀饭的国度</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-17238</link>
		<dc:creator>Scribe日志收集系统介绍 &#124; 稀饭的国度</dc:creator>
		<pubDate>Mon, 13 Feb 2012 14:17:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-17238</guid>
		<description>[...] (3) The HDFS small files problem: http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</description>
		<content:encoded><![CDATA[<p>[...] (3) The HDFS small files problem: http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Summer &#38; cicada~&#187; Blog Archive &#187; Hadoop Archive</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-16648</link>
		<dc:creator>Summer &#38; cicada~&#187; Blog Archive &#187; Hadoop Archive</dc:creator>
		<pubDate>Mon, 14 Nov 2011 13:35:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-16648</guid>
		<description>[...] The Small Files Problem [...]</description>
		<content:encoded><![CDATA[<p>[...] The Small Files Problem [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ???HDFS?????????? &#187; Allen&#39;s World</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-16614</link>
		<dc:creator>???HDFS?????????? &#187; Allen&#39;s World</dc:creator>
		<pubDate>Thu, 03 Nov 2011 10:47:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-16614</guid>
		<description>[...] http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</description>
		<content:encoded><![CDATA[<p>[...] <a href="http://www.cloudera.com/blog/2009/02/the-small-files-problem/" rel="nofollow">http://www.cloudera.com/blog/2009/02/the-small-files-problem/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: HDFS?????????? &#124; ??IT??</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-16112</link>
		<dc:creator>HDFS?????????? &#124; ??IT??</dc:creator>
		<pubDate>Thu, 30 Jun 2011 05:11:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-16112</guid>
		<description>[...] http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</description>
		<content:encoded><![CDATA[<p>[...] <a href="http://www.cloudera.com/blog/2009/02/the-small-files-problem/" rel="nofollow">http://www.cloudera.com/blog/2009/02/the-small-files-problem/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: HDFS?????????? &#124; ????</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-15908</link>
		<dc:creator>HDFS?????????? &#124; ????</dc:creator>
		<pubDate>Mon, 18 Apr 2011 08:08:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-15908</guid>
		<description>[...] http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</description>
		<content:encoded><![CDATA[<p>[...] <a href="http://www.cloudera.com/blog/2009/02/the-small-files-problem/" rel="nofollow">http://www.cloudera.com/blog/2009/02/the-small-files-problem/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scribe???????? &#124; ????</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-15836</link>
		<dc:creator>Scribe???????? &#124; ????</dc:creator>
		<pubDate>Mon, 04 Apr 2011 15:38:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-15836</guid>
		<description>[...] (3) The HDFS small files problem: http://www.cloudera.com/blog/2009/02/the-small-files-problem/ [...]</description>
		<content:encoded><![CDATA[<p>[...] (3) The HDFS small files problem: <a href="http://www.cloudera.com/blog/2009/02/the-small-files-problem/" rel="nofollow">http://www.cloudera.com/blog/2009/02/the-small-files-problem/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeans zhong</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-15692</link>
		<dc:creator>Jeans zhong</dc:creator>
		<pubDate>Tue, 08 Mar 2011 17:17:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-15692</guid>
		<description>Hi Tom, can you explain the following sentence in more detail, or point me to some reference material?
The sentence is:
Reading through small files normally causes lots of seeks and lots of hopping from datanode to datanode to retrieve each small file, all of which is an inefficient data access pattern.
Why does this cause seeks and hopping? I would think the client only needs to read the small file&#039;s block from the best datanode. Why do you say that? What is your reasoning?
Please help me.</description>
		<content:encoded><![CDATA[<p>Hi Tom, can you explain the following sentence in more detail, or point me to some reference material?<br />
The sentence is:<br />
Reading through small files normally causes lots of seeks and lots of hopping from datanode to datanode to retrieve each small file, all of which is an inefficient data access pattern.<br />
Why does this cause seeks and hopping? I would think the client only needs to read the small file&#8217;s block from the best datanode. Why do you say that? What is your reasoning?<br />
Please help me.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Q Ethan</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-15436</link>
		<dc:creator>Q Ethan</dc:creator>
		<pubDate>Thu, 10 Feb 2011 21:21:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-15436</guid>
		<description>If SequenceFiles are your way to address the small files problem, you may be interested in forqlift:

http://www.exmachinatech.net/go/forqlift/

forqlift is a command-line tool that makes it easy to import/export small files to/from SequenceFiles. (It&#039;s also free and open source.)</description>
		<content:encoded><![CDATA[<p>If SequenceFiles are your way to address the small files problem, you may be interested in forqlift:</p>
<p><a href="http://www.exmachinatech.net/go/forqlift/" rel="nofollow">http://www.exmachinatech.net/go/forqlift/</a></p>
<p>forqlift is a command-line tool that makes it easy to import/export small files to/from SequenceFiles. (It&#8217;s also free and open source.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hadoop binary files processing entroduced by image duplicates finder &#171; eldad levy&#39;s playground</title>
		<link>http://www.cloudera.com/blog/2009/02/the-small-files-problem/comment-page-1/#comment-15404</link>
		<dc:creator>Hadoop binary files processing entroduced by image duplicates finder &#171; eldad levy&#39;s playground</dc:creator>
		<pubDate>Sat, 05 Feb 2011 09:26:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=239#comment-15404</guid>
		<description>[...] about the small files problem, and some conclusions of a project that is dealing with huge amount of [...]</description>
		<content:encoded><![CDATA[<p>[...] about the small files problem, and some conclusions of a project that is dealing with huge amount of [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

