<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Hadoop World: Sqoop &#8211; Database Import for Hadoop</title>
	<atom:link href="http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/</link>
	<description>Hadoop and Cloudera&#039;s Products and Services</description>
	<lastBuildDate>Fri, 10 Feb 2012 20:11:24 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Arvind Prabhakar</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-16674</link>
		<dc:creator>Arvind Prabhakar</dc:creator>
		<pubDate>Fri, 18 Nov 2011 16:54:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-16674</guid>
		<description>Hi Sathi,

Sqoop does allow you to do incremental imports based on time-stamp columns or numeric columns. During such imports, Sqoop populates the data files in the same target directory using names that start from where the previous job may have left off. That way the target directory is always current with the latest changes.

Regarding the storage of a transactional table that is changing everyday, one way to use it for ETL use-cases would be to load it up in HBase. Storing it in flat files may not be a good idea since that could lead to duplication if previously imported data is updated.

Thanks,
Arvind Prabhakar</description>
		<content:encoded><![CDATA[<p>Hi Sathi,</p>
<p>Sqoop does allow you to do incremental imports based on time-stamp columns or numeric columns. During such imports, Sqoop populates the data files in the same target directory using names that start from where the previous job may have left off. That way the target directory is always current with the latest changes.</p>
<p>Regarding the storage of a transactional table that is changing everyday, one way to use it for ETL use-cases would be to load it up in HBase. Storing it in flat files may not be a good idea since that could lead to duplication if previously imported data is updated.</p>
<p>Thanks,<br />
Arvind Prabhakar</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sathi</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-16655</link>
		<dc:creator>sathi</dc:creator>
		<pubDate>Wed, 16 Nov 2011 07:28:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-16655</guid>
		<description>Does the current version of  sqoop have a way to capture only delta?if I have initial load of table A under /original ,and then I import incremental under /delta ,is there a better way to store a combined data under /latest which is basically merge of /original and /delta
In a standard ETL pipeline,If I am storing a transaction table which is changing everyday,what is the best way to store this evolving data within hdfs?</description>
		<content:encoded><![CDATA[<p>Does the current version of  sqoop have a way to capture only delta?if I have initial load of table A under /original ,and then I import incremental under /delta ,is there a better way to store a combined data under /latest which is basically merge of /original and /delta<br />
In a standard ETL pipeline,If I am storing a transaction table which is changing everyday,what is the best way to store this evolving data within hdfs?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Hammerbacher</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-13121</link>
		<dc:creator>Jeff Hammerbacher</dc:creator>
		<pubDate>Wed, 11 Aug 2010 17:15:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-13121</guid>
		<description>Hey Arushi,

Unsqoop is now available in CDH3b2 (see http://www.cloudera.com/blog/2010/07/what&#146;s-new-in-cdh3b2-sqoop for more details), so go and get it! Additionally, we have added a Sqoop task to Oozie (http://www.cloudera.com/blog/2010/07/whats-new-in-cdh3-b2-oozie), so you can tie the import, processing, and export phases together in a single Oozie workflow.

For further questions on Sqoop, check out the Sqoop mailing list at https://groups.google.com/a/cloudera.org/group/sqoop-user.

Regards,
Jeff</description>
		<content:encoded><![CDATA[<p>Hey Arushi,</p>
<p>Unsqoop is now available in CDH3b2 (see <a href="http://www.cloudera.com/blog/2010/07/what&#8217;s-new-in-cdh3b2-sqoop" rel="nofollow">http://www.cloudera.com/blog/2010/07/what&#8217;s-new-in-cdh3b2-sqoop</a> for more details), so go and get it! Additionally, we have added a Sqoop task to Oozie (<a href="http://www.cloudera.com/blog/2010/07/whats-new-in-cdh3-b2-oozie" rel="nofollow">http://www.cloudera.com/blog/2010/07/whats-new-in-cdh3-b2-oozie</a>), so you can tie the import, processing, and export phases together in a single Oozie workflow.</p>
<p>For further questions on Sqoop, check out the Sqoop mailing list at <a href="https://groups.google.com/a/cloudera.org/group/sqoop-user" rel="nofollow">https://groups.google.com/a/cloudera.org/group/sqoop-user</a>.</p>
<p>Regards,<br />
Jeff</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Arushi</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-13118</link>
		<dc:creator>Arushi</dc:creator>
		<pubDate>Wed, 11 Aug 2010 09:38:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-13118</guid>
		<description>Any updates on unsqoop?

I wanted to get data using hadoop, modify it using mapreduce and put it back into the mysql.

Is it possible with latest sqoop or shall we wait for unsqoop to send data back to MySql?</description>
		<content:encoded><![CDATA[<p>Any updates on unsqoop?</p>
<p>I wanted to get data using hadoop, modify it using mapreduce and put it back into the mysql.</p>
<p>Is it possible with latest sqoop or shall we wait for unsqoop to send data back to MySql?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Oded</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-8191</link>
		<dc:creator>Oded</dc:creator>
		<pubDate>Mon, 14 Dec 2009 10:17:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-8191</guid>
		<description>any updates re unsqoop?</description>
		<content:encoded><![CDATA[<p>any updates re unsqoop?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daily Digest for December 11th</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-8100</link>
		<dc:creator>Daily Digest for December 11th</dc:creator>
		<pubDate>Fri, 11 Dec 2009 11:43:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-8100</guid>
		<description>[...] Shared Hadoop World: Sqoop &#8211; Database Import for Hadoop. [...]</description>
		<content:encoded><![CDATA[<p>[...] Shared Hadoop World: Sqoop &#8211; Database Import for Hadoop. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chetan Conikee</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-8084</link>
		<dc:creator>Chetan Conikee</dc:creator>
		<pubDate>Fri, 11 Dec 2009 00:08:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-8084</guid>
		<description>Typically Netezza and TeraData are bundled  with their own proprietary OBDC bindings [and] ship with a JAR file which essentially is a ODBC-JDBC bridge..
So if you have the specified bridge jar you should be able to Sqoop from/to Netezza or TeraData</description>
		<content:encoded><![CDATA[<p>Typically Netezza and TeraData are bundled  with their own proprietary OBDC bindings [and] ship with a JAR file which essentially is a ODBC-JDBC bridge..<br />
So if you have the specified bridge jar you should be able to Sqoop from/to Netezza or TeraData</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: aaron</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-8075</link>
		<dc:creator>aaron</dc:creator>
		<pubDate>Thu, 10 Dec 2009 19:53:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-8075</guid>
		<description>I&#039;m not certain. Sqoop uses JDBC and generates standard SQL, so it can connect to many database vendors out-of-the-box. Some vendors require nonstandard SQL, or implement JDBC drivers slightly different than the norm, for which we have to write special-case code.

We haven&#039;t explicitly tested it with Teradata. I&#039;d love to hear feedback as to how that experiment works out.</description>
		<content:encoded><![CDATA[<p>I&#8217;m not certain. Sqoop uses JDBC and generates standard SQL, so it can connect to many database vendors out-of-the-box. Some vendors require nonstandard SQL, or implement JDBC drivers slightly different than the norm, for which we have to write special-case code.</p>
<p>We haven&#8217;t explicitly tested it with Teradata. I&#8217;d love to hear feedback as to how that experiment works out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jing</title>
		<link>http://www.cloudera.com/blog/2009/12/hadoop-world-sqoop-database-import-for-hadoop/comment-page-1/#comment-8073</link>
		<dc:creator>Jing</dc:creator>
		<pubDate>Thu, 10 Dec 2009 19:39:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.cloudera.com/blog/?p=1694#comment-8073</guid>
		<description>Does Sqoop work with Teradata?</description>
		<content:encoded><![CDATA[<p>Does Sqoop work with Teradata?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

