Hadoop: The Definitive Guide (4th Ed.)

by Tom White

The classic comprehensive guide to designing, building, and maintaining Hadoop applications.

Hadoop Application Architectures

by Mark Grover, Ted Malaska, Jonathan Seidman & Gwen Shapira

The "bible" for architecting end-to-end distributed applications with Hadoop.

Getting Started with Impala

by John Russell

Learn how to write, tune, and port SQL queries using Impala, and more.

Advanced Analytics with Spark

by Uri Laserson, Sean Owen, Sandy Ryza & Josh Wills

Recipes for using Apache Spark™ to solve a variety of advanced analytics problems.


Hadoop Security

by Ben Spivey & Joey Echeverria

Learn how to protect Hadoop clusters and data from unauthorized access in a centralized way.


Hadoop Operations

by Eric Sammer

Everything you need to know about deploying and running Hadoop systems in production.


Apache Sqoop Cookbook

by Kathleen Ting & Jarek Cecho

Learn how to deploy and apply Sqoop for batch data ingest into your Hadoop cluster.

HBase: The Definitive Guide

by Lars George

The definitive resource for developing and running distributed apps on HBase.


Using Flume

by Hari Shreedharan

Learn how to use Flume for streaming ingest and configure and maintain a Flume cluster.


Python for Data Analysis

by Wes McKinney

All the nuts & bolts of manipulating, processing, cleaning, and crunching data in Python.

Data Analytics with Hadoop

by Benjamin Bengfort & Jenny Kim

A practical guide to the world of clustered computing and analytics with Hadoop.

Architecting HBase Applications

by Jean-Marc Spaggiari & Kevin O'Dell

The ideal book for a deep dive into use cases, features, and troubleshooting.

Coming Soon:

Deploying Hadoop in the Cloud

by Bill Havanki

The definitive reference for running Hadoop in the public cloud.

Spark Cookbook

by Neelesh Salian

Recipes for successfully running Spark applications in production.
