El libro está inglés y tiene los siguientes contenidos:
• An overview of Hadoop’s core technologies including Hadoop Distributed File Systems (HDFS), MapReduce, YARN, and Spark
• Provides sample code and links to relevant tutorials
• Reviews the advantages of different databases, in terms of speed, scalability, security, configurability, text indexing, SQL support, and more
• Covers which databases are better for different purposes, such as transactional, relational analytics, sparse data, multi-tenant support, and more
• Gives overviews, sample code and links to tutorials for common databases used with Hadoop such as MongoDB, Cassandra, HBase, Hive, Shark, Blur, Accumulo, Memcached, Solr, and Giraph