Category Archives: hive

This is a post about configuring Hadoop, Hive and HBase on MAC as a single node installation

Overview

  • What is hive?: Hive is a data warehousing infrastructure based on Hadoop
  • What is Hbase?: Its a distributed, versioned, column-oriented NoSQL data store, modeled after Googles Bigtable. used to host very large tables — billions of rows *times* millions of columns.
  • What is hadoop?: Hadoop provides massive scale out and fault tolerance capabilities for data storage and processing on commodity hardware using map-reduce programming paradigm.

Continue reading