[an error occurred while processing this directive]
performed by Group 6
Running relational queries on Hadoop-based systems is an attempt of bringing the known relational model into the massively parallel world. However, the performance of these systems varies widely. Your task is to perform a comparision between two systems in this space, Actian Vortex and Cloudera Impala. The target is a 10TB implementation of the standard relational TPC-H benchmark on the SCILENS cluster of CWI.
This analysis will be using Actian Vortex and Cloudera Impala.
Must be generated using the TPC-H dbgen utility.