[an error occurred while processing this directive]
Project F: SQL-on-Hadoop BakeOff 2

performed by Group 6

Running relational queries on Hadoop-based systems is an attempt of bringing the known relational model into the massively parallel world. However, the performance of these systems varies widely. Your task is to perform a comparision between two systems in this space, Actian Vortex and Cloudera Impala. The target is a 10TB implementation of the standard relational TPC-H benchmark on the SCILENS cluster of CWI.

Idea
Technology

This analysis will be using Actian Vortex and Cloudera Impala.

Data

Must be generated using the TPC-H dbgen utility.

Literature
[an error occurred while processing this directive]