Practicing Cloudera Developer Exam CCA175 Check out wiki https://github.com/vukelan/cca175/wiki
- load data from HDFS
- store results back to HDFS
- join disparate datasets together
- calculate aggregate statistics
- calculate average or sum
- filter data into a smaller dataset
- write a query to produce ranked/sorted data ...