MapReduce-2 Application for processing raw data file and counting words
Technologies: Cloudera Hadoop CDH5,Maven, IntelliJ IDEA
###NOTE:MRv1 & MRv2 compatibility###
In general, the previous MapReduce runtime (aka MRv1) has been reused and no major surgery has been conducted on it.
Therefore, MRv2 is able to ensure satisfactory compatibility with MRv1 applications.
However, due to some improvements and code refactorings, a few APIs have been rendered backward-incompatible.
NEW API in package: org.apache.hadoop.mapreduce;
OLD API in package: org.apache.hadoop.mapred;
- Java-1.7
- Maven-2/3
- git
- Hadoop-2.3.0
$ git clone https://github.com/mehikmat/BigSchool.git
$ cd BigSchool
$ git checkout mr2-app
$ mvn clean package
$ yarn jar target/BigSchoolMapReduce-1.0.jar input/input.txt output/output.txt
OR
$ sh runner.sh
In case you get an error of type "Not valid JAR", check the jar path. It might be different from one plateform to another with mvn.
Browse http://localhost:8088 for job status