- Oracle Java 7+ (USB)
- Spark 1.5.1 (USB)
- Sparkling Water 1.5.6 (USB)
- Chicago Crime dataset (USB)
- Chicago Census dataset (USB)
- Chicago Weather dataset (USB)
- H2O python - to be installed (USB)
- H2O package (USB)
- Python 2.7 (pre-installed)
- Numpy 1.9.2 (pre-installed)
- $ pip install requests
- $ pip install tabulate
- Go to the Sparkling Water directory
- Build Sparkling Water ( creates python EGG ):
./gradlew build -x test - Run this line -
IPYTHON_OPTS="notebook" bin/pysparklingand locate the desired notebook file