Skip to content

Jarvin-M/Water-usage-prediction-data-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Authors

  • Chriss Santi - Docker and Environment setup
  • Jarvin Mutatiina - PySpark

Description

This project is water usage prediction pipeline using Apache Spark and Docker

Techs

Computing

  • Apache Spark

Files System

  • Hadoop HDFS

Database

  • Cassandra

Browser

  • Hue

Web

  • Flask
  • Html

Deployment

  • Local: Docker Compose (StandAlone for Hadoop and Spark)
  • Cloud: Docker Swarm or Another technology: yet to be define

Build the project

Local:

Still in development !!!!!

We provide a makefile located in the /docker/compose

  • First run the docker-compose file located in /docker/compose
  • your app files must be located in the directory app
  • build your submit image by running ./buildSubmit.sh located in /docker/scripts
  • run the submit image by runnnig the script ./submitJob.sh located in /docker/scripts

Cloud

TODO:

Well manage the environments variables:
(docker/scripts)

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors