GitHub - Jarvin-M/Water-usage-prediction-data-pipeline

Authors

Chriss Santi - Docker and Environment setup
Jarvin Mutatiina - PySpark

Description

This project is water usage prediction pipeline using Apache Spark and Docker

Techs

Computing

Apache Spark

Files System

Hadoop HDFS

Database

Cassandra

Browser

Hue

Web

Flask
Html

Deployment

Local: Docker Compose (StandAlone for Hadoop and Spark)
Cloud: Docker Swarm or Another technology: yet to be define

Build the project

Local:

Still in development !!!!!

We provide a makefile located in the /docker/compose

First run the docker-compose file located in /docker/compose
your app files must be located in the directory app
build your submit image by running ./buildSubmit.sh located in /docker/scripts
run the submit image by runnnig the script ./submitJob.sh located in /docker/scripts

Cloud

TODO:

Well manage the environments variables:
(docker/scripts)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
app		app
docker		docker
resources		resources
slides/images		slides/images
web		web
.gitignore		.gitignore
README.md		README.md
issue.md		issue.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Authors

Description

Techs

Computing

Files System

Database

Browser

Web

Deployment

Build the project

Local:

Still in development !!!!!

We provide a makefile located in the /docker/compose

Cloud

TODO:

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Authors

Description

Techs

Computing

Files System

Database

Browser

Web

Deployment

Build the project

Local:

Still in development !!!!!

We provide a makefile located in the /docker/compose

Cloud

TODO:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages