##Big Data and Cloud 1. nova OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more. Project Source: https://github.com/openstack/nova Project Homepage: http://www.openstack.org/ 1. boto Boto is a Python package that provides interfaces to Amazon Web Services. Project Source: https://github.com/boto/boto Project Documentation: http://docs.pythonboto.org/en/latest/ 1. aws-cli Universal Command Line Interface for Amazon Web Services Project Source: https://github.com/aws/aws-cli 1. mrjob Run MapReduce jobs on Hadoop or Amazon Web Services. Project Source: https://github.com/Yelp/mrjob Project Documentation: https://pythonhosted.org/mrjob/ 1. s3cmd Command line tool for managing Amazon S3 and CloudFront services. Project Source: https://github.com/s3tools/s3cmd Project Homepage: http://s3tools.org/s3cmd 1. dumbo Python module that allows one to easily write and run Hadoop programs. Project Source: https://github.com/klbostee/dumbo Project Homepage: http://klbostee.github.io/dumbo/ 1. dpark DPark is a Python clone of Spark, MapReduce alike computing framework supporting iterative computation. Project Source: https://github.com/douban/dpark Project Documentation (Chinese): https://github.com/jackfengji/test_pro/wiki 1. Hue Hue is an open source Web UI for doing big data with Hadoop. Project Source: https://github.com/cloudera/hue Project Homepage: http://gethue.com/ 1. falcon Falcon is a high-performance Python framework for building cloud APIs and web app backends. Project Source: https://github.com/racker/falcon Project Homepage: http://falconframework.org/ 1. gae-boilerplate Google App Engine Boilerplate gets your project off the ground quickly using the Google App Engine platform. Project Source: https://github.com/coto/gae-boilerplate Project Homepage: http://appengine.beecoss.com/ 1. goagent a gae proxy Project Source: https://github.com/goagent/goagent Project Homepage: https://code.google.com/p/goagent/ 1. libcloud Apache Libcloud is a Python library which hides differences between different cloud provider APIs and allows you to manage different cloud resources through a unified and easy to use API. Project Source: https://github.com/apache/libcloud Project Documentation: http://libcloud.apache.org/ 1. Fabulous Deploy django apps to Amazon EC2 with ONE command. Project Source: https://github.com/gcollazo/Fabulous Project Homepage: http://elweb.co/ 1. pyes pyes is a pythonic way to use ElasticSearch since 2010. Project Source: https://github.com/aparo/pyes Project Documentation: https://pypi.python.org/pypi/pyes/