Data Science Blog Post for Udacity Projekt #1
The first assignment of the Udacity “Data Scientist" nanodegree program is to analyze a dataset of choice. Here, Airbnb listings in Berlin (scraped on 18th of Dec. 2023) build the foundation of this analysis.
All code can be found here:
A comprehensive summary can be found on Medium: (https://medium.com/@nebel.johannes/sharing-is-caring-does-this-also-hold-for-airbnb-accommodations-in-berlin-7a847f4634c5)
For the assignment, the following questions were answered:
- Q1: What are the most prevalent room types in Berlin?
- Q2: Is there a difference in cleanliness between shared rooms and private rooms in Berlin?
- Q3: Are shared rooms cheaper per person than private rooms in Berlin?
- Q4: Do shared rooms and private rooms differ in value for money in Berlin?
The assignment was implemented in Python with Jupyter Notebook. The following packages have been used:
- pandas
- matplotlib.pyplot
- seaborn
From Airbnb (http://insideairbnb.com/get-the-data), the listings of Berlin (Germany) were taken (see listings.csv.gz in the repository).
Special thanks to Airbnb for publicly providing data.