Airbnb in BCN

gentrification in numbers

Barcelona is one of the most popular tourist destinations in Europe. The large scale of tourism in the city has entered the public debate years ago, as a lot of the city's apartments have been turned into vacation rentals and Airbnb is the biggest player in that field. This project is from the data mining to every part of the visualization my personal work and an attempt to shine a data driven light on the magnitude of Airbnb's presence in Barcelona.

Listing Locations

LOCATION

How many Airbnb listings are in Barcelona? To illustrate the scale of Airbnb rentals in Barcelona, the GPS coordinates of the listings are displayed over the city's map, with well known land marks added for orientation. The scale is further drawn out with some of the raw numbers. The different room types available on Airbnb are color-coded.


DISTRIBUTION

With the listing's coordinates, details about the accomodation itself and the host has been obtained. The distribution of the whole inventory between hosts and the distribution of the listings in the neighborhoods are analysed. The gentrification impact in each neighborhood could be calculated by combining the listing data with local census data.

Listing Distribution

Review Analysis

REVIEWS

Every reservation can result in a guest's review displayed on the listing page. All written reviews in all listings could be obtained, together with the corresponding score for the reservation experience. Additional data on the individual reviews, like creation date and language was included. AI techniques have been used for a semantic analysis of the written content, resulting in an exploratory analysis of the mental concepts, that guide a traveler's definition of a good stay.


DEVELOPMENT THROUGH TIME

After almost half a year, the whole process of collecting the inventory has been repeated, to see the development of the listing count through the travel seasons. Additionally, the planet has been facing the COVID-19 pandemic, which put tourism almost to a complete halt in BCN. The numbers can be used to derive the impact of the virus on Airbnb's presence in Barcelona. This work indicates a pre-COVID maximum count of ~18000 listings, which seems to confirm the estimated ~4000 disappeared listings as of now.

Count across time

Tessellation Plots

TESSELLATION PLOT

A different approach to displaying geographical data is explored in these Tessellation-Plots. The spatial information is delivered via triangles, where each of its vertices mark the LOCATION of an Airbnb listing. The resulting net of triangles, computed by an algorithm called "Delaunay tessellation", allows an intuitive assessment of the listing distribution in the city.

On top of that, the area of the triangles spanning the plane between 3 listings, can be used to calculate a DENSITY measure. Adding up the area of each triangle connected to a vertex and taking its inverse, results in the Delaunay Tessellation Field Estimator (DTFE), a density measure used in astronomy to calculate the density of matter in star clusters. The triangles, so far positioned in the x-y-plane, can now be used to display information in the 3rd dimension, by raising each vertex (listing) in z-direction, corresponding to a variable value, such as the calculated density of the DTFE.

The versatility of this approach is further explored by the visualization of the pricing data obtained for each listing. Again, the position of a vertex along the z-axis shows the average PRICE value per night of that specific listing. Areas on the map can then be compared qualitatively in regards to the price variable. To aid the ability to compare the magnitude of the measured variable, the color of the triangle's lines and area, code as well for the same measure as a spectrum from blue, over green, to red.

All in all, the Tessellation approach reminds of a hexbin plot. However, the advantage of the Tessellation lies clearly in the triangles not being of equal size. Thus, the visualization carries always implicitely information about the density on top of the pure location data. With this as a baseline, color and displacement along the z-axis can be used to visualize additional variables, making it suitable for display of complex multidimensional data in one plot.