TY - GEN
T1 - Predicting Eviction Status Using Airbnb Data in the Absence of Ground-Truth Eviction Records
AU - Tabar, Maryam
AU - Abdulla, Anusha
AU - Petersen, John Andrew
AU - Lee, Dongwon
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/3/10
Y1 - 2025/3/10
N2 - The eviction of tenants is a pressing problem, which is prevalent among low-income renters in the USA, and has devastating consequences. Despite the presence of various measures to combat evictions, identifying high-need regions and tenant groups is highly challenging in many regions due to a lack of access to eviction records (partly because of some infrastructural/policy constraints). In response to this information gap, this paper proposes a solution driven by Machine Learning (ML) to monitor eviction status at various spatial resolutions using Airbnb data when ground-truth eviction data is inaccessible. In particular, we begin by demonstrating the potential of utilizing Airbnb data to build ML-driven methods for distinguishing different neighborhoods across different spatial resolutions with respect to eviction status. We then proceed to develop an ML model capable of learning eviction status levels from Airbnb data, even in the absence of ground-truth labels. Empirical evidence is presented, showcasing the model's performance on par with several robust fully-supervised ML models that had access to ground-truth labels during training. Finally, we conduct a set of cross-region tests to comprehensively study the generalizability of the achieved performance across various unseen regions in the USA that were not used during model training. The code of this project can be accessed via https://github.com/maryam-tabar/Airbnb-Eviction.
AB - The eviction of tenants is a pressing problem, which is prevalent among low-income renters in the USA, and has devastating consequences. Despite the presence of various measures to combat evictions, identifying high-need regions and tenant groups is highly challenging in many regions due to a lack of access to eviction records (partly because of some infrastructural/policy constraints). In response to this information gap, this paper proposes a solution driven by Machine Learning (ML) to monitor eviction status at various spatial resolutions using Airbnb data when ground-truth eviction data is inaccessible. In particular, we begin by demonstrating the potential of utilizing Airbnb data to build ML-driven methods for distinguishing different neighborhoods across different spatial resolutions with respect to eviction status. We then proceed to develop an ML model capable of learning eviction status levels from Airbnb data, even in the absence of ground-truth labels. Empirical evidence is presented, showcasing the model's performance on par with several robust fully-supervised ML models that had access to ground-truth labels during training. Finally, we conduct a set of cross-region tests to comprehensively study the generalizability of the achieved performance across various unseen regions in the USA that were not used during model training. The code of this project can be accessed via https://github.com/maryam-tabar/Airbnb-Eviction.
UR - http://www.scopus.com/inward/record.url?scp=105001670653&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105001670653&partnerID=8YFLogxK
U2 - 10.1145/3701551.3703549
DO - 10.1145/3701551.3703549
M3 - Conference contribution
AN - SCOPUS:105001670653
T3 - WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
SP - 885
EP - 894
BT - WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
T2 - 18th ACM International Conference on Web Search and Data Mining, WSDM 2025
Y2 - 10 March 2025 through 14 March 2025
ER -