Applications of Machine Learning Methods to Predict Readmission and Length-of-Stay for Homeless Families: The Case of Win Shelters in New York City

Boyeong Hong, Awais Malik, Jack Lundquist, Ira Bellach, Constantine E. Kontokosta

Research output: Contribution to journalArticle


New York City faces the challenge of an ever-increasing homeless population with almost 60,000 people currently living in city shelters. In 2015, approximately 25% of families stayed longer than nine months in a shelter, and 17% of families with children that exited a homeless shelter returned to the shelter system within 30 days of leaving. This suggests that “long-term” shelter residents and those that re-enter shelters contribute significantly to the rise of the homeless population living in city shelters and indicate systemic challenges to finding adequate permanent housing. This article focuses on our preliminary work with Win (Women-in-Need) shelters to understand the factors that predict readmission and length-of-stay of homeless families. We create a unified, comprehensive database of the homeless population being served by Win shelters, accounting for more than 6,000 homeless families. We apply logistic regression models and an unsupervised clustering algorithm to identify predictors of re-entry and long-term length-of-stay. Citizenship, age, medical conditions, employment, and history of foster care or shelter stays as a child are found to be significant predictors. The results of the K-means clustering identify three primary groups, consistent with previous typologies characterized by transitionally homeless, episodically homeless, and chronically homeless.

Original languageEnglish (US)
Pages (from-to)89-104
Number of pages16
JournalJournal of Technology in Human Services
Issue number1
StatePublished - Jan 2 2018



  • Clustering
  • homelessness
  • machine learning
  • predictive modeling
  • shelter services

ASJC Scopus subject areas

  • Health(social science)
  • Social Sciences (miscellaneous)
  • Social Sciences(all)
  • Computer Networks and Communications

Cite this