Pre-Trained Nonresponse Prediction in Panel Surveys with Machine Learning

Authors

Collins, J., & Kern, C.

DOI:

https://doi.org/10.18148/srm/2025.v19i2.8473

Keywords:

GESIS Panel, German Internet Panel, FReDA, SOEP, Machine Learning, Nonresponse

Abstract

While predictive modeling of unit nonresponse in panel surveys has been explored in various contexts, how practitioners can best adopt these techniques remains under-researched. Currently, practitioners must wait until their panel has accumulated enough data to train and evaluate their own models. This paper presents a novel “cross-training” technique: we show that the indicators of nonresponse are so ubiquitous across studies that it is viable to train a model on one panel study and apply it to a different one. The practical benefit is that newly launched panels can potentially make better nonresponse predictions in their early waves, because the pre-trained models draw on more data. We demonstrate the technique with five panel surveys that encompass a variety of survey designs: the Socio-Economic Panel (SOEP), the German Internet Panel (GIP), the GESIS Panel, the Mannheim Corona Study (MCS), and the Family Demographic Panel (FReDA). We find that nonresponse history and demographics, paired with tree-based modeling methods, yield highly accurate and generalizable predictions across studies, despite differences in panel design. We also show how cross-training can effectively predict nonresponse in early panel waves, where attrition is typically highest.
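
The cross-training idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the two panels are simulated, the feature names (`missed_last`, `prior_share`, `age`) are hypothetical stand-ins for nonresponse history and demographics, and a small hand-written classification tree stands in for the tree-based methods mentioned above. The model is fit on a "source" panel only and then scored on a separate "target" panel.

```python
import random

def simulate_panel(n, seed, miss_rate):
    """Simulate a hypothetical panel wave; the data-generating process is assumed."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        missed_last = 1 if rng.random() < miss_rate else 0  # nonresponse in previous wave
        prior_share = rng.random()                          # share of earlier waves missed
        age = rng.randint(18, 85)                           # demographic feature
        p = 0.05 + 0.5 * missed_last + 0.3 * prior_share    # assumed nonresponse propensity
        X.append([missed_last, prior_share, age])
        y.append(1 if rng.random() < p else 0)
    return X, y

def gini(labels):
    """Gini impurity of a list of binary labels."""
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(X, y):
    """Return (impurity, feature, threshold) of the best split, or None."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            left = [yi for row, yi in zip(X, y) if row[j] <= t]
            right = [yi for row, yi in zip(X, y) if row[j] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if best is None or score < best[0]:
                best = (score, j, t)
    return best

def fit_tree(X, y, depth=3, min_node=40):
    """Fit a small classification tree; leaves hold the observed nonresponse rate."""
    split = best_split(X, y) if depth > 0 and len(y) >= min_node else None
    if split is None:
        return sum(y) / len(y)
    _, j, t = split
    left = [(row, yi) for row, yi in zip(X, y) if row[j] <= t]
    right = [(row, yi) for row, yi in zip(X, y) if row[j] > t]
    return (j, t,
            fit_tree([r for r, _ in left], [yi for _, yi in left], depth - 1, min_node),
            fit_tree([r for r, _ in right], [yi for _, yi in right], depth - 1, min_node))

def predict(tree, row):
    """Walk the tree and return the predicted nonresponse probability."""
    while isinstance(tree, tuple):
        j, t, left, right = tree
        tree = left if row[j] <= t else right
    return tree

def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count one half)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Cross-training: fit on the source panel, evaluate on a different target panel.
X_source, y_source = simulate_panel(800, seed=1, miss_rate=0.2)
X_target, y_target = simulate_panel(500, seed=2, miss_rate=0.3)
model = fit_tree(X_source, y_source)
scores = [predict(model, row) for row in X_target]
target_auc = auc(scores, y_target)
print(f"AUC on the target panel: {target_auc:.3f}")
```

In practice one would likely replace the hand-written tree with an established tree-based learner and harmonize the predictor definitions across panels before training; the sketch only shows the train-on-one, predict-on-another workflow.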

Published

2025-08-08 — Updated on 2025-08-19

How to Cite

Collins, J., & Kern, C. (2025). Pre-Trained Nonresponse Prediction in Panel Surveys with Machine Learning. Survey Research Methods, 19(2), 123–137. https://doi.org/10.18148/srm/2025.v19i2.8473 (Original work published August 8, 2025)

Section

Articles
