Poster Paper:
Machine Learning Algorithms for Longitudinal Clustering
*Names in bold indicate Presenter
Data for this analysis came from an RCT that used an automated and personalized text message intervention to remind college-going students of required college enrollments tasks and connected them with counselor-based support via text message communication. These types of low-cost behavioral nudges are increasing popular in policy research. From this application, we examine data from students’ engagement with the intervention across several time points. The study included 20-time points with more than 20,000 thousand students participating. Student engagement in the outreach varied. For example, we observe variation in the intensity of engagement regarding the number of messages sent to counselors during each time point as well as in student engagement as measured by message character count over time.
We use longitudinal clustering with various constructed student engagement trajectories as model inputs. With these clustering algorithms, we classify students into different engagement groups that reveal the typical patterns of interaction in the intervention. Preliminary analysis shows that the variation in cluster assignment is driven by variation in behavior during the students' senior year of HS (in contrast to other time periods of the intervention, such as during the spring of students’ HS junior year). Each cluster corresponds to different levels of engagement. These clustering methods help to illuminate patterns of student behavior within an intervention as well as to inform the time periods over which student behavior was most differentiated. This work is a precursor to estimating treatment effects for various outcomes using the constructed trajectory clusters.