Panel Paper:
Using Predictive Analytics for Early Identification of Short-Term Disability Claimants Who Exhaust Their Benefits
*Names in bold indicate Presenter
Data. We use Integrated Benefits Institute (IBI) Health and Productivity Benchmarking Data from 2011 through 2015, including 820,751 closed STDI claims from 8,587 small, medium, and large businesses associated with 9 disability insurance carriers and third-party leave administrators. The data include claim outcomes and claimant, employer, and insurance plan design characteristics. The primary outcome of interest is exhaustion of the STDI benefit.
Methods. We fit several predictive models to the data, including logistic regression, regularized logistic regression (using an elastic net), and random forests. We randomly divide our sample into training and testing sets, and select a model based on the area under the receiver operating characteristic curve. Individuals are flagged as having a high probability of exhausting their benefits using a predicted probability threshold, which was chosen to balance the tradeoff between sensitivity and specificity. We report the predictive performance of our models when applied to the test set. We perform the analysis for claims with benefit duration of 26 weeks, first using the full sample, then sequentially eliminating claims that resolved within 2, 4, and 6 weeks. Comparing across durations illustrates the potential efficiency gains of waiting to allow some claims to resolve on their own.
Results. In the logistic regression, the factors most strongly associated with exhaustion of STDI benefits are age, diagnosis, and employer industry. Waiting to allow some claims to resolve without intervention improves the efficiency of targeting efforts. Modeling based on observable factors helps further narrow the target population, with the machine learning techniques we use expected to outperform logistic regression in predictive performance.
Conclusions. Our approach could represent significant savings through efficient targeting of interventions to those STDI claimants who are most likely to benefit from them. We simulate the cost and benefit of several existing early intervention proposals under our predictive modeling framework.
Full Paper: