Abstract

Summary Using criminal population conviction histories of recent offenders, prediction mod els are developed that predict three types of criminal recidivism: general recidivism, violent recidivism and sexual recidivism. The research question is whether prediction techniques from modern statistics, data mining and machine learning provide an improvement in predictive performance over classical statistical methods, namely logistic regression and linear discrim inant analysis. These models are compared on a large selection of performance measures. Results indicate that classical methods do equally well as or better than their modern counterparts. The predictive performance of the various techniques differs only slightly for general and violent recidivism, whereas differences are larger for sexual recidivism. For the general and violent recidivism data we present the results of logistic regression and for sexual recidivism of linear discriminant analysis.

Keywords

RecidivismLogistic regressionConvictionLinear discriminant analysisPsychologyStatisticsSex offensePopulationComputer sciencePoison controlMachine learningCriminologyHuman factors and ergonomicsDemographyMathematicsSexual abuseLawMedicineSociologyMedical emergency

Affiliated Institutions

Related Publications

Publication Info

Year
2012
Type
article
Volume
176
Issue
2
Pages
565-584
Citations
123
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

123
OpenAlex

Cite This

Nikolaj Tollenaar, P.G.M. van der Heijden (2012). Which Method Predicts Recidivism Best?: A Comparison of Statistical, Machine Learning and Data Mining Predictive Models. Journal of the Royal Statistical Society Series A (Statistics in Society) , 176 (2) , 565-584. https://doi.org/10.1111/j.1467-985x.2012.01056.x

Identifiers

DOI
10.1111/j.1467-985x.2012.01056.x