White by Default: Systematic Bias in U.S. Criminal Racial Assignment
We trained a model on 1.5 million criminal records to predict race using mugshots and names achieving 92.76% accuracy. An accurate linear model trained on biased data learns the true signal, with deviations indicating mislabeling rather than model error. 29% of predicted Hispanics are misclassified as White. Correcting this increases Hispanic criminal record rates by 20-31% and decreases White rates by 4-6%.
Read More →