Saturday Star

Data used to check crime is skewed

- CATHY O’NEIL

US President Donald Trump plans to collect a lot more data about crimes committed by immigrants. This will inevitably give him a weapon to use against them, thanks to a peculiarity of crime statistics: if you look for something, you’ll almost always find more of it.

Trump recently started two initiatives focused on crime. He has promised to create a new office in the Department of Homeland Security, Victims of Immigrant Crime Engagement, to collect data on the transgressions of immigrants.

And in his revised executive order halting visas and refugees from certain countries, he called for a public database on “honour killings”, defined as gender-based violence against women by foreign nationals.

It’s hard to get to the truth about crime. One could argue that we don’t really have crime data at all. Rather, we have information on arrests and reports.

A lot of criminal activity – drug use, small-time theft, trespassing, turnstile jumping – never gets recorded unless a police officer happens to be present.

Most rapes go unreported, and as many as a third of all murders are never solved.

The incompleteness of the data means that what we decide to collect can have a big impact on what we see. If we spend a lot of time and energy finding and documenting crimes committed by a certain sub-population, we’ll naturally increase its prominence.

This wouldn’t mean that such people are more criminal. They’re simply getting a different level of scrutiny.

Consider how police departments have focused on nuisance crimes in poor and minority neighbourhoods – part of a broader strategy known as “broken windows policing”. Blacks ended up getting arrested for smoking marijuana a lot more often than whites – even though people of both races actually use the stuff at about the same rate. Similarly, the Chicago Police Accountability Task Force found that black drivers were much more likely than white drivers to be stopped on suspicion of carrying contraband, even though they were less likely to actually be carrying it.

Despite the obvious flaws in arrest data, we still use them in designing policies. Police departments send more officers to areas where they make the most arrests. Judges consider previous arrests in deciding how harshly to sentence. Computer algorithms use the data to predict where crimes will occur, how much bail to demand and whether to free prisoners on parole.

All those decisions are as biased as the data on which they are based – an ongoing problem for poor people and minorities, who find themselves increasingly surveilled and incarcerated.

Perhaps you’ve already heard the statistic that immigrants are involved in less crime than native-born Americans.

If we start over-scrutinizing immigrants from Muslim-majority countries, the numbers might well change to their detriment, giving the Trump administration the fodder it needs to engage in yet more profiling purported to ensure the nation’s security.

To be fair, and to be scientific about it, we should choose another sub-population for equal focus, so we can measure the effects of our added attention.

I suggest starting with politicians. - The Washington Post

O’Neil is a mathematician who has worked as a professor, hedge-fund analyst and data scientist. She founded ORCAA, an algorithmic auditing company, and is the author of ‘Weapons of Math Destruction’.
