The Maui News

People, homes vanish due to 2020 census’ new privacy method

By MIKE SCHNEIDER

The three-bedroom colonial-style house where Jessica Stephenson has lived in Milwaukee for the last six years bustles with activity on any given weekday, filled with the chattering of children in the day care center she runs out of her home.

The U.S. Census Bureau says no one lives there.

“They should come and see it for themselves,” Stephenson said.

From her majority-Black neighborhood in Wisconsin to a community of Hasidic Jews in New York’s Catskill Mountains to a park outside Tampa, Florida, a method used by the Census Bureau for the first time to protect confidentiality in the 2020 census has made people and occupied homes vanish, at least on paper, when they actually exist in the real world.

It’s not a magic trick but rather a new statistical method the bureau is using called differential privacy, which involves the intentional addition of errors to data to obscure the identity of any given participant.
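To make the idea concrete, here is a minimal sketch of the textbook Laplace mechanism, one common way to add calibrated random error to a count. It is not the Census Bureau's production system, which is more elaborate. The function names, the `epsilon` value and the block figures below are all hypothetical, chosen only for illustration.

```python
import random

def laplace_noise(scale, rng):
    # The difference of two independent exponential draws with mean `scale`
    # follows a Laplace distribution centered on zero with that scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)

def noisy_count(true_count, epsilon, rng):
    # Assumption: one person changes a count by at most 1, so the noise
    # scale is 1 / epsilon. Smaller epsilon means more privacy, more error.
    noisy = true_count + laplace_noise(1.0 / epsilon, rng)
    # Published counts cannot be negative or fractional, so clamp and round.
    return max(0, round(noisy))

rng = random.Random(0)

# Hypothetical small block: 3 residents living in 1 occupied home.
true_people, true_homes = 3, 1

# Noise is drawn separately for each figure in this sketch, so the two
# published numbers are not forced to agree with each other.
published_people = noisy_count(true_people, epsilon=0.5, rng=rng)
published_homes = noisy_count(true_homes, epsilon=0.5, rng=rng)
print(published_people, published_homes)
```

Because each figure gets its own random error in this sketch, a small block can end up with residents but no occupied homes, or the reverse, even though the underlying records were consistent.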

Bureau officials say it’s necessary to protect privacy in a time of increasingly sophisticated data mining, as technological innovations magnify the threat of people being “re-identified” through the use of powerful computers to match census information with other public databases. By law, census answers are supposed to be confidential.

But some city officials and demographers think it veers too far from reality and could cause errors in the data used for drawing political districts and distributing federal funds.

At least one analysis suggests that differential privacy could penalize minority communities by undercounting areas that are racially and ethnically mixed. Harvard University researchers found that the method made it more difficult to create political districts of equal population and could result in fewer majority-minority districts.

The Census Bureau, for its part, argues that the data is every bit as good as in past censuses and that the low-level inaccuracies don’t present a large-scale problem.

What’s certain is that the method can produce weird, contradictory and false results at the smallest geographic levels, such as neighborhood blocks.

For example, the official 2020 census results say 54 people live in Stephenson’s census block in midtown Milwaukee, but also that there are no occupied homes. In reality, almost two dozen houses line the car-lined streets, some dating back more than a century. Forty-eight of the residents living in the block are Black, according to the census, though it’s difficult to know for sure, given the whimsy of differential privacy.

In another case, the census lists no people living in the Flatwoods Conservation Park outside Tampa, even though it says there is a home occupied by people. According to Hillsborough County spokesman Todd Pratt, two county employees live there while maintaining security for the park.

And in an enclave of Hasidic Jews located in Kiamesha Lake, New York, 81 people are recorded as residents, but the census officially says there are no occupied homes. Sullivan County property records show almost a dozen homes whose residents have ties to the Vizhnitzer Hasidic community.

The unreliable data has created headaches for city managers and planners of small communities who worry that it may not be valid for decision-making. Eric Guthrie, a senior demographer at the Minnesota State Demographic Center, said he has been contacted by a half-dozen city managers from around the state who were concerned about potential impacts to state and federal funding.

“I explain to them there’s not a method for correcting it, that it’s not an error in the traditional sense,” Guthrie said. “The bug is there by design.”

The scale of the changes becomes clearer when viewed through a broader lens. For Florida, the nation’s third most populous state with more than 21 million residents, the 2020 census listed 15,000 neighborhood blocks as having a total of 200,000 residents but no occupied homes. On the flip side, 1,200 of the state’s 484,000 blocks were listed as having occupied homes but no population, according to Rich Doty, geographic information system coordinator and research demographer at the University of Florida’s Bureau of Economic and Business Research.

“We expected these anomalies, as we were warned about this by the Census Bureau and other states,” Doty said. “We just didn’t expect this many.”

Ahead of the August release of census data used for drawing congressional and legislative districts, acting Census Bureau director Ron Jarmin warned that the application of differential privacy could produce some “fuzzy” figures at the neighborhood block level and urged data users to combine blocks to get accurate results. But the bureau also says that despite the implementation of differential privacy, the quality of the 2020 data isn’t any worse than previous censuses based on measurements of data quality.
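The advice to combine blocks follows from basic statistics: independent random errors partly cancel when added together, so the error grows more slowly than the population being counted. The simulation below is a toy illustration under assumed conditions (independent Laplace noise with a hypothetical scale of 5 on blocks of 50 people each), not a model of the bureau's actual procedure. The function names are invented for this sketch.

```python
import random

def laplace_noise(scale, rng):
    # Difference of two exponential draws gives zero-centered Laplace noise.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)

def mean_relative_error(n_blocks, people_per_block, scale, trials, rng):
    # Average, over many simulated releases, of how far the combined noisy
    # total lands from the true total, as a fraction of the true total.
    true_total = n_blocks * people_per_block
    total_error = 0.0
    for _ in range(trials):
        noisy_total = sum(
            people_per_block + laplace_noise(scale, rng)
            for _ in range(n_blocks)
        )
        total_error += abs(noisy_total - true_total) / true_total
    return total_error / trials

rng = random.Random(42)

# Hypothetical setup: every block truly holds 50 people.
single = mean_relative_error(1, 50, scale=5.0, trials=2000, rng=rng)
combined = mean_relative_error(100, 50, scale=5.0, trials=2000, rng=rng)

print(f"one block alone:     {single:.1%} average error")
print(f"100 blocks combined: {combined:.1%} average error")
```

In this sketch the combined figure is far more reliable in percentage terms than any single block, which is the intuition behind treating block-level numbers as building blocks rather than as standalone facts.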

That claim is hard to evaluate since the raw data without the application of differential privacy is not being made public, said Stefan Rayer, a University of Florida demographer.

“We have to take their word for it,” Rayer said.

Using test data, the Harvard researchers found that differential privacy was more likely to undercount mixed-race and mixed-partisan precincts, “yielding unpredictable racial and partisan biases,” because it prioritizes the accuracy of the population count for the largest racial group in a given area.

“Our findings underscore the difficulty of balancing accuracy and respondent privacy in the Census,” they said in a report.

The Census Bureau disagrees, and so far the courts have found no reason to stop it.

Differential privacy was unsuccessfully challenged by the state of Alabama earlier this year. In a declaration for that lawsuit, the Census Bureau’s chief scientist, John Abowd, called the data “extremely accurate” and said the use of differential privacy showed no bias regarding racial or ethnic minorities.

“Redistricters can remain confident in the accuracy of the population counts and demographic characteristics of the voting districts they draw, despite the noise in the individual building blocks,” Abowd said.

Not everyone believes the technique is the right way to protect confidentiality.

Two University of Minnesota researchers wrote in a recent paper that a Census Bureau experiment failed to show genuine threats to confidentiality and that any risks of re-identification were similar to random guessing of households’ characteristics.

One of them, demographer Steven Ruggles, said during a presentation this month that the Census Bureau’s fear of re-identification and the resulting justification for using differential privacy could undermine confidence in the census data.

“It should not justify the degradation of the statistical infrastructure of our country,” Ruggles said. “The whole thing is likely to backfire.”

AP photo: A neighborhood in Milwaukee is one of many places in the country where a new method used by the U.S. Census Bureau to protect confidentiality in the 2020 census has made people and occupied homes vanish, at least on paper, when they actually exist in the real world.
