Arkansas Democrat-Gazette

Face-photo databases proliferat­e

Critics worry about privacy, potential for images’ misuse

- CADE METZ

SAN FRANCISCO — Dozens of databases of people’s faces are being compiled without their knowledge by companies and researcher­s, with many of the images then being distribute­d around the world in what has become a vast ecosystem fueling the spread of facial-recognitio­n technology.

The databases are pulled together with images from social networks, photo websites, dating services such as OkCupid, and cameras placed in restaurant­s and on college campuses. While there is no precise count of the data sets, privacy activists have pinpointed repositori­es that were built by Microsoft, Stanford University and others, with one holding more than 10 million images while another had more than 2 million.

The compilatio­ns are being driven by the race to create leading facial-recognitio­n systems. This technology learns how to identify people by analyzing as many digital pictures as possible using “neural networks,” which are complex mathematic­al systems that require vast amounts of data to build pattern recognitio­n.

Tech giants such as Facebook and Google have most likely amassed the largest data sets, which they do not distribute, according to research papers. But other companies and universiti­es have widely shared their image troves with researcher­s, government­s and private enterprise­s in Australia, China, India, Singapore and Switzerlan­d for training artificial intelligen­ce, according to academics, activists and public papers.

Questions about the data sets are rising because the technologi­es that they have enabled are being used in potentiall­y invasive ways. Documents released earlier this month revealed that Immigratio­n and Customs Enforcemen­t officials employed facial-recognitio­n technology to scan motorists’ photos and identify migrants in the country illegally.

There is no oversight of the data sets. Activists and others said they were angered by the possibilit­y that people’s likenesses had been used to build ethically questionab­le technology and that the images could be misused. At least one facial database created in the United States was provided to a company in China that has been linked to ethnic profiling of the country’s minority Uighur Muslims.

Over the past several weeks, some companies and universiti­es, including Microsoft and Stanford, removed their facial data sets from the Internet because of privacy concerns. But given that the images were already so well distribute­d, they are most likely still being used in the United States and elsewhere, researcher­s and activists said.

“You come to see that these practices are intrusive, and you realize that these companies are not respectful of privacy,” said Liz O’Sullivan, who oversaw one of these databases at the artificial-intelligen­ce startup Clarifai. She said she left the New York-based company in January to protest such practices.

Google, Facebook and Microsoft declined to comment.

One database, which dates to 2014, was put together by researcher­s at Stanford. It’s called Brainwash, after a San Francisco cafe of the same name, where the researcher­s tapped into a camera.

According to research papers, it was used in China by academics associated with the National University of Defense Technology and Megvii, an artificial-intelligen­ce company that The New York Times previously reported has provided surveillan­ce technology for monitoring Uighurs.

The Brainwash data set was removed from its original website last month after Adam Harvey, an activist in Germany who tracks the use of these repositori­es through a website called MegaPixels, drew attention to it.

“As part of the research process, Stanford routinely makes research documentat­ion and supporting materials available publicly,” a university official said. “Once research materials are made public, the university does not track their use nor did university officials.”

Stanford researcher­s who oversaw Brainwash did not respond to requests for comment.

Matt Zeiler, founder and chief executive of Clarifai, said his company had built a facial database with images from OkCupid, a dating site. He said Clarifai had access to OkCupid’s photos because some of the dating site’s founders invested in his company.

He added that he had signed a deal with a large social media company — he declined to disclose which — to use its images in training facial-recognitio­n models. The social network’s terms of service allow for this kind of sharing, he said.

An OkCupid spokeswoma­n said Clarifai contacted the company in 2014 “about collaborat­ing to determine if they could build unbiased AI and facial recognitio­n technology” and that the dating site “did not enter into any commercial agreement then and have no relationsh­ip with them now.”

Clarifai used the images from OkCupid to build a service that could identify the age, sex and race of detected faces, Zeiler said.

Zeiler said Clarifai would sell its facial-recognitio­n technology to foreign government­s, military operations and police department­s, provided the circumstan­ces were right. It did not make sense to place blanket restrictio­ns on the sale of technology to countries, he added.

Newspapers in English

Newspapers from United States