San Francisco Chronicle

Facial recognition works best if you’re a white guy

By Steve Lohr

Facial recognition technology is improving by leaps and bounds. Some commercial software can now tell the gender of a person in a photograph.

When the person in the photo is a white man, the software is right 99 percent of the time.

But the darker the skin, the more errors arise — up to nearly 35 percent for images of darker-skinned women, according to a new study that breaks fresh ground by measuring how the technology works on people of different races and gender.

These disparate results, calculated by Joy Buolamwini, a researcher at the Massachusetts Institute of Technology Media Lab, show how some of the biases in the real world can seep into artificial intelligence, the computer systems that inform facial recognition.

In modern artificial intelligence, data rules. AI software is only as smart as the data used to train it. If there are many more white men than black women in the system, it will be worse at identifying the black women.

One widely used facial recognition data set was estimated to be more than 75 percent male and more than 80 percent white, according to another research study.
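For readers curious what checking a training set's makeup looks like in practice, here is a minimal Python sketch. It is not drawn from the study itself; the record fields and the toy counts below are illustrative, chosen to roughly mirror the more-than-75-percent-male, more-than-80-percent-white split the researchers estimated.

```python
# Minimal sketch of auditing a labeled face dataset for demographic balance.
# The record fields ("gender", "skin_type") and the counts are hypothetical.
from collections import Counter

def audit_composition(records, keys=("gender", "skin_type")):
    """Print how the dataset breaks down along each demographic key."""
    for key in keys:
        counts = Counter(r[key] for r in records)
        total = sum(counts.values())
        print(f"{key}:")
        for group, n in counts.most_common():
            print(f"  {group:<8} {n:>5}  ({100 * n / total:.1f}%)")

# Toy dataset skewed roughly the way the cited benchmark was estimated to be.
records = (
    [{"gender": "male", "skin_type": "lighter"}] * 70
    + [{"gender": "female", "skin_type": "lighter"}] * 12
    + [{"gender": "male", "skin_type": "darker"}] * 10
    + [{"gender": "female", "skin_type": "darker"}] * 8
)
audit_composition(records)
```

A model trained on data with that kind of skew simply sees far fewer examples of some groups, which is the imbalance described above.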

The new study also raises broader questions of fairness and accountability in artificial intelligence at a time when investment in and adoption of the technology are racing ahead.

Today, facial recognition software is being deployed by companies in various ways, including to help target product pitches based on social media profile pictures. But companies are also experimenting with facial identification and other AI technology as an ingredient in automated decisions with higher stakes like hiring and lending.

Researchers at the Georgetown Law School estimated that 117 million American adults are in facial recognition networks used by law enforcement — and that African Americans were most likely to be singled out, because they were disproportionately represented in mug shot databases.

Facial recognition technology is lightly regulated so far.

“This is the right time to be addressing how these AI systems work and where they fail — to make them socially accountable,” said Suresh Venkatasubramanian, a professor of computer science at the University of Utah.

Until now, there was only anecdotal evidence of computer vision miscues, occasionally in ways that suggested discrimination. In 2015, for example, Google had to apologize after its image-recognition photo app initially labeled African Americans as “gorillas.”

Sorelle Friedler, a computer scientist at Haverford College and a reviewing editor on Buolamwini’s research paper, said experts had long suspected that facial recognition software performed differently on different populations.

“But this is the first work I’m aware of that shows that empirically,” Friedler said.

Buolamwini, a young African American computer scientist, experienced the bias of facial recognition firsthand. When she was an undergraduate at the Georgia Institute of Technology, programs would work well on her white friends, she said, but not recognize her face at all. She figured it was a flaw that would surely be fixed before long.

But a few years later, after joining the MIT Media Lab, she ran into the missing-face problem again. Only when she put on a white mask did the software recognize hers as a face.

By then, facial recognition software was increasingly moving out of the lab and into the mainstream.

“OK, this is serious,” she recalled deciding then. “Time to do something.”

So she turned her attention to fighting the bias built into digital technology. Now 28 and a doctoral student, after studying as a Rhodes scholar and a Fulbright fellow, she is an advocate in the new field of “algorithmic accountability,” which seeks to make automated decisions more transparent, explainable and fair.

Her short TED Talk on coded bias has been viewed more than 940,000 times, and she founded the Algorithmic Justice League, a project to raise awareness of the issue.

In her newly published paper, which will be presented at a conference this month, Buolamwini studied the performance of three leading facial recognition systems — by Microsoft, IBM and Megvii of China — by classifying how well they could guess the gender of people with different skin tones.

These companies were selected because they offered gender classification features in their facial analysis software — and their code was publicly available for testing.

She found them all wanting.

To test the commercial systems, Buolamwini built a data set of 1,270 faces, using faces of lawmakers from countries with a high percentage of women in office. The sources included three African nations with predominantly dark-skinned populations, and three Nordic countries with mainly light-skinned residents.

The African and Nordic faces were scored according to a six-point labeling system used by dermatologists to classify skin types. The medical classifications were determined to be more objective and precise than race.

Then, each company’s software was tested on the curated data, crafted for gender balance and a range of skin tones. The results varied somewhat. Microsoft’s error rate for darker-skinned women was 21 percent, while IBM’s and Megvii’s rates were nearly 35 percent. They all had error rates below 1 percent for light-skinned males.
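In code terms, an audit like this amounts to running each vendor’s classifier over the balanced benchmark and tallying mistakes separately for each skin-tone-and-gender subgroup. The sketch below shows that bookkeeping in Python; the classifier interface and field names are assumptions for illustration, not the paper’s actual code.

```python
# Sketch of per-subgroup error-rate bookkeeping for a gender classifier.
# `samples` carry ground-truth labels; `classify` wraps some vendor's API.
from collections import defaultdict

def error_rates_by_group(samples, classify):
    """Return {(skin_type, gender): error rate} over the benchmark."""
    errors = defaultdict(int)
    totals = defaultdict(int)
    for s in samples:
        group = (s["skin_type"], s["gender"])
        totals[group] += 1
        if classify(s["image"]) != s["gender"]:
            errors[group] += 1
    return {g: errors[g] / totals[g] for g in totals}

# Hypothetical usage with a wrapper around one vendor's service:
# rates = error_rates_by_group(benchmark, vendor.predict_gender)
# for (skin, gender), rate in sorted(rates.items()):
#     print(f"{skin} {gender}: {rate:.1%} error")
```

Reporting the rates per subgroup, rather than one overall accuracy number, is what exposes the gap between light-skinned men and darker-skinned women.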

Buolamwini shared the research results with each of the companies. IBM told her that the company had steadily improved its facial analysis software and was “deeply committed” to “unbiased” and “transparent” services. This month, the company said, it will roll out an improved service with a nearly tenfold increase in accuracy on darker-skinned women.

Microsoft said that it had “already taken steps to improve the accuracy of our facial recognition technology” and that it was investing in research “to recognize, understand and remove bias.”

Buolamwini’s co-author on her paper is Timnit Gebru, who described her role as an adviser. Gebru is a scientist at Microsoft Research, working on its Fairness, Accountability, Transparency and Ethics in AI group.

Megvii, whose Face++ software is widely used for identification in online payment and ride-sharing services in China, did not reply to several requests for comment, Buolamwini said.

Buolamwini is releasing her data set for others to build upon. She describes her research as “a starting point, very much a first step” toward solutions.

Buolamwini is taking further steps in the technical community and beyond. She is working with the Institute of Electrical and Electronics Engineers, a large professional organization in computing, to set up a group to create standards for accountability and transparency in facial analysis software.

She meets regularly with other academics, public policy groups and philanthropies that are concerned about the impact of artificial intelligence. Darren Walker, president of the Ford Foundation, said that the new technology could be a “platform for opportunity,” but that it would not happen if it replicated and amplified bias and discrimination of the past.

“There is a battle going on for fairness, inclusion and justice in the digital world,” Walker said.

Part of the challenge, scientists say, is that there is so little diversity among AI workers.

“We’d have a lot more introspection and accountability in the field of AI if we had more people like Joy,” said Cathy O’Neil, a data scientist and author of “Weapons of Math Destruction.”

Technology, Buolamwini said, should be more attuned to the people who use it and the people it’s used on.

“You can’t have ethical AI that’s not inclusive,” she said. “And whoever is creating the technology is setting the standards.”

Joy Buolamwini’s research found lots of errors, particularly in recognizing darker-skinned women. (Tony Luong / New York Times)
