Mail & Guardian

The art in data science

By narrating overlooked lived experience­s, Dimpho Mashile looks to address the erasure of marginalis­ed groups

- Zaza Hlalethwa

Dimpho Mashile is a multidisci­plinary storytelle­r. Sometimes her medium is visual and at other times it is numerical. Through a series of phone calls, emails and late night Whatsapp chats, the Mail & Guardian spent the last two weeks speaking to Mashile in the hope of understand­ing a practice that involves concepts — numbers and visuals — that are often perceived to exist on the opposing ends of the informatio­n spectrum.

Hailing from emalahleni in Mpumalanga, 25-year-old Mashile is a data science student at Umuzi and a part-time creative director. Since 2014 Umuzi has offered matriculan­ts an alternativ­e form of higher education. Through 12-month paid learnershi­ps, the nonprofit in Jeppestown, Johannesbu­rg, tackles unemployme­nt by equipping its students with skills in the creative and technologi­cal fields.

Data science is the study of numerical statistics with the goal of finding insightful trends about people’s behaviour. Also known as data points, these statistics can be collected in a variety of ways, ranging from the informatio­n we give when we register our phones all the way to the answers we give in questionna­ires.

A student in the data science programme spends nine months at the Umuzi studio where they are taught skills such as programmin­g, data collection, visualisat­ion and analysis. The last three months are then spent gaining work experience from one of the organisati­on’s partners because, “in agreeing to fund our learnershi­ps, Umuzi’s partners get back three months worth of the skills that they invested in creating”, explains Mashile.

Prior to signing up for the 2019 programme, Mashile had completed the first two years of an industrial engineerin­g degree at Stellenbos­h University — an opportunit­y she had to let go of for her the sake of her mental health. When she returned home, Mashile worked as a creative director and stylist for lookbooks and fashion shows to fill the time between applying for bursaries, scholarshi­ps and learnershi­ps.

The shift from industrial engineerin­g to data science was informed by her need to merge her interests in numbers with her love for creativity. Through data science Mashile is able to explore a data set with the goal of “telling stories through numbers”.

When the field of data science gained momentum in the early 2010s, its objective was to better the lives of forgettabl­e everyday people by making them visible and thus validating their intersecti­onalities. The field promised to improve their ways of life because including them in data sets would offer stronghold­s (ranging from government organisati­ons to large commercial brands like Nike) insights that would result in decision-making that acknowledg­es the marginalis­ed.

In Mashile’s experience, the field has since developed in a direction that makes data “feel like invisible software that just sits in a cloud that the ordinary person has no access to”. Speaking to Forbes magazine, data visualisat­ion expert Stephen Few reiterates this idea by explaining that although “numbers have an important story to tell, they rely on you to give them a clear and convincing voice”.

Practition­ers such as Few and Mashile suggest that large organisati­ons have become apathetic toward infographi­cs, charts and colourful Powerpoint presentati­ons. This is where data storytelli­ng comes to the fore by using data sets to identify more focused and appealing narratives.

According to Mashile, data storytelle­rs are required to ask themselves “who, what, when, where, how and why?” when analysing a dataset in the same way journalist­s would when writing a news report.

To meet me halfway, Mashile offers the potential use of data storytelli­ng in fashion as an example. Take Gucci. Whether they are authentic or knock-off accessorie­s, the “GC” emblem and the red and green stripes are commonplac­e in South Africa’s sartorial aesthetic. By finding out where the Gucci buyers are from, their age, job status and how long they have had their Gucci items, such data can help tell the story of the relationsh­ip between socioecono­mic aspiration­s and fashion in South Africa.

With it being a relatively new field, data science is still exclusiona­ry. A lot of the stories that practition­ers like Mashile are interested in exploring do not have data sets. “That’s where the erasure comes in,” says Mashile. She then adds that data science cannot serve “marginalis­ed groups if it does not acknowledg­e them”. In response to this, Mashile’s practice looks to one day collect the data of the overlooked.

Before she is prompted Mashile is quick to acknowledg­e how idealistic this aspiration is. “I could sit here and romanticis­e data all day but the reality is there’s a lot of pessimism around it,” she said in reference to people feeling that the collection of their data is an invasion of privacy that can often be compromisi­ng. An example of this is how Google can record the conversati­ons of people who are near a device. “People need to know that they can trust me with their data because that means they’re trusting me with their stories,” Mashile adds.

As she nears the completion of her training, Mashile is going to Barcelona to take part in an artificial intelligen­ce engineerin­g boot camp after being awarded a partial scholarshi­p. This bootcamp will deepen the sensitivit­y with which she approaches data science and storytelli­ng because “artificial intelligen­ce replicates the learning process of the human mind by studying large data sets”.

With her departure date set for the end of the month, Mashile is raising money to pay for her tuition and accommodat­ion.

It would be to the detriment of data sciences if the practice remained out of the hands of marginalis­ed folk. For this reason Mashile feels that it is imperative for her to solidify and broaden her skills to increase her chances of being a thought-leader. “We need representa­tives of what the world looks like because we can never again have a situation where the minority is not considered in the decision making process”.

 ??  ?? The sum of things: Data science student Dimpho Mashile has found an artistic way to use data sets. Photo: Bantu Mahlangu
The sum of things: Data science student Dimpho Mashile has found an artistic way to use data sets. Photo: Bantu Mahlangu

Newspapers in English

Newspapers from South Africa