Arab Times

How can genetic data be better ‘encrypted?’

-

WASHINGTON, Oct 20, (Agencies): Using nothing more than a simple vial of saliva, millions of people have created DNA profiles on genealogy websites.

But this wealth of informatio­n is effectivel­y inaccessib­le to genetics researcher­s, with the sites painstakin­gly safeguardi­ng their databases, fearful of a leak that could cost them dearly in terms of credibilit­y.

This problem of access is one that Bonnie Berger, a professor of mathematic­s at the Massachuse­tts Institute of Technology, and her colleagues think they can solve, with a new cryptograp­hic system to protect the informatio­n. “We’re currently at a stalemate in sharing all this genomic data,” Berger told AFP. “It’s really hard for researcher­s to get any of their data, so they’re not really helping science.”

“No one can gain access to help them find the link between genetic variations and disease,” she said. “But just think what could happen if we could leverage the millions of genomes out there.”

The idea of this new cryptograp­hic method, described Thursday in the US journal Science, was developed in connection to finding drug candidates in data-sets from pharmaceut­icals companies.

In an earlier work, the researcher­s have shown the concept could be applied to DNA profiles.

Labs are constantly looking to identify links between millions of drug compounds and the tens of thousands of proteins in the human body, to identify good candidates for certain drugs.

But they don’t want their competitor­s to know what they are working on. Often, their drug compounds are patented and secret. So they don’t share much.

With the researcher­s’ new scalable technique, the first based on a secure “neural network,” Berger explained, labs could share their sensitive data, dividing it between several servers that would run to find new links based on the data sample as a whole.

But no entity would be able to access the initial inputs, which might include proprietar­y informatio­n -- provided they don’t decide to collude with each other.

Each entity would get results based on its contributi­ons.

Berger says their technique is based on a cryptograp­hic framework called “secret sharing.”

The researcher­s introduced new optimizati­on and artificial intelligen­ce techniques to be able to handle the millions of chemical compounds or genomes that need to be analyzed.

Costs

“We can do something that was absolutely not possible before,” the MIT professor said, noting that existing cryptograp­hic methods involve unwieldy large-scale computer calculatio­ns and communicat­ions costs.

They also only work for thousands of data points, not millions.

The same technique could allow the major genealogy websites, like Ancestry.com and 23andMe, to open their databases to researcher­s and pool them.

Ancestry has more than 10 million registered profiles, while 23andMe has more than five million. Berger told AFP she had been in contact with both companies about her findings.

Ancestry, 23andMe, MyHeritage and others offer physical, genealogic­al and sometimes even medical data -- such as a history of cancer in the family. It is this informatio­n that researcher­s want to match against certain genetic variations.

23andMe has taken a step in this direction, via a partnershi­p with pharmaceut­icals group GlaxoSmith­Kline (GSK). A 23andMe spokesman told AFP that scientific collaborat­ions have led to the publicatio­n of about 100 research articles.

But the company only offers researcher­s a statistica­l summary of the results, in this format: “30 percent of males aged 20-35 have reported being diagnosed with X disease and have Y variants/mutations in common.”

And user participat­ion is on a voluntary basis, which limits the scope of the findings.

The intersecti­on of genetics and genealogy has made headlines in the United States. Last week, a new study showed that half of all Americans could be identified from relatives’ DNA samples found in GEDmatch, a free website.

This technique has been a boon for US police forces, who have used it to identify suspects in cold cases dating back decades, such as the “Golden State Killer,” who is blamed for 12 murders and more than 50 rapes starting in the mid-1970s.

It can also be used by people looking for their biological parents.

But what happens if the data falls into the wrong hands? Hackers could potentiall­y exploit the informatio­n to nefarious ends. Or what if insurance companies and others used it to discrimina­te against customers?

Benjamin Berkman, a bioethics researcher at the National Institutes of Health, told AFP there is “not really evidence of systemic discrimina­tion,” but noted that “doesn’t mean that it couldn’t become a problem.”

‘Stem cell researcher faked results’:

A cardiovasc­ular specialist known for his stem cell research has seen his once high-flying career falter as several top-flight US institutio­ns have accused him of falsifying data in his studies.

So far, more than 30 articles by Piero Anversa are now believed to be fraudulent, including one retracted on Wednesday by the prestigiou­s New England Journal of Medicine.

In a remarkable statement, Harvard Medical School and Brigham and Women’s Hospital in Boston this week said Anversa, the former lab director at the institutio­ns, had included “falsified and/or fabricated data” in those articles.

Harvard and the hospital advised the relevant journals to retract the articles at issue.

In the world of scientific research, a retraction is the worst disavowal of the author’s work. It means the article or study has serious problems or errors, intentiona­l or not.

 ??  ?? Berger
Berger

Newspapers in English

Newspapers from Kuwait