Business Standard

Making Wikipedia more useful

- DEVANGSHU DATTA

Singer and composer, Dido Florian Cloud de Bouneviall­e O’malley Armstrong decided, understand­ably enough, that she needed to shorten her name when she went profession­al. By the late 1990s, she had attained a fair degree of success as just “Dido”.

A few years later, she ended up cursing herself for picking that particular diminutive. By the time the online crowdsourc­ed encycloped­ia Wikipedia launched in 2001, she was certainly “notable” enough to merit an entry there.

Some vandals with a warped sense of humour kept inserting the letter “L” into the middle of her profession­al name, and this happened persistent­ly, over a period of several years.

That particular problem has been finally fixed but the “Wiki” continues to provide fake, or incomplete informatio­n, in many of its 40 million-plus entries. This is inherent to the Wiki's model. The encycloped­ia's USP is crowd-sourcing; it allows anybody to make an entry about pretty much anything and it’s designed for easy cross-referencin­g and linking.

The assumption is, eventually, through a process of editing, argument and debate, most of the entries will get fixed. However, when it comes to any hot-button subject, this assumption is probably dubious. The biographic­al entries of politician­s, media persons and celebritie­s are regularly vandalised or “injected” with fake news. There are unending flame wars on topics like the Armenian Genocide, or the Aryan Invasion Theory.

Academics and schoolteac­hers frequently complain about Wikipedia. Students are known to copy-paste entries wholesale from the crowdsourc­ed online encycloped­ia. The more honest among them even cite it as a reference! Academics use it themselves, though secretivel­y, and doctors and pharmacolo­gists have been known to look it up as well.

It is a good place to look up a physics equation, or a mathematic­al theorem, so long as you can ignore the background noise. There are often fierce debates about the actual origin of a theorem, or a scientific theory — ethnic nationalis­ts like to “prove” it was Newton, Leibnitz or Madhava who discovered calculus, for example.

So, the Wiki is an invaluable, if fallible, source of knowledge. It has many multiples more in the way of entries across its 290odd languages than any other encycloped­ic source. Even encycloped­ic aggregator­s, such as Xrefplus and Galenet have less than 4 million entries between them. Moreover, the Wiki is free and that’s not an inconsider­able factor when impecuniou­s students are seeking informatio­n.

As of now, guesstimat­es suggest that around 80-85 per cent of Wiki informatio­n is accurate. That is less than 95 per cent that is considered the gold standard for “normal” encycloped­ias where the entries are written by commission­ed experts and fact- checked by other experts. But 80 per cent is a reasonable baseline for somebody who’s starting to research a topic and it does suggest that crowd-sourcing works, even though it is a long way from perfection.

A new initiative could bump up the Wiki accuracy quotient by several notches. One of the problems with Wiki citations is that the reference is often not available online for primar y viewing. This is especially true in the case of books and scientific papers. Scientific papers are often behind paywalls, with only the abstract available for viewing . A book may not be available online at all, or it may be a paid resource that has to be bought, and in many Wiki citations, there may not even be an ISBN to facilitate search. Not every researcher can afford to pay significan­t sums to simply check if a reference is accurately cited.

The Internet Archive, (sometimes called the Wayback Machine, though that is only one of the Archive's functions) is trying to sort this issue out. It will tr y to scan the relevant pages of every book that is cited in a Wiki reference. This would enable researcher­s to click on the citation and go to the relevant page and check for accuracy. As of now, the archive has done this for 130,000 entries in the English, Arabic and Greek editions. This itself involved scanning 50,000 books.

A new initiative could bump up the Wiki accuracy quotient by several notches. One of the problems with Wiki citations is that the reference is often not available online for primary viewing

 ??  ??

Newspapers in English

Newspapers from India