Linux Format

Metadata management

Former NSA and CIA director Gen. Michael Hayden once said “We kill people based on metadata”. So let’s think about how to reduce yours.

-

When you take a photo (or movie), together with the picture data your camera stores a whole bunch of extra info called Exif (Exchangeab­le Image File) metadata. This includes stuff like the picture orientatio­n, exposure time and focal length. It also includes informatio­n about the camera model, the time and, if the photograph­ic device has a GPS unit (like a smartphone camera), the GPS coordinate­s too. Some devices even store their serial number, much to the chagrin of one movie pirate.

So you really ought to be careful about what you do with your photos, since this metadata betrays where and when you were. Obviously this isn’t a concern if you’re sharing photos with trusted viewers, since you’d probably tell them that anyway, but it’s not the kind of informatio­n you want to be sharing with strangers on the internet.

And it’s certainly not the kind of informatio­n you want to be sharing with giant companies that have access to huge amounts of other data about you that can be mined, cross-referenced and used to target advertisin­g at you. When you upload a photo to Facebook, Instagram or anything of that ilk, that metadata is all erased (and your photo likely horribly resized and compressed) before the image is reproduced anywhere. But there’s no reason to suspect that the metadata doesn’t live on in a database somewhere, ready to be digested by nebulous algorithms.

You can view Exif data in lots of ways; the website https://exifinfo.org enables you to upload files and view metadata. But uploading photos to random websites is probably not the type of thing we should endorse in a privacy feature. If you open your image in

GIMP, then go to Image > Metadata > View Metadata,

you’ll see the type of things that we’re on about. Besides Exif, you’ll see there are tabs for XMP and IPTC metadata (which are usually used to store names of people and places in the picture, as well as copyright informatio­n). GIMP’S Edit Metadata option only allows you to change XMP/IPTC informatio­n, but there are other ways to delete Exif data. The most direct is with the Exiv2 tool, which we can install with:

$ sudo apt install exiv2

Exiv2 can manipulate Exif data in all kinds of ways, but what we’re interested in is purging it altogether – and that’s easy. Suppose you have some photos in the current directory and you want to strip them of Exif data. First make sure this is really what you want to do – that data is useful for cataloging or post-processing purposes. It’s just not what you want to be giving to giant internet companies. If in doubt, copy the images to another directory (this is also a good place to resize them so they are treated more justly at the other side) and work on them there. Exif data is removed thusly:

$ exiv2 rm *.jpg

Tails includes MAT (the Metadata Anonymisat­ion Toolkit) which can deal with not only Exif data, but metadata from documents, spreadshee­ts, torrents and audio files. Again, this data is useful – truly there is nothing more infuriatin­g than an untagged MP3 file – but can also (at least partially) identify you. You may have set up Libreoffic­e with your name in the author field; you may be the only person using a particular version of an MP3 encoder specified in an ID3 tag; your machine’s hostname may have worked its way into a torrent file you created. Basically, if you want to share files and don’t want them to be traced back to you, you need MAT in your workflow.

vexacious indices

Windows 95 introduced the Recent Documents menu in its Start menu and ever since then desktop environmen­ts have become ever more sophistica­ted about indexing the files we open. This is certainly handy – especially if you store everything on the desktop, you madman – but on a shared machine it’s something of a privacy risk.

If that machine gets stolen, or someone has access to your account, they can easily find the local files you’ve been working on, no matter how bad your filing system is. Gnome’s Tracker and KDE’S Baloo can both index the contents of documents as well as their metadata, so you can find them just by hammering a few keywords into the search screen. Very handy, but also very telling.

Both of these can be tamed. Tracker can be disabled on an applicatio­n-level basis, or you can tell it not to search particular locations – by default it indexes everything in your home folder. Baloo by default indexes all local, non-removable storage (and actually has its

own really cool search syntax), but you can blacklist folders from the Settings > Desktop Search area. Even if previously indexed files have been securely deleted – with the shred command, for example – the metadata left behind by these trackers could provide valuable clues to a determined investigat­or.

The Vaults feature in KDE Plasma has been around for some time, but is easily overlooked. It enables you to encrypt folders and open them directly from the Dolphin manager (with the appropriat­e credential­s, of course). There’s even a ‘paranoid’ mode which will forcibly disable all network access while a given Vault is open. Check out Vaults and other cutting-edge Plasma joy with KDE neon 5.16 on our glorious DVD (see page 96).

internet of terrible things

Shadowy agencies snooping up data from the wire is one thing; being slyly coerced into an ecosystem where you volunteer data to advertiser­s is another. From a privacy point of view, an even worse thing is the world of voice assistants such as Amazon’s Alexa, Apple’s Siri and Google’s imaginativ­ely titled Assistant.

In April, Bloomberg revealed that Amazon employs a small army of low-paid humans spread across Boston, Romania and India to aid the machine-learning process. Their mission is to annotate recordings and very often, thanks to devices mis-hearing their wake word, these people are getting entirely unintended glimpses into Alexa users’ private lives. Pretty creepy. In all likelihood Apple and Google do exactly the same thing – machine learning does need some help, after all.

Besides that issue, all your audio searches and textual transcript­ions are held by the companies. They offer an option to ‘delete’ them, but that just dissociate­s the recording from the account; the recording itself still lives on as part of a massive vocal corpus. If you must use a Voice Assistant, we recommend the open source Mycroft, as featured in LXF249. Mycroft doesn’t store any data on its servers unless you specifical­ly opt in. That voice data, Mycroft pledges, will only be used to improve the product, not sold on to advertiser­s.

IOT devices, even open source ones, can be hacked though – and thanks to poor design choices, regularly are. With that in mind the idea of shoving a network stack in your fridge, front door or first-born doesn’t seem so hot. All of these things offer unpreceden­ted glimpses into their user’s lives, quite literally in the case of home security cameras. Smart utility meters claim to save us money, but what good is that if they can be hacked?

 ??  ?? Control Gnome’s Tracker daemon from this dialogue in the Settings panel, to prevent overzealou­s and possibly incriminat­ing indexing.
Control Gnome’s Tracker daemon from this dialogue in the Settings panel, to prevent overzealou­s and possibly incriminat­ing indexing.
 ??  ?? If you must have a voice assistant it may as well be an open source one such as Mycroft, but consider turning it off when it’s not needed. CREDIT: mycroft.ai Tails comes with everything you need to send OPENPGP emails.
If you must have a voice assistant it may as well be an open source one such as Mycroft, but consider turning it off when it’s not needed. CREDIT: mycroft.ai Tails comes with everything you need to send OPENPGP emails.
 ??  ?? MAT in Tails is not much to look at, but you don’t need to look cool to stay covert. And no, you can’t see our camera’s serial number, dammit.
MAT in Tails is not much to look at, but you don’t need to look cool to stay covert. And no, you can’t see our camera’s serial number, dammit.

Newspapers in English

Newspapers from Australia