MAC SOFTWARE
Resolving riddles and restrictions with what you want to run on your Mac Copied links are broken
We resolve restrictions with your Mac apps
QWhen I copy an Internet link containing accented characters from a PDF using Preview, why does it always break, but copies correctly when the document is opened in Acrobat Reader?
AThis is a strange problem which results from an unfortunate coincidence of features in Unicode, PDF and macOS. In Unicode text encoding, most accented characters (and others in non-Roman scripts such as Korean) have two different codes which can produce them. For example, the Nordic letter Å can be assigned the ‘code point’ of an alphabetical character, or that for the symbol for a unit of length, the Ångström. The same character is thus represented by two different code points.
Most PDF don’t store text in Unicode, but use old variants of encoding tables that don’t make any such distinction, and represent both those Unicode forms using a single character.
When macOS converts that character from PDF, it can choose to provide it in either of the Unicode encodings, and almost invariably macOS, and Preview, opts for the form that’s most compatible with file-naming conventions on the Mac, known as Form D. Apps like Acrobat Reader use their own PDF engine, and opt for the other Unicode character, Form C.
While Form D may be better for filenames, the characters most usually entered at the keyboard, so used in Internet links, are Form C. You’ll therefore find copying from Acrobat Reader less troublesome, or you can convert using my free tools Apfelstrudel and unorml from bit.ly/mf343texutil.