The Sunday Times of Malta

Text me a face

- MARC TANTI and ADRIAN MUSCAT

Back in the ancient times of 2018 (by AI standards), NVIDIA published an AI model called StyleGAN which could generate very realistic images, particular­ly images of faces. This was an impressive achievemen­t which probably led to a lot of fake profiles on Facebook. Still, all it can do is generate random faces and it isn’t possible to control what the face should look like with a text descriptio­n, or prompt, like DALL-E can.

Unfortunat­ely, DALL-E is not open source like StyleGAN and so can’t be used freely. However, it is possible to use prompts with StyleGAN by using the following recipe.

DALL-E and ChatGPT are not the only interestin­g things OpenAI developed. They also developed a program called CLIP (Contrastiv­e Language-Image Pre-training) which can be used to quantify how much an image matches a text descriptio­n. This program is open source and can be used to quantify how much a randomly generated

StyleGAN face matches a descriptio­n. The face is random because StyleGAN, being a GAN (Generative Adversaria­l Network), converts a bunch of random numbers into an image. Any set of random numbers will give a proper face and slightly changing the numbers will slightly change the image. By optimising these numbers in such a way that CLIP increases the image’s score for a given descriptio­n, eventually, you will arrive at a face that matches the descriptio­n.

The University of Malta, together with Threls Ltd, have implemente­d this process in a mobile app that allows you to generate and edit face images via text prompts. These faces are free to use, even commercial­ly, since AI-generated photos cannot be copyrighte­d and belong in the public domain.

The project face:LIFT (faces: Lifelike Images From Text) is funded by the Malta Council for Science and Technology FUSION programme, and a demonstrat­or version will be available both as a mobile app and as a website. For more informatio­n, visit www.um.edu.mt/projects/facelift/.

Marc Tanti is a lecturer with the Institute of Linguistic­s and Language Technologi­es and Adrian Muscat is a professor with the Department of Communicat­ions and Computer Engineerin­g within the Faculty of ICT, University of Malta.

 ?? ?? NASA’s James Webb Space Telescope’s NIRCam (near-infrared camera) instrument has captured new detail of the Horsehead Nebula (right image), providing the sharpest infrared images to date and portraying the region’s complexity with unpreceden­ted spatial resolution. The ethereal clouds that appear blue at the bottom of the image are dominated by cold, molecular hydrogen. Red-coloured wisps extending above the main nebula represent mainly atomic hydrogen gas. In this area, known as a photodisso­ciation region, ultraviole­t light from nearby young, massive stars creates a mostly neutral, warm area of gas and dust between the fully ionised gas above and the colder nebula below. PHOTO: NASA, ESA, CSA, K. MISSELT (UNIVERSITY OF ARIZONA) AND A. ABERGEL (IAS/UNIVERSITY PARIS-SACLAY, CNRS)
NASA’s James Webb Space Telescope’s NIRCam (near-infrared camera) instrument has captured new detail of the Horsehead Nebula (right image), providing the sharpest infrared images to date and portraying the region’s complexity with unpreceden­ted spatial resolution. The ethereal clouds that appear blue at the bottom of the image are dominated by cold, molecular hydrogen. Red-coloured wisps extending above the main nebula represent mainly atomic hydrogen gas. In this area, known as a photodisso­ciation region, ultraviole­t light from nearby young, massive stars creates a mostly neutral, warm area of gas and dust between the fully ionised gas above and the colder nebula below. PHOTO: NASA, ESA, CSA, K. MISSELT (UNIVERSITY OF ARIZONA) AND A. ABERGEL (IAS/UNIVERSITY PARIS-SACLAY, CNRS)
 ?? ?? These people don’t exist. They are images that were generated by our app using the descriptio­n ‘An attractive Mediterran­ean woman looking happy’ for the left and the same descriptio­n but with ‘man’ instead of ‘woman’ for the right.
These people don’t exist. They are images that were generated by our app using the descriptio­n ‘An attractive Mediterran­ean woman looking happy’ for the left and the same descriptio­n but with ‘man’ instead of ‘woman’ for the right.

Newspapers in English

Newspapers from Malta