In Cinema and Machine Vision I write about the concept of computational ekphrasis:

p170

In philosophy, there is a term to describe the intersemiotic correspondence between words and images: ekphrasis, from the Greek ek meaning “out” and phrásis meaning “speak.” In ancient Greece, ekphrasis was taught as a rhetorical technique consisting of rich verbal descriptions designed to conjure vivid images in the reader’s mind—a literary form of description that could “paint a picture with words.” Through ekphrasis, the reader could be prompted to imagine absent or impossible objects, the classic example being Homer’s elaborate description of Achilles’s shield in The Iliad (Heffernan 2004, 10) […] In the wake of generative AI systems, words and images find novel forms of fluid permutation and latency in a shared representational space, albeit in high-dimensional space that is not directly interpretable to humans. We can call this computational ekphrasis: ‘calculation from/out of speech’.

I developed this into a book chapter in an edited volume called The Aesthetics of Machine Vision: Critical Terms and Ideas, edited by Kathrin Maurer, Rikke Munck Petersen, Dominique Routhier, Kassandra Wellendorf, and Kristin Veel, coming out with MIT Press in 2025. And some of these ideas are also in a conference presentation I did for PPIG VL/HCC 2024, see abstract here.

Since I make reference to the classic example of ekphrasis given by Heffernan, and in keeping with designs for previous publications, mine and others, I made an image of the Shield of Achilles to be included in the book:

Synthetic Shield of Achilles ― my humble contribution to ekphrastic art

There are many other renditions of the famous shield, from an illustration in Pope’s translation of the Iliad from 1720 shown below, to Angelo Monticelli’s interpretationfrom 1820, to the 1917 version by John Flaxman, to a contemporary reconstruction (and book) by Kathleen Vail. To my knowledge, the one above is the first one to be AI generated.

Image from page 171 of volume 5 of The Iliad of Homer. Translated by Mr. Pope, by Homer. Original held and digitized by the British Library.

In the book chapter I go into details about why I think ekphrasis is a useful concept to help us make sense a world inundated with synthetic imagery. But I thought some might be curious to know more details about how I made the synthetic Shield of Achilles. I generated the image by passing some of Homer’s original description of the shield to a text-to-image model. Here are the lines I used in my prompt (minus the page numbers) from Wikipedia:

  1. The Earth, sky and sea, the sun, the moon and the constellationsPleiadesHyadesOrion, and Ursa Major (484–89)
  2. “Two beautiful cities full of people”: in one a wedding and a law case are taking place (490–508); the other city is besieged by one feuding army and the shield shows an ambush and a battle (509–40)
  3. A field being plowed for the third time (541–49)
  4. A king’s estate where the harvest is being reaped (550–60)
  5. A vineyard with grape pickers and children (561–72)
  6. A “herd of straight-horned cattle”; the lead bull has been attacked by a pair of savage lions which the herdsmen and their dogs are trying to beat off (573–86)
  7. A picture of a sheep farm (587–89)
  8. A dancing floor where young men and women are dancing and courting (590–606)
  9. The great stream of Ocean (607–609)

I used the FLUX.1 [schnell] model by Black Forest Labs, a 12 billion parameter model running locally in my laptop (8GB VRAM), through ComfyUI. I also used three LORAs to approximate the style of a historical artefact. Besides text above, the prompt has a lot more description including trigger words for the LORAs.

Here is a longer post about interfaces like Comfy where I go into detail about how dragging PNG images into the interface loads the workflows used (under certain conditions). If you really want to recplicate all the workflow, feel free to take my image above.