Discussion about this post

User's avatar
Eric Antonow's avatar

This is great. I'm curious about the criteria of 'a description a human could have given the image'. It begs the questions for me, 'Where does description end and storytelling begin?' I wonder about these tools being able to imagine (what a word!) a story from an image.

Expand full comment
Jon Wagner's avatar

I'm curious about how these AI tools treat different genres of written expression. For example, what would you get if you ask for a "caption," a "description," an "encyclopedia entry" or a "critical review"? Or, to play the Cindy Sherman card, you asked instead for that particular kind of caption she referenced in her "movie still" work. Going a little farther afield, how would it handle a request for an "oral response" to a photo from a child, adult or professional photographer? To my mind, these questions all get at what AI can do in framing searches and constructions for known and unknown audiences. And if it could do well enough at that, specifying the audiences--for a given caption or comment or description, as well as for a particular visual image or material--might be one of it's more powerful applications.

Expand full comment
3 more comments...

No posts