Image credit: Siyuan via Unsplash. Not AI.
Generative AI services like Midjourney and OpenAI's DALL-E can deliver the unimaginable when it comes to stunning artifacts produced from simple text prompts.
Sketching complex art imagery may be AI's specialty, yet some of the simplest tasks are evidently what AI struggles with the most.
I just want a plain white background
My research colleague and data scientist Cody Nash met with one such encounter when he pondered "Can AI Create a White Painting?"
All Nash wanted from AI was an image of a plain, pure, white background; in color-code lingo, the color #FFFFFF or RGB(255,255,255). Nothing else.
Such an utterly simple task proved to be rather arduous, however, and this prompted him to jot down an entire blog post. Revelations from the little experiment suggest that the ask is anything but simple for AI.
"Let’s start by seeing if an image generation model can produce an image that is nothing but the color white," wrote Nash.
A few text prompts for a plain and pure white background did not have DALL-E 3 delivering exactly what had been asked:
"An image of nothing but white white white white"
"Completely white without any additional features"
"A blank white space"
"An image of pure #FFFFFF pixels"
"A white wall"
A few prompts later, an ask for a "smooth even all white background" had DALL-E presenting the following to Nash. Close, but not white:
"The goal was to see if AI could create a white painting. Not just a pure white image, but something that is the result of some inspiration, idea, or concept," Nash told BleepingComputer.
While speaking to us, the data scientist referred to an old quote, "Good artists borrow, great artists steal," that has been dubiously attributed to Pablo Picasso for ages.
"With AI, it is robbery," continues Nash.
"It is a good thing AI is getting creative early on, so when they are given orders to destroy, there's a chance they'll say no."
"It is not the art I was looking for, but art was found anyway."
Referring to a 1942 philosophical essay The Myth of Sisyphus, Nash states "I imagine all these Syssiphi happy since we made sure they have minimization problems."
BleepingComputer also reproduced the experiment with both Midjourney and DALL-E.
Predictably, our simplistic prompts also had the AI tools generating backgrounds that were not purely white.
DALL-E froze when we asked it for "just a plain white background," whereas Midjourney's definition of a "smooth even all white background" seems to be different from ours.
We observed similar results when asking for backgrounds of different colors too.
While Midjourney would, when asked for a "completely black background," initially start with a nearly blank black slate, as the process progressed (i.e. task more than "15% complete"), abstract objects and patterns would be added on top, leading to more complex art forms:
ChatGPT can't do nothing either
I wanted to extend this experiment to ChatGPT too. Rather than having AI generate simplistic, minimalistic imagery, have the chatbot do nothing.
AI platforms like ChatGPT can accomplish a lot—both good and bad stuff. Much like humans though, AI might struggle when asked to do little or nothing.
For example, is it even possible to ask ChatGPT to just... shut up completely and not respond to a prompt? We tried.
Even when it understands the ask, it still won't say nothing.
Have you experienced such hurdles when having your choice of AI perform an utterly simple ask? Or, perhaps you've found a way around it? Do share in the comments below.