The quality of the image really depends on the quality of the prompt, and a LOT of cherry picking.
I find that big sleep is also a better model than the one linked here (deep daze), generally.
I’ve generated several hundred images myself and found a few real treasures. Here’s a few of my personal favourites:
“A painting of a murder in the style of Monet” [0]
“A photo of fellas in Paris” [1]
“A painting of Thanos wearing the Infinity Gauntlet in the style of Rembrandt” [2]
I definitely agree that in the general case the examples are underwhelming, but I believe there is a lot of potential here. Personally I’m super excited to unlock the potential of human-guided, AI-assisted creative tooling. Some Colab notebooks let you active explore the latent space of a model to direct the results where you want them to go. As the generate-adjust feedback loop gets tighter we’re gonna see some crazy things.