That's a cool visualisation of what kind of visual input you can feed into the process with ControlNet.
And it really makes it clear that what AI images is good for if communicating a general idea. I think comparing AI generated or Assisted images or videos to photography is probably the closest analogous medium we have, but I think AI images are stort if in-between that and more classical art. You have more control over the more technical aspects of the image, as you can alter those things with big strokes, but you've given up too much control to really infused it with artistic intent. Even when photography, where you are generally limited by reality, you can better infused artistic intent into the picture, because you carefully examine what makes that object of the picture unique. Even if you try to direct AI models, it limit their scope they will always add whether the most average expression of what they're adding, because that what it looks for in the training; the commonalities/averages of whatever it was trained on.
Even ControlNet is just a way to claw back a little more control over the process. I wouldn't actually call the examples I've seen of ControlNet to be examples of fine control. I'm struggling to find a way to clearly communicate it, but it's like the difference between 3D art that is trying to look like 2D, and actual 2D. There's always something lost in the translation.
Most artistic disciplines are their own language, and I just don't think we have a way to communicate that language without actually doing the art, and art requires artistic intent, which I don't think is possible with the current AI tools. Maybe it will be at some point, but artistic intent and control over the process are so interconnected that the balance becomes very difficult.
I mean, the hostility is entirely understandable. The current form of generative art is meant to replace artists. It is part of what is currently devastating peoples livelihoods, although I think some companies and clients are already learning that it currently leads to lower overall quality, due to how much harder it is to implement changes based on feedback. It lowers the overall quality bar, although it does have the potential to raise the floor a little. The larger models that are causing this hype are quite literally trained on the work of unwilling artists.
It is the most disrespectful and clearly ethically wrong basis to build it on, and it really begs the question of whether the ends justify the means. Beyond that, art is just not an area where we need AI. It largely hurts artists, is super energy demanding so it actively hurts the environment for no real benefit.
The energy would be so much better used solving actual problems, so more people could spend time doing things they enjoy. If some people enjoy AI generation, then that's fine but I think it shouldn't replace a passionate, skill-based workforce.