Not OP, but I agree with what you said, so I did it. I ran a single img2img pass to clean up the whole image, keeping the anime style since I assume that was intentional, then iterated a few times on the hands and once on the hat to make it less ridiculous.
Then did a second img2img to make a photorealistic version.
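For anyone who would rather script this than click through a UI, here is a rough sketch of the two passes using the Hugging Face `diffusers` library. Assumptions, since none of this was stated above: the strength values, the step count, the default model ID, the `build_passes` and `run_passes` helper names, and feeding the cleaned-up image into the photorealistic pass are all mine, and the front end actually used here may have been something else entirely. The per-region fixes on the hands and hat would be inpainting and are not covered.

```python
from typing import List, NamedTuple


class Img2ImgPass(NamedTuple):
    """One img2img pass. All default values are illustrative guesses."""
    prompt: str
    strength: float  # 0..1; higher lets the model change more of the input
    steps: int = 30


def build_passes() -> List[Img2ImgPass]:
    # Pass 1: gentle cleanup that keeps the anime style.
    # Pass 2: stronger pass that pushes the result toward photorealism.
    base = "blonde steampunk girl at beach in bikini"
    return [
        Img2ImgPass(prompt=base + ", anime style", strength=0.4),
        Img2ImgPass(prompt=base + ", photorealistic", strength=0.6),
    ]


def run_passes(init_image_path: str,
               passes: List[Img2ImgPass],
               model_id: str = "stabilityai/stable-diffusion-xl-base-1.0"):
    # Heavy imports live here so the settings above can be inspected
    # without torch or diffusers installed.
    import torch
    from diffusers import AutoPipelineForImage2Image
    from diffusers.utils import load_image

    # Assumes a CUDA GPU; swap model_id for whichever SDXL checkpoint you use.
    pipe = AutoPipelineForImage2Image.from_pretrained(
        model_id, torch_dtype=torch.float16
    ).to("cuda")

    image = load_image(init_image_path)
    outputs = []
    for p in passes:
        # Each pass starts from the previous pass's output (my assumption).
        image = pipe(
            prompt=p.prompt,
            image=image,
            strength=p.strength,
            num_inference_steps=p.steps,
        ).images[0]
        outputs.append(image)
    return outputs
```

Calling `run_passes("original.png", build_passes())` (the file name is a placeholder) would return the cleaned-up anime image followed by the photorealistic one. Lower the strength if a pass drifts too far from the original composition.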
Some notes of my own for OP: I don't know what you're using to generate, but your results don't seem great. Here are my first ten, totally unmodified results for "blonde steampunk girl at beach in bikini, anime style", and here are my first ten for the same prompt with "photorealistic" instead. Pick a model that looks good at https://civitai.com/ and read some of the prompts people use there. Also, AI still gets hands wrong a lot of the time, especially if they're holding something like a gun, but it can be pretty good at faces and figures.
I feel like AI physically can't understand guns because it's referring to so many drawings where the artists don't understand guns either.
Then you see it make a gun and it just baffles the shit out of you.
I think the way it works is that it broadly understands that a "gun" has a "gun stock", "gun barrel", "grip", "magazine", etc., and it has a vague idea of where those parts belong in relation to each other, but it doesn't truly understand the concept. It's similar to how it clearly understands that hands have fingers but has no idea how many fingers should go in the finger space of a hand. It's the same reason you get extra arms sometimes.
What image AI are you using?
Stable diffusion with JuggernautXL.
System specs?
I'm using a 3090, but that is definitely not a requirement. Other system specs don't matter much.
Beautiful work. (compared to what I've been able to do at least)
So those are goggles lmao. I wondered what those round things on her hat were supposed to be.