Phredreeke, on 06 May 2021 - 07:17 AM, said:
Pixellated faces are also very different from photorealistic images, yet that face model which was shown some pages ago produced rather decent results with pixel art faces from video games.
Or the ESRGAN model you're playing around with here evidently tries to add teeth, eyes and nose in places where they do belong. It makes a lot of errors and produces garbage, but nonetheless it's trying to work in the right direction. It seems not entirely implausible that ESRGAN would learn certain patterns from pairs of vanilla sprites and sebab's higher-res versions and produce something decent or maybe even novel and interesting.
I believe it is worth trying out, provided the new sprites are exactly 2x the original size and someone with a powerful enough GPU for model training is willing to spend the time and effort on it. It might just as well fail, but if the manpower cost of training such a model is not exceedingly high, it seems worth a try.
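To make the "exactly 2x" condition concrete, here is a rough sketch of how the sprite pairs could be screened before anyone spends GPU time on them. The function names and the (name, low-res size, high-res size) tuple layout are my own assumptions for illustration, not part of any ESRGAN tooling; sizes are plain (width, height) tuples that you would read from the image files yourself.

```python
def is_exact_scale(lr_size, hr_size, scale=2):
    """Return True if hr_size is exactly `scale` times lr_size in both dimensions."""
    lr_w, lr_h = lr_size
    hr_w, hr_h = hr_size
    return hr_w == lr_w * scale and hr_h == lr_h * scale


def split_pairs(pairs, scale=2):
    """Split (name, lr_size, hr_size) tuples into usable and rejected sprite names.

    The tuple layout is a hypothetical convention chosen for this sketch.
    """
    usable, rejected = [], []
    for name, lr_size, hr_size in pairs:
        if is_exact_scale(lr_size, hr_size, scale):
            usable.append(name)
        else:
            rejected.append(name)
    return usable, rejected
```

Any pair that ends up in the rejected list would need to be cropped or padded by hand, or simply left out of the training set.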
Phredreeke, on 06 May 2021 - 07:17 AM, said:
It's not a bad idea per se but there are already several ESRGAN models that can make decent upscales of the same kind as you're suggesting here. My entire point is that maybe if the source and target material differ more, the AI will learn to improvise in some consistent manner.