Maybe there are straightforward ways to improve the results by spending more compute time. But as presented, it's more "turn one picture into a convincing character in your Sims game" than "turn one picture into an animated character in your next TikTok video".