It’s mostly bias in the training data. Most people don’t post mediocre photos of themselves online, so models rarely see them. Most models are also fine-tuned specifically to avoid that kind of output because people don’t want it.
Out of focus is easy for most base models, but getting an average-looking person is harder.
I’d usually add things to the prompt that you’d expect to find in a more casual scenario, like “smartphone” at half weight, or “video”, or maybe “Facebook”. Basically, meta information you think attaches to more casual photos. Maybe even add “photo”.
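If it helps, here is a rough sketch of how those casual tags could be assembled into a prompt. It assumes your UI accepts the `(term:weight)` emphasis syntax used by the AUTOMATIC1111 web UI; other tools weight terms differently, so check yours. The `build_prompt` helper and the example subject are made up for illustration.

```python
def build_prompt(subject, casual_tags):
    """Join a subject with (tag, weight) pairs into one prompt string.

    Assumes the "(term:weight)" emphasis syntax; tags with weight 1.0
    are left unwrapped because that is the default emphasis.
    """
    parts = [subject]
    for tag, weight in casual_tags:
        if weight == 1.0:
            parts.append(tag)
        else:
            # :g drops trailing zeros, so 0.5 stays "0.5" and 1.50 becomes "1.5"
            parts.append(f"({tag}:{weight:g})")
    return ", ".join(parts)


# Hypothetical subject; the tags are the ones suggested above.
prompt = build_prompt(
    "photo of a person at a kitchen table",
    [("smartphone", 0.5), ("Facebook", 1.0), ("video", 1.0)],
)
print(prompt)
```

Keeping the tags as a list makes it easy to swap one in or out and rerun with the same seed to see which of them actually pushes the result toward a casual look.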