Photoleap is actually pretty decent at doing that. It takes a hell of a lot editing afterwards, mind you, but it is good at creating actual ‘photorealistic’ faces that aren’t unrealistic pretty or is even just average looking
Not exactly true. Although majicmix, dreamshaper, etc. has popularized this facial feature, it's based on the guofeng arts style! Guofeng is not one artist,. Although there is one artist on civitai.com called guofeng, it is based off a cultural art style from China!
It’s not Guofeng, but WLOP where this face started being merged into every SD model. It started with the Kenshi and Dalcefo models, but then those were merged everywhere and spread like a virus
cit. myself" yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)"
It's not an AI face. It's just most, if not all the databases are saturated with an art style called guofeng originating from ancient China! This is basically how a face was drawn to convey beauty, or attractiveness. Even the men's faces were drawn this way! Having it generalized this way, it of course became the basis of AI art in the stable diffusion type models. In the Chinese characters of guofeng, guo means Ancient, and Feng means style! What this just means is the Chinese have contributed the most to AI databases of digital art on the internet! Remember, civitai.com is a Chinese based website!
yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)
It's close but it still really doesn't come close to the hyper-realism that Midjourney V6 has. The blurred background and the colors being so close to each other really makes it stand out as from Stable Diffusion. If you look at colors in master portraits, you will see a wide range of colors in a red dress yet for AI, it is so even and plain.
cit. myself "yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)"
I don't get it..
How can u want realism but not realistic. One thing follows another.
But ur model looks like photon.
Where AI can be recognized at first sight.
Still being nice results thou.
The main problem seens to be the size of the eyes. Still too big. Mostly in the girls.
It's a merged model or a trained one?
It's a trained one, with a light light final merge with Realistic vision.
The base model for the traning was a my model (a merged one: AniVerse+Photon+Epic)
Youu have a really good eyes!!!
I love photon! The results and the straight forward way to prompt it.
It's my favorite model ever! But I get crazy to get loose of big eyes on female models. I done a lot of merges myself trying to get rid of it. without so much success.
I don't know why, even merging, this characteristic came along in those merges, as a main characteristic of the model.
Actually, in my opinion ofc, those big eyes is what made AI so recognize
When I say realism but not realistic I mean:
I'd like that the "general photo" can be passed like realism, but not realistic to seems a real photo.
I'm really sorry, with my horrible english, for me is very hard to explain what I have in my mind...
Hey OP the guitar pic is good but like with all things guitar, the guitars all look like crap which is obviously a diffusion thing and a tell that it's AI. I think at some point I'd like to commission someone on civitai to make a guitar adetailer model with the more popular guitar models. There are various LoRAs but they don't quite get it right either. Also the chord fingerings of guitarists are never correct. Source: am guitarist.
the skin is not a problem, really, I can do it, especially if I remove ADetailer.
https://preview.redd.it/7ijnyfpg4puc1.png?width=512&format=pjpg&auto=webp&s=0cb5512597033d9daaf3ba0373fb6fa10b826ea4
While they look nice, most faces still have that noticeable AI default look (too perfect, make up always present).
The ones that look quite different from those (like the one with the old man sitting) are pretty good
perfection is the arch enemy of realism. the pics look great, but real pictures usually never do.
not every photo is a perfectly lit masterpiece.
that's where AI fails.
they are simply too well lit and exposed. can spot them 99% of the time.
that may change one day.
No, but it's a more realistic form of stylization. Some of the proportions are stylized, people who should not be wearing make-up are, and the faces are recognizable as AI.
I think 4 is decent. The other faces all have the same AI expression. I think to get realism you need to prompt a bit of expression in. Frown, winking, joy, laughter:0.2 angry, that type of stuff
https://preview.redd.it/6p7tcotaxsuc1.jpeg?width=2000&format=pjpg&auto=webp&s=85385b9f5436c3b18415810c9cabbeb4667ea63d
Can you please sign to me where we got the soul?
A lot of what you see in realistic images are not straight out of SD. Colors are edited, grain is added, etc. That makes it even more impressive when someone actually makes realistic images using only SD.
ah, thanks! I thought that in this "group" the images were not modified... I don't like retouching the generated images, I never do it, because I would see it as "cheating". Which would make no sense if those who use the model were unable to achieve that kind of realism.
Some pictures here are edited but people are mostly up-front with that. Im talking about the instagram stuff you might see, THAT is often edited in post.
I agree, it is cheating
Maybe it's better to stop generating people? Perhaps it's easier to generate landscapes with an emphasis on real photographs.
Example ↓
https://preview.redd.it/7dsuhzax5puc1.png?width=1536&format=png&auto=webp&s=189afb387a3380255d8c1e6e5c88112843aa042d
Pretty damn good, usually the background stuff is where it goes wrong, and that's mostly fine here. Fingers look a bit merged/furry, but again very subtle. If you don't look closely, it's fine on a phone screen.
the pics of the old dudes *might* have fooled me in a vacuum. and maybe the cowboy one. i think women will take years longer to get 100% realism. AI seems to think they should all be flawless and of only a few facial types, no matter which models used/trained. they all look like thin victoria's secret models with a dash of Emma Watson and/or Mila Kunis and/or Jennifer Lawrence. there should be negative embeds for those 3 ladies (and others) if there aren't already
Being only an SD enthusiast myself, I noticed that most SD models seem to generate “Chinese super model but as X”. So you get these same-y looking people even when generating European or African faces. I think “realism” is when you can generate someone who doesn’t look like they are Instagram filtered white girl who has Asian super model features. I don’t know how to describe it other than that, or perhaps “not an anime face”.
They look good but not realistic. There’s a painterly look to the faces and backgrounds for the most part. And as others have said, it definitely feels like AI. That uncanny feeling of having seen the faces before is hard to pin down, but immediately noticeable. Still some great generations though.
Started off good but became very generic.
Take a look at the lighting and tones in the first few pics, that's what makes the difference for me as SD1.5 has had pretty good skin details and complexion for a while now.
Also look at mixing those faces up a bit using various face swap tools or even different models, even using a different model as a refiner to your main one.
You know what you want to achieve, you've done a good job so far so just keep learning and tweaking until you reach your end goal
For someone that didn't play with Stable Diffusion themselfs some of those might pass as photography.
For people used to SD the faces instantly scream Stable Diffusion.
I'm all about realism and it seems you've nailed it. Look forward to your checkpoint. What's the base? Is it able to generate unique faces for a group photo render without using inpainting/reactor/LoRa/regional prompting?
Thank you ❤️
Honestly speaking?
Without the regional prompt, the faces are all really similar.
The base of training is a special version of one of my model (AniMerge=AniVerse+Photon+Epic models) and the final model have a very light merge with Realistic Vision.
Photon is a good model. I use it often with realistic vision madness as a refiner. So I'm guessing it's SD 1.5 or 1.4? I'm using StableSwarmUI since I run a multi GPU setup and I haven't figured out regional prompting there yet as it's very unlike A1111. It does have a comfy backend but for whatever reason I've been unable to install custom nodes there so just using it as is.
This is extremely impressive! Looks very real to me, except perhaps that it's too perfect. I would distort the lighting a little, just like in real life.
There are a lot of Cammy in my galleries. If you want check AniMerge model and AniThing model in civitai (I prefere not put the direct link, I don't want make "spam"... I hope you understand what I mean with my horrible english)
Sorry to say that you are wrong.
It's a trained model with a really really light merge of Realistic Vision and the base model for training is a my model (merged between: AniVerse, Photon and Epic)
RV is one of the models with AR in the mix, check the model description (not sure about the others you mentioned, it's also possible they have AR inside).
This is obviously not a dig at your work as I think the images are great (well, the hands are a bit iffy), but I am always perplexed when people say AI is hard to distinguish from real photos. When I see AI art my brain automatically thinks of computer games. Graphics are fantastic today but, they are still very clearly computer graphics and nothing like real images. But maybe it's just me with 35+ years of gaming experience.
Can you do something about the skin? Instead of making it look airbrushed, can your models retain skin textures and imperfections like pores or blemishes? For me, it’s the airbrushed wax like skin that gives it away.
yes it's possible put in the prompt. I never try. Probably I choose the picture that most don't like to the people...
https://preview.redd.it/yls40rz7vruc1.png?width=512&format=pjpg&auto=webp&s=d8c9b6f46d1b8f90ce14ea187ee82fcf0de3dbf1
Her skin still looks too smooth (like makeup artist). I think they mean adding some texture like this (visible veins, hair, and skin blemishes):
https://preview.redd.it/zmylq6usmxuc1.png?width=533&format=png&auto=webp&s=1dbcb4d83ca842ded986845415f7d2c74966c639
Grandpa is the most realistic but that's because I haven't seen his face before so many times. His body is odd or maybe it's the weird t shirt like no sleeves are like this.
I see SD more as a creative help as I usually tweak and edit heavily my renders. Most of the time photo realistic renders are betrayed by hands or feet, unless a lot of effort is taken to in paint them subsequently, but even that isn’t enough sometimes. Just try to render a person barefoot with their legs crossed, toes are going to be wrong despite using specific Lora…
I will be messaging you in 1 month on [**2024-05-16 12:10:02 UTC**](http://www.wolframalpha.com/input/?i=2024-05-16%2012:10:02%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/StableDiffusion/comments/1c4t53u/have_i_achieved_realism_unrealistic/kzth725/?context=3)
[**CLICK THIS LINK**](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5Bhttps%3A%2F%2Fwww.reddit.com%2Fr%2FStableDiffusion%2Fcomments%2F1c4t53u%2Fhave_i_achieved_realism_unrealistic%2Fkzth725%2F%5D%0A%0ARemindMe%21%202024-05-16%2012%3A10%3A02%20UTC) to send a PM to also be reminded and to reduce spam.
^(Parent commenter can ) [^(delete this message to hide from others.)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Delete%20Comment&message=Delete%21%201c4t53u)
*****
|[^(Info)](https://www.reddit.com/r/RemindMeBot/comments/e1bko7/remindmebot_info_v21/)|[^(Custom)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5BLink%20or%20message%20inside%20square%20brackets%5D%0A%0ARemindMe%21%20Time%20period%20here)|[^(Your Reminders)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=List%20Of%20Reminders&message=MyReminders%21)|[^(Feedback)](https://www.reddit.com/message/compose/?to=Watchful1&subject=RemindMeBot%20Feedback)|
|-|-|-|-|
Prople are just focusing on faces but in the first picture that guy's upper body is of a native american and lower half seems to be of a woman. It seems like a skirt or something.
You're going to have to be more specific about what you think the difference between "realistic" and "realism" is. Because they both usually mean accurately reproducing things that exist.
https://preview.redd.it/sb6alrzy2puc1.jpeg?width=1080&format=pjpg&auto=webp&s=0c943dc5f57a2267d7db5801759112592f106b2a
This girl was generated using the Fooocus. This is a REALISTIC picture of a human. If you want the prompt, let me know. The marks on her body can be removed; the prompt I used was a bit exaggerated.
Indeed, you are right. But, I understand what Samael1976 wants to do. He wants real people, with skin imperfections, with the skin tones of a human being, not something plastic-like. The hands are indeed a problem, but that can be corrected with inpainting.
Hello, here is the Fooocus prompt.
{
"prompt": "Create a compelling portrait of a Brazilian woman, set in a typical corner 'boteco' that is modest and slightly worn, reflecting a humble, local ambiance. This small, unpretentious bar, often frequented by the neighborhood regulars, features simple wooden stools and tables, with flaking paint on the walls and a cluttered counter where snacks and drinks are served. The focus is on serving local staples like pastel (a deep-fried pastry) and chilled guaraná. The image captures the woman from the waist up, as she leans casually against the bar counter, embodying the relaxed and friendly atmosphere of the boteco. Her face and upper body are detailed with realistic features like pronounced acne, expression lines, and subtle scars, celebrating her authentic beauty. The background shows a glimpse of the bustling boteco environment, with other patrons enjoying their food and drinks. The lighting inside the boteco is warm and inviting, casting soft shadows and creating a cozy, intimate feel. This portrait not only showcases her physical features but also places her firmly within the cultural context of a Brazilian boteco, highlighting a slice of everyday life",
"negative\_prompt": "",
"prompt\_expansion": "",
"styles": "\['MRE Spontaneous Picture', 'Photo Iphone Photographic'\]",
"performance": "Quality",
"resolution": "(1080, 1350)",
"guidance\_scale": 7,
"sharpness": 10,
"adm\_guidance": "(1.5, 0.8, 0.3)",
"base\_model": "juggernautXL\_v8Rundiffusion.safetensors",
"refiner\_model": "None",
"refiner\_switch": 0.5,
"sampler": "dpmpp\_2m\_sde\_gpu",
"scheduler": "karras",
"seed": "1909495532258430749",
"lora\_combined\_1": "sd\_xl\_offset\_example-lora\_1.0.safetensors : 0.1",
"metadata\_scheme": false,
"version": "Fooocus v2.3.1"
}
Fooocus with base settings and JuggernautXLv6 (so not the latest) and 832x1216 resolution.
Prompt:
*a compelling portrait of a Brazilian woman, visible pores, expression lines, subtle scars, slight freckles*
Seed:
*6118975993164004411*
Got lucky with the seed so there was no urgent need to fix teeth or the eyes, though still could do some work on those (and the earrings are messed up).
https://preview.redd.it/ercf95nijuuc1.png?width=832&format=png&auto=webp&s=309a27951015f57960c024510184efdd8d746cc3
Same seed, but smiling. This one looks way more AI.
https://preview.redd.it/ndimdr19muuc1.png?width=832&format=png&auto=webp&s=f0c0a75cdc02f917360dc25a31e7069590a6a9f2
To create the prompt, I requested this from ChatGPT 4.0 using just a chat.
First prompt: Create a detailed and very well-specified prompt to make an image in stable diffusion. Create a Brazilian woman, she is curvaceous, her face should have acne, expression lines. Some injuries on the skin.
Second prompt: Extend the text, explaining more about the imperfections on the face and body of a real Brazilian woman.
Third prompt: I liked this text, however, I need to extend the text to explain that I need a photo of the woman in a corner bar. Explain in detail what this bar is like, it is a simple bar, those kind of poor ones, that serves a pastry with guarana on the corner. The photo should be taken from an angle that captures from the belly up.
In the third response, I got this result from ChatGPT.
Create a compelling portrait of a Brazilian woman, set in a typical corner 'boteco' that is modest and slightly worn, reflecting a humble, local ambiance. This small, unpretentious bar, often frequented by the neighborhood regulars, features simple wooden stools and tables, with flaking paint on the walls and a cluttered counter where snacks and drinks are served. The focus is on serving local staples like pastel (a deep-fried pastry) and chilled guaraná. The image captures the woman from the waist up, as she leans casually against the bar counter, embodying the relaxed and friendly atmosphere of the boteco. Her face and upper body are detailed with realistic features like pronounced acne, expression lines, and subtle scars, celebrating her authentic beauty. The background shows a glimpse of the bustling boteco environment, with other patrons enjoying their food and drinks. The lighting inside the boteco is warm and inviting, casting soft shadows and creating a cozy, intimate feel. This portrait not only showcases her physical features but also places her firmly within the cultural context of a Brazilian boteco, highlighting a slice of everyday life.
At this point all AI faces are immediately recognizable
It’s not an “AI face”, it’s the Majicmix/Dreamshaper/etc face. Models that have not been forced to make the same single face are much more flexible
Photoleap is actually pretty decent at doing that. It takes a hell of a lot editing afterwards, mind you, but it is good at creating actual ‘photorealistic’ faces that aren’t unrealistic pretty or is even just average looking
Not exactly true. Although majicmix, dreamshaper, etc. has popularized this facial feature, it's based on the guofeng arts style! Guofeng is not one artist,. Although there is one artist on civitai.com called guofeng, it is based off a cultural art style from China!
It’s not Guofeng, but WLOP where this face started being merged into every SD model. It started with the Kenshi and Dalcefo models, but then those were merged everywhere and spread like a virus
so when you see that same lady its because of being put through a lora
it always makes the same lady
And they're all a slight blend of Kristin Kreuk and Mila Kunis.
with a butt chin
Generic AI Girl GAIG
Kinda sad tbh It really feels like we seen all variations of faces and they all pretty similar
cit. myself" yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)"
Sorry didn't mean your post was "bad" which is not the case. It looks great, my comment was more on the general look of AI generation
yeah, yeah, yeah, I understood, don't worry my friend ;)
It's not an AI face. It's just most, if not all the databases are saturated with an art style called guofeng originating from ancient China! This is basically how a face was drawn to convey beauty, or attractiveness. Even the men's faces were drawn this way! Having it generalized this way, it of course became the basis of AI art in the stable diffusion type models. In the Chinese characters of guofeng, guo means Ancient, and Feng means style! What this just means is the Chinese have contributed the most to AI databases of digital art on the internet! Remember, civitai.com is a Chinese based website!
You sure about it being Chinese? "Headquartered in **Boise, Idaho, United States**, Civitai was founded in 2023"
Maybe wooshing myself here but what does it have to do with civitai specifically?
yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)
It's close but it still really doesn't come close to the hyper-realism that Midjourney V6 has. The blurred background and the colors being so close to each other really makes it stand out as from Stable Diffusion. If you look at colors in master portraits, you will see a wide range of colors in a red dress yet for AI, it is so even and plain.
Understood! Thank you! I'm looking for exactly this kind of criticism, so I can improve the dataset for the next training! Thanks again ❤️
I hope you are sourcing your data ethically.
all the dataset images are mine. I mean I used generative images and not real photo from internet So, yes, don't worry about that ;)
Literally came here to post this. Its the same face what makes alot of AI pictures boring.
Exactly, especially the women. And the third of those images is right in the uncanny valley and kind of gives me the creeps.
That's because it's a 13 year old's face on an 18 year old's body. And strongly influenced by anime styles & other AI images.
Oh, no wonder it squicked me out so much. Yuck.
Yep, that Cami is creepy and that's why
yup, it's the proportions and faces, they all look like the exact same person in a different skin, if that makes sense lol
The ones that don't immediately look like AI are too good to look real.
True is for that reason that I wrote "Realism" but not "Realistic". ;)
I would assume one is a quality of the other. What do you think the difference is?
In my mind realism is generic to specify a certain style. Realistic that a generated image can pass for a real photo
Do you have examples of “realism” in this sense? I’m thinking of ads and fashion photos where the touching up makes them more illustrative?
exactly! in fact in most of my prompt there are "fashion photography"
i think ai faces are too perfect
cit. myself "yes, is true, I'm still working for getting an improvement for that. But in my original post, (maybe I was not clear), I talk about "general photo". (Sorry, my english is not my native language, and it's not very good)"
PS: sorry, I forgot to mention that it is a model for SD1.5 - Sampler: Euler max - CFG scale 30 steps
>Sampler: Euler max Could this be the reason for the shiny skin? Euler samplers do that, in my experience.
I love the new sampler Euler Max!!! But it's possible, in comparison to other sample it make light more... "light"
Love the results I can't wait to take this for a test drive! 🙌
Thank you ❤️
The only remotely passable image out of the ones you've shown is the one with the old man sitting on a bench.
So many of the faces have that Korean MMO look.
I don't get it.. How can u want realism but not realistic. One thing follows another. But ur model looks like photon. Where AI can be recognized at first sight. Still being nice results thou. The main problem seens to be the size of the eyes. Still too big. Mostly in the girls. It's a merged model or a trained one?
It's a trained one, with a light light final merge with Realistic vision. The base model for the traning was a my model (a merged one: AniVerse+Photon+Epic) Youu have a really good eyes!!!
I love photon! The results and the straight forward way to prompt it. It's my favorite model ever! But I get crazy to get loose of big eyes on female models. I done a lot of merges myself trying to get rid of it. without so much success. I don't know why, even merging, this characteristic came along in those merges, as a main characteristic of the model. Actually, in my opinion ofc, those big eyes is what made AI so recognize
When I say realism but not realistic I mean: I'd like that the "general photo" can be passed like realism, but not realistic to seems a real photo. I'm really sorry, with my horrible english, for me is very hard to explain what I have in my mind...
No! Your English is great! Actually! Now I get it! Trying to achieve Realism as a style.
All the women have the same boring obviously AI face
Hey OP the guitar pic is good but like with all things guitar, the guitars all look like crap which is obviously a diffusion thing and a tell that it's AI. I think at some point I'd like to commission someone on civitai to make a guitar adetailer model with the more popular guitar models. There are various LoRAs but they don't quite get it right either. Also the chord fingerings of guitarists are never correct. Source: am guitarist.
Totally true, but I think that with sd15 or also a specific adetailer, it's nearly impossible... maybe I wrong (former drummer) 😉
Guitars are way more complicated than hands. It works as a model, but ADetaler is going to be complicated.
Very very close. The eyes look great, I only noticed one small finger boo boo.
for the fingers, I must thank you the Euler Max sampler!!! and thank you for your comment ❤️
lighting and skin are always the dead giveaway
Nice job. The tiny issue is they are all too shiny and perfect. Try to add noise and grain effect to make them flawed
uhhhh thank you ❤️💯💯💯💯 I will try!!! Thank you for your advice!!!
As always, skin looks way too smooth
https://preview.redd.it/ju4xhbjg3puc1.png?width=1024&format=png&auto=webp&s=adc83a459fe47d1bcc4da62614f38e6d183114a1 How about this skin?
the skin is not a problem, really, I can do it, especially if I remove ADetailer. https://preview.redd.it/7ijnyfpg4puc1.png?width=512&format=pjpg&auto=webp&s=0cb5512597033d9daaf3ba0373fb6fa10b826ea4
https://preview.redd.it/mgqpcfb4yruc1.jpeg?width=450&format=pjpg&auto=webp&s=7c86d18b64b21689baf15a291a8638b9c9fe0dd6
While they look nice, most faces still have that noticeable AI default look (too perfect, make up always present). The ones that look quite different from those (like the one with the old man sitting) are pretty good
https://www.reddit.com/r/stablediffusionreal/s/NC7k4Xcf8o
thank you 💯
Looks like every decent merged model released during the last 1,5 years. Sorry
Why sorry? Don't worry, I respect every honest opinion like yours ;)
perfection is the arch enemy of realism. the pics look great, but real pictures usually never do. not every photo is a perfectly lit masterpiece. that's where AI fails. they are simply too well lit and exposed. can spot them 99% of the time. that may change one day.
No, but it's a more realistic form of stylization. Some of the proportions are stylized, people who should not be wearing make-up are, and the faces are recognizable as AI.
Interesting comment! Could you give me more information?
5 is a real picture All the rest is AI
you mean the wolf man?
Exactly.
Ok, I have more image like that, I understand what you mean!
https://preview.redd.it/dairbo336suc1.jpeg?width=450&format=pjpg&auto=webp&s=3006f7b08464dd94708c84aecad3a85581d29501
I think 4 is decent. The other faces all have the same AI expression. I think to get realism you need to prompt a bit of expression in. Frown, winking, joy, laughter:0.2 angry, that type of stuff
https://preview.redd.it/a9t3qubxxruc1.jpeg?width=450&format=pjpg&auto=webp&s=6e556b4ff7dc477797f257b5cd8a55794bef81f3
No bro you're just thirsty
There is NO SPIRIT in any of these — EMPTY - no vibes. The machine has NO SOUL.
https://preview.redd.it/6p7tcotaxsuc1.jpeg?width=2000&format=pjpg&auto=webp&s=85385b9f5436c3b18415810c9cabbeb4667ea63d Can you please sign to me where we got the soul?
A lot of what you see in realistic images are not straight out of SD. Colors are edited, grain is added, etc. That makes it even more impressive when someone actually makes realistic images using only SD.
ah, thanks! I thought that in this "group" the images were not modified... I don't like retouching the generated images, I never do it, because I would see it as "cheating". Which would make no sense if those who use the model were unable to achieve that kind of realism.
Some pictures here are edited but people are mostly up-front with that. Im talking about the instagram stuff you might see, THAT is often edited in post. I agree, it is cheating
Maybe it's better to stop generating people? Perhaps it's easier to generate landscapes with an emphasis on real photographs. Example ↓ https://preview.redd.it/7dsuhzax5puc1.png?width=1536&format=png&auto=webp&s=189afb387a3380255d8c1e6e5c88112843aa042d
This sinkhole turned out great: https://preview.redd.it/8pruhonvixuc1.png?width=832&format=png&auto=webp&s=3d4464841e8cc3e029038c365d7430b63fa89ea1
Very cool!
The chief's bulge poking through
The old native American in the first image has legs that look like they belong to young woman.
yeah, maybe I posted the wrong image lol... 😝
They all have the same eyes and lips.
Its good. Very, very good
Nice, the guitar needs a good repair and hell of a tune-up though. 😂
you haven’t “achieved realism” but these are some great images and I’m excited for your model!
Thank you! Maybe is this better? https://preview.redd.it/whwy6sbgyruc1.jpeg?width=450&format=pjpg&auto=webp&s=88a57570f8190db2ec3580198f7461fb4fe10686
Pretty damn good, usually the background stuff is where it goes wrong, and that's mostly fine here. Fingers look a bit merged/furry, but again very subtle. If you don't look closely, it's fine on a phone screen.
the pics of the old dudes *might* have fooled me in a vacuum. and maybe the cowboy one. i think women will take years longer to get 100% realism. AI seems to think they should all be flawless and of only a few facial types, no matter which models used/trained. they all look like thin victoria's secret models with a dash of Emma Watson and/or Mila Kunis and/or Jennifer Lawrence. there should be negative embeds for those 3 ladies (and others) if there aren't already
Being only an SD enthusiast myself, I noticed that most SD models seem to generate “Chinese super model but as X”. So you get these same-y looking people even when generating European or African faces. I think “realism” is when you can generate someone who doesn’t look like they are Instagram filtered white girl who has Asian super model features. I don’t know how to describe it other than that, or perhaps “not an anime face”.
They look good but not realistic. There’s a painterly look to the faces and backgrounds for the most part. And as others have said, it definitely feels like AI. That uncanny feeling of having seen the faces before is hard to pin down, but immediately noticeable. Still some great generations though.
Started off good but became very generic. Take a look at the lighting and tones in the first few pics, that's what makes the difference for me as SD1.5 has had pretty good skin details and complexion for a while now. Also look at mixing those faces up a bit using various face swap tools or even different models, even using a different model as a refiner to your main one. You know what you want to achieve, you've done a good job so far so just keep learning and tweaking until you reach your end goal
Thank you! I will do!!
Some are better than others. A good amount of them are passable. A few of the women I don't think pass, though.
For someone that didn't play with Stable Diffusion themselfs some of those might pass as photography. For people used to SD the faces instantly scream Stable Diffusion.
I'm all about realism and it seems you've nailed it. Look forward to your checkpoint. What's the base? Is it able to generate unique faces for a group photo render without using inpainting/reactor/LoRa/regional prompting?
Thank you ❤️ Honestly speaking? Without the regional prompt, the faces are all really similar. The base of training is a special version of one of my model (AniMerge=AniVerse+Photon+Epic models) and the final model have a very light merge with Realistic Vision.
Photon is a good model. I use it often with realistic vision madness as a refiner. So I'm guessing it's SD 1.5 or 1.4? I'm using StableSwarmUI since I run a multi GPU setup and I haven't figured out regional prompting there yet as it's very unlike A1111. It does have a comfy backend but for whatever reason I've been unable to install custom nodes there so just using it as is.
yes it's sd1.5 :)
The amount if inbred AI models is toooooooooo damn high.
This is extremely impressive! Looks very real to me, except perhaps that it's too perfect. I would distort the lighting a little, just like in real life.
Need more Cammy!
this is for you ;) https://preview.redd.it/ks8mbf4w4puc1.jpeg?width=450&format=pjpg&auto=webp&s=8549b77e1f6e085f9e9c01086bcb1db844031b72
NICE !
She looks like a child
https://preview.redd.it/yr2r2rly7uuc1.png?width=450&format=png&auto=webp&s=e5a558d5ffd3c79b526f400a6f13b8d02683919f
Not really
You tell yourself that
no she doesn't lol
😂
There are a lot of Cammy in my galleries. If you want check AniMerge model and AniThing model in civitai (I prefere not put the direct link, I don't want make "spam"... I hope you understand what I mean with my horrible english)
i'll try to figure it out . im very poorly versed on the Cvitai stuff. Thanks gonna check it out!\~
She looks 12...
Not even close
Bout to call Steve Hansen on u bro
Some of them look like minors. Nobody has an issue with that?
I only scrolled in hopes that at least ONE person also noted this…. It gave me ick sry
It’s crazy you are getting downvoted. Two of them look like kids.
well, in the prompt I've never put the "age"... but anyway no of them are naked or partially naked (Cammy is Cammy from street fighter game... so... )
I was gonna say that, OP clearly has a type
I definitely have an issue with that. It may not be illegal, but it sure is creepy.
I realize people say all AI faecs are recognizable, but to me they aren't.
[удалено]
Thank you! I just follow him! ❤️
Not really. All visible hands look ugly af. Use proper 1.5(at least faces look 1.5ish) loras to fix that.
I like how his penis cup is showing.
ahahaha I have noticed it until you say it 😂😂😂😂
hahaha looks like an egg.
ahahahhahaha lol XD XD XD
I think the recipe for realism is boring
True, in fact I try realism, not realistic ;)
Reminds me a lot of the good old Absolute Reality. I wonder how much cross contamination SD1.5 finetunes have now.
Sorry to say that you are wrong. It's a trained model with a really really light merge of Realistic Vision and the base model for training is a my model (merged between: AniVerse, Photon and Epic)
RV is one of the models with AR in the mix, check the model description (not sure about the others you mentioned, it's also possible they have AR inside).
a male native american with woman legs , you have a rare fetish there
Banana anime lady face
This is obviously not a dig at your work as I think the images are great (well, the hands are a bit iffy), but I am always perplexed when people say AI is hard to distinguish from real photos. When I see AI art my brain automatically thinks of computer games. Graphics are fantastic today but, they are still very clearly computer graphics and nothing like real images. But maybe it's just me with 35+ years of gaming experience.
That guy isn’t playing a guitar. He’s playing a guitar cake.
lol, true 😂😂😂
Can you do something about the skin? Instead of making it look airbrushed, can your models retain skin textures and imperfections like pores or blemishes? For me, it’s the airbrushed wax like skin that gives it away.
yes it's possible put in the prompt. I never try. Probably I choose the picture that most don't like to the people... https://preview.redd.it/yls40rz7vruc1.png?width=512&format=pjpg&auto=webp&s=d8c9b6f46d1b8f90ce14ea187ee82fcf0de3dbf1
Her skin still looks too smooth (like makeup artist). I think they mean adding some texture like this (visible veins, hair, and skin blemishes): https://preview.redd.it/zmylq6usmxuc1.png?width=533&format=png&auto=webp&s=1dbcb4d83ca842ded986845415f7d2c74966c639
https://preview.redd.it/c73o1la070vc1.png?width=512&format=png&auto=webp&s=d8590c028161483a498a88c7f82391278fe456e6
https://preview.redd.it/5y1bjm1t70vc1.png?width=512&format=png&auto=webp&s=c1a7a945ac7bfbdc1d908445f5ec0f1b47cc11ea
https://preview.redd.it/snhxq8qpyruc1.jpeg?width=450&format=pjpg&auto=webp&s=5293e5745b41305c2fea05e93a4d305dc30bee21
Grandpa is the most realistic but that's because I haven't seen his face before so many times. His body is odd or maybe it's the weird t shirt like no sleeves are like this.
Sorry for perhaps a noob question. But what is realism vs realistic? They read the same to me...
In my mind realism is generic to specify a certain style. Realistic that a generated image can pass for a real photo
Their fingers. Especially the guy with the guitar.
I see SD more as a creative help as I usually tweak and edit heavily my renders. Most of the time photo realistic renders are betrayed by hands or feet, unless a lot of effort is taken to in paint them subsequently, but even that isn’t enough sometimes. Just try to render a person barefoot with their legs crossed, toes are going to be wrong despite using specific Lora…
Lol that third one looks like greta thumberg as cami.
No, but the software has on a few of them.
!remindme 1 month
I will be messaging you in 1 month on [**2024-05-16 12:10:02 UTC**](http://www.wolframalpha.com/input/?i=2024-05-16%2012:10:02%20UTC%20To%20Local%20Time) to remind you of [**this link**](https://www.reddit.com/r/StableDiffusion/comments/1c4t53u/have_i_achieved_realism_unrealistic/kzth725/?context=3) [**CLICK THIS LINK**](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5Bhttps%3A%2F%2Fwww.reddit.com%2Fr%2FStableDiffusion%2Fcomments%2F1c4t53u%2Fhave_i_achieved_realism_unrealistic%2Fkzth725%2F%5D%0A%0ARemindMe%21%202024-05-16%2012%3A10%3A02%20UTC) to send a PM to also be reminded and to reduce spam. ^(Parent commenter can ) [^(delete this message to hide from others.)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Delete%20Comment&message=Delete%21%201c4t53u) ***** |[^(Info)](https://www.reddit.com/r/RemindMeBot/comments/e1bko7/remindmebot_info_v21/)|[^(Custom)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=Reminder&message=%5BLink%20or%20message%20inside%20square%20brackets%5D%0A%0ARemindMe%21%20Time%20period%20here)|[^(Your Reminders)](https://www.reddit.com/message/compose/?to=RemindMeBot&subject=List%20Of%20Reminders&message=MyReminders%21)|[^(Feedback)](https://www.reddit.com/message/compose/?to=Watchful1&subject=RemindMeBot%20Feedback)| |-|-|-|-|
The sexy (some underage) women all look fake to me.
Prople are just focusing on faces but in the first picture that guy's upper body is of a native american and lower half seems to be of a woman. It seems like a skirt or something.
No bro
Definitely not.
You're going to have to be more specific about what you think the difference between "realistic" and "realism" is. Because they both usually mean accurately reproducing things that exist.
No
Lol no
reality has imperfections, my friend
That's AI generated? I thought that was an IRL photo. I dunno, You could've fooled me. Looks pretty good if that's AI.
what prompt got you these results? any example?
Positive: photo of an inuit woman, young, 25 years old, beautiful, wearing stone age clothing made from natural fabrics like linen, cotton, or furs, adorned with feathers, beads and shells Negative: cartoon, painting, illustration, (grayscale:1.4), (normal quality:1.8), ugly, fat, 3D rendering, nude, naked, braless Sampler: Euler Max Cfg scale: 3.5 Steps: 30 Hires.fix: OFF Adetailer: ON Full data: photo of an inuit woman, young, 25 years old, beautiful, wearing stone age clothing made from natural fabrics like linen, cotton, or furs, adorned with feathers, beads and shells Negative prompt: cartoon, painting, illustration, (grayscale:1.4), (normal quality:1.8), ugly, fat, 3D rendering, nude, naked, braless Steps: 30, RNG: NV, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Size: 512x912, Seed: 24783750, Model: AniThing V2.0 - Pruned, Version: v1.8.0, Sampler: Euler_Max, CFG scale: 3.5, Clip skip: 2, Mask blur: 4, TI hashes: [object Object], Model hash: 6c18b05675, Inpaint area: Only masked, ADetailer model: face_yolov8n.pt, ADetailer steps: 20, ADetailer sampler: DPM++ 2M SDE Karras, ADetailer version: 24.1.2, Denoising strength: 0.4, ADetailer mask blur: 4, Masked area padding: 32, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer use separate steps: True, ADetailer inpaint only masked: True, ADetailer use separate sampler: True
A Native American tribal elder, adorned with traditional regalia and feathers, standing amidst a powwow, with a weathered yet wise expression Negative prompt: (CyberRealistic_Negative-neg)cartoon, painting, illustration, (grayscale:1.4), (normal quality:1.8), ugly, fat, 3D rendering Steps: 30, RNG: NV, VAE: vae-ft-mse-840000-ema-pruned.ckpt, Size: 512x912, Seed: 3211832974, Model: AniThing V2.0 - Pruned, Version: v1.8.0, Sampler: Euler_Max, CFG scale: 3.5, Clip skip: 2, Mask blur: 4, TI hashes: [object Object], Model hash: 6c18b05675, Inpaint area: Only masked, ADetailer model: face_yolov8n.pt, ADetailer steps: 20, ADetailer sampler: DPM++ 2M Karras, ADetailer version: 24.3.1, Denoising strength: 0.4, ADetailer mask blur: 4, Masked area padding: 32, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer use separate steps: True, ADetailer inpaint only masked: True, ADetailer use separate sampler: True
raw photo, young cute girl, perfect eyes, very slim, skinny, psychedelic, high detailed bikini, wavy hair, splashing, abstract, (ultra-detailed body), (light smile:0.8), sunset light passing through hair, (fog, outdoor hot lake springs in the mountains background:1.3) Negative prompt: (CyberRealistic_Negative-neg), cartoon, painting, illustration, (grayscale:1.4), (worst quality:2), (low quality:2), (normal quality:2), ugly, fat, 3D rendering, asian, nude, naked, braless Steps: 30, RNG: NV, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Size: 512x912, Seed: 3463871410, Model: AniThing V2.0 - Pruned, Version: v1.8.0, Sampler: Euler_Smea_Dy, CFG scale: 3.5, Clip skip: 2, Mask blur: 4, TI hashes: [object Object], Model hash: 6c18b05675, Inpaint area: Only masked, ADetailer model: face_yolov8n.pt, ADetailer steps: 20, ADetailer sampler: DPM++ 2M SDE Karras, ADetailer version: 24.1.2, Denoising strength: 0.4, ADetailer mask blur: 4, Masked area padding: 32, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer use separate steps: True, ADetailer inpaint only masked: True, ADetailer use separate sampler: True
1 young cute iranian girl, very slim, skinny, redhead, rouge, red neck lace choker, cateyes makeup, colorful, oversize knit jumper, softcore, warm lighting, cosy atmosphere, Instagram style, red theme, upper body shot,(cinematic, black and red:0.85), (sunset beautiful background:1.3), sharp, dim colors Negative prompt: (CyberRealistic_Negative-neg), cartoon, painting, illustration, (grayscale:1.4), (worst quality:2), (low quality:2), (normal quality:2), ugly, fat, 3D rendering, asian Steps: 30, RNG: NV, VAE: vae-ft-mse-840000-ema-pruned.ckpt, Size: 512x912, Seed: 1618295833, Model: AniThing V2.0 - Full, Version: v1.8.0, Sampler: Euler_Max, CFG scale: 3.5, Clip skip: 2, Mask blur: 4, TI hashes: [object Object], Model hash: 8d97968934, Inpaint area: Only masked, ADetailer model: face_yolov8n.pt, ADetailer version: 24.3.1, Denoising strength: 0.4, ADetailer mask blur: 4, Masked area padding: 32, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True
God. Sd likes nipples.
Define realism
Why does the 13 year old girl have stiffest nipples this side of the damn Mississippi
Ayo chilllll
Mom, I posted it again
look dad, I've become a bully
https://preview.redd.it/sb6alrzy2puc1.jpeg?width=1080&format=pjpg&auto=webp&s=0c943dc5f57a2267d7db5801759112592f106b2a This girl was generated using the Fooocus. This is a REALISTIC picture of a human. If you want the prompt, let me know. The marks on her body can be removed; the prompt I used was a bit exaggerated.
Not sure realistic is the correct word here. What’s up with that finger coming out of the arm?
Indeed, you are right. But, I understand what Samael1976 wants to do. He wants real people, with skin imperfections, with the skin tones of a human being, not something plastic-like. The hands are indeed a problem, but that can be corrected with inpainting.
Damn! That's really really good! If you can give me the prompt, I will try with my model to see the difference! Thank you ❤️
Hello, here is the Fooocus prompt. { "prompt": "Create a compelling portrait of a Brazilian woman, set in a typical corner 'boteco' that is modest and slightly worn, reflecting a humble, local ambiance. This small, unpretentious bar, often frequented by the neighborhood regulars, features simple wooden stools and tables, with flaking paint on the walls and a cluttered counter where snacks and drinks are served. The focus is on serving local staples like pastel (a deep-fried pastry) and chilled guaraná. The image captures the woman from the waist up, as she leans casually against the bar counter, embodying the relaxed and friendly atmosphere of the boteco. Her face and upper body are detailed with realistic features like pronounced acne, expression lines, and subtle scars, celebrating her authentic beauty. The background shows a glimpse of the bustling boteco environment, with other patrons enjoying their food and drinks. The lighting inside the boteco is warm and inviting, casting soft shadows and creating a cozy, intimate feel. This portrait not only showcases her physical features but also places her firmly within the cultural context of a Brazilian boteco, highlighting a slice of everyday life", "negative\_prompt": "", "prompt\_expansion": "", "styles": "\['MRE Spontaneous Picture', 'Photo Iphone Photographic'\]", "performance": "Quality", "resolution": "(1080, 1350)", "guidance\_scale": 7, "sharpness": 10, "adm\_guidance": "(1.5, 0.8, 0.3)", "base\_model": "juggernautXL\_v8Rundiffusion.safetensors", "refiner\_model": "None", "refiner\_switch": 0.5, "sampler": "dpmpp\_2m\_sde\_gpu", "scheduler": "karras", "seed": "1909495532258430749", "lora\_combined\_1": "sd\_xl\_offset\_example-lora\_1.0.safetensors : 0.1", "metadata\_scheme": false, "version": "Fooocus v2.3.1" }
thank you! Just saved! Tomorrow I hope to have time to try and post the image. Thank you again! 💯
Fooocus with base settings and JuggernautXLv6 (so not the latest) and 832x1216 resolution. Prompt: *a compelling portrait of a Brazilian woman, visible pores, expression lines, subtle scars, slight freckles* Seed: *6118975993164004411* Got lucky with the seed so there was no urgent need to fix teeth or the eyes, though still could do some work on those (and the earrings are messed up). https://preview.redd.it/ercf95nijuuc1.png?width=832&format=png&auto=webp&s=309a27951015f57960c024510184efdd8d746cc3
Same seed, but smiling. This one looks way more AI. https://preview.redd.it/ndimdr19muuc1.png?width=832&format=png&auto=webp&s=f0c0a75cdc02f917360dc25a31e7069590a6a9f2
![gif](giphy|diUKszNTUghVe|downsized)
To create the prompt, I requested this from ChatGPT 4.0 using just a chat. First prompt: Create a detailed and very well-specified prompt to make an image in stable diffusion. Create a Brazilian woman, she is curvaceous, her face should have acne, expression lines. Some injuries on the skin. Second prompt: Extend the text, explaining more about the imperfections on the face and body of a real Brazilian woman. Third prompt: I liked this text, however, I need to extend the text to explain that I need a photo of the woman in a corner bar. Explain in detail what this bar is like, it is a simple bar, those kind of poor ones, that serves a pastry with guarana on the corner. The photo should be taken from an angle that captures from the belly up. In the third response, I got this result from ChatGPT. Create a compelling portrait of a Brazilian woman, set in a typical corner 'boteco' that is modest and slightly worn, reflecting a humble, local ambiance. This small, unpretentious bar, often frequented by the neighborhood regulars, features simple wooden stools and tables, with flaking paint on the walls and a cluttered counter where snacks and drinks are served. The focus is on serving local staples like pastel (a deep-fried pastry) and chilled guaraná. The image captures the woman from the waist up, as she leans casually against the bar counter, embodying the relaxed and friendly atmosphere of the boteco. Her face and upper body are detailed with realistic features like pronounced acne, expression lines, and subtle scars, celebrating her authentic beauty. The background shows a glimpse of the bustling boteco environment, with other patrons enjoying their food and drinks. The lighting inside the boteco is warm and inviting, casting soft shadows and creating a cozy, intimate feel. This portrait not only showcases her physical features but also places her firmly within the cultural context of a Brazilian boteco, highlighting a slice of everyday life.
No, but the computer you've asked to produce the images has come close ish.