• aubertlone@lemmy.world
    8 months ago

    Keep up the good work

    I’m finally upgrading to a nice rig next month, so image generations will take 1-2 seconds instead of the 26 they take now…

      • aubertlone@lemmy.world
        8 months ago

        It’s on its way.

        Can I brag for just a second?

        I got a new job in August and have been saving. I’m getting a Lenovo Legion 9i pro with the 4090 upgrade.

        It’s in the mail already.

        I’ll share that on my old rig, images generated in about TWENTY-SIX seconds…

        It was a little painful to experiment with that lag time. Thanks again for sharing knowledge!!

    • BlaTRUS@lemmynsfw.com
      8 months ago

      So far this is pretty much the only thing I’ve wanted to mess with that my AMD GPU is keeping me from lol.

      I think it’s possible to run Stable Diffusion on AMD cards; it’s just more complicated, and I haven’t had the drive to try setting it up.

    • Cavendish@lemmynsfw.comOP
      8 months ago

      No ControlNet or inpainting. Everything was generated in one go with a single prompt. I’ll sometimes use regional prompts to set zones for the head and torso (usually the top 40% is where the head goes and the bottom 60% is for the torso/outfit). But even when I have regional prompting turned off, it will still generate a 3/4 / cowboy shot.

      I assume you pulled the prompt out of one of my images? If not, you can feed them into pngchunk.com. Here’s the general format I use with regional prompting:

      *scene setting stuff*
      ADDCOMM
      *head / hair description*
      ADDROW
      *torso/body/pose*
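
A minimal sketch of how the format above could be assembled in a script, assuming the three sections are kept as plain strings. The function names `build_regional_prompt` and `row_boundaries`, and the 768-pixel image height, are hypothetical and not from the post; only the `ADDCOMM` / `ADDROW` keywords and the 40/60 split come from the text.

```python
def build_regional_prompt(common: str, rows: list[str]) -> str:
    """Join a shared section and per-row sections with ADDCOMM / ADDROW."""
    if not rows:
        raise ValueError("need at least one regional row")
    parts = [common.strip(), "ADDCOMM"]
    for i, row in enumerate(rows):
        if i > 0:
            parts.append("ADDROW")  # separator between consecutive rows
        parts.append(row.strip())
    return "\n".join(parts)


def row_boundaries(height: int, percents: list[int]) -> list[tuple[int, int]]:
    """Convert row percentages (e.g. 40, 60) into (top, bottom) pixel ranges."""
    total = sum(percents)
    bounds = []
    top = 0
    running = 0
    for p in percents:
        running += p
        bottom = round(height * running / total)
        bounds.append((top, bottom))
        top = bottom
    return bounds


prompt = build_regional_prompt(
    "scene setting stuff",
    ["head / hair description", "torso/body/pose"],
)
print(prompt)

# Assumed image height of 768 px, split 40% head / 60% torso.
print(row_boundaries(768, [40, 60]))
```

Keeping the sections separate like this makes it easy to swap out the outfit row while leaving the scene and head rows alone.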
      

      The LoRAs in the top (common) section are weighted pretty low, 0.2-0.3, because they get repeated/multiplied in each of the two regional rows. So I think in the end they’re effectively around 0.6-0.8.

      prompt example
      photo of a young 21yo (Barbadian Barbados dark skin:1.2) woman confident pose, arms folded behind back, poised and assured outside (place cav_rdrguarma:1.1),
      (Photograph with film grain, 8K, RAW DSLR photo, f1.2, shallow depth of field, 85mm lens),
      masterwork, best quality, soft shadow
       (soft light, color grading:0.4)
      
      ADDCOMM
      
      sunset beach with ocean and mountains and cliff ruin in the background ,
      (amethyst with violet undertones hair color in a curly layers style:1.2),
       perfect eyes, perfect skin, detailed skin
      
      ADDROW
      
      choker ,
      (pea green whimsical unicorn print bikini set:1.1) (topless:1.3) cameltoe (undressing, panty pull:1.4) 
      (flat breast, normal_nipples :1.4),
      (tan lines, beauty marks:0.6)
      (SkinHairDetail:0.8)
       
      

      It may be that you’re not describing the clothing / body enough? My outfit prompts are pretty detailed, so I think that goes a long way for Stable Diffusion to determine how to frame things.
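
On pulling the prompt out of an image without a website: a rough sketch, assuming the image is a PNG and the generating UI stored the prompt in a `tEXt` chunk (the keyword `parameters` used in the usage note below is an assumption about the UI, not something stated in this thread). It uses only the standard library and does not handle `iTXt` or compressed `zTXt` chunks, which some tools use instead.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict[str, str]:
    """Return the keyword -> text pairs stored in a PNG's tEXt chunks."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, data, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt payload is "keyword\0text", both Latin-1 encoded.
            keyword, _, text = body.partition(b"\x00")
            texts[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length
    return texts
```

Usage would look something like `read_png_text_chunks(open("image.png", "rb").read()).get("parameters")`, where `image.png` is a placeholder path. If that returns nothing, the metadata may have been stripped on upload or stored in a different chunk type.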