So you want to generate some awesome anime waifu/husbando with Stable Diffusion, only to discover that prompts like “cute anime girl” or “cool anime guy” don’t produce good results? Well, here is a quick sharing of my experiments with various models, poses, and styles – Hope it helps to save you some time.
TABLE OF CONTENTS
BASICS & BASELINE
Before we touch on the prompts, here are a couple of things that need to be addressed – Especially for absolute beginners.
The “original” Stable Diffusion model is cool, but it is not specialized for anime. So here’s a quick disclaimer – All the images in this guide are generated with a model called “Blood Orange Mix”. I will leave the links below.
P.S. While this guide is based on the flat 2D anime style – The generic prompts in this guide will still apply if you are looking at 3D, cel-shaded, or realistic styles.
The prompts can differ wildly depending on the model that you are using. For example:
- In “Stock Stable Diffusion”, an anime prompt looks something like this –
an angry anime girl eating a book, messy blue hair, red eyes, wearing an oriental dress, in a messy room with many books, trending on artstation, SOME ANIME STUDIO, in XYZ style
- In Waifu Diffusion, Anything, and most anime-trained models, it is a lot more straightforward like this –
1girl, angry, eating a book, messy blue hair, red eyes, oriental dress, indoor, books
To keep things simple, I like to “engineer” my prompts in the following order –
SUBJECT (BOY/GIRL/IT/THEM), ACTION-POSE-EXPRESSION, HAIR-EYES-DRESS-ACCESSORIES, BACKGROUND, STYLES-HINTS.
P.S. The order does matter, which is why the subject and action come first.
Negative prompts make a whole world of difference, don’t skip them… No need to write confusing negative prompts, I usually start with just this –
3D, realistic, (worst quality, low quality:1.4). That’s all. Add more only as required.
STABLE DIFFUSION ANIME PROMPTS
All right, let us now get into the details and examples of anime prompts in stable diffusion.
SUBJECT – AGE, PHYSIQUE, EMOTION
The order of the prompt matters, so I always start with the main subject(s) and action/story.
- Age – I have seen some people define exact ages like
16 years old girland
30 years old man… I think the text inversion pretty much only understood
man. So yep, don’t bother to give exact ages.
- Male –
shota, boy, teen boy, man, elderly man
- Female –
loli, girl, teen girl, woman, elderly woman
- Male –
- Physique –
scrawny body, slim body, thin body, big body, large body, muscular, buff, sexy, gorgeous, elegant
- Emotions & Expressions –
smile, expressionless, sad, cry, angry, embarrassed, maniac, scared, shocked, ominous, ahegao. Nope, no luck with
aqua crying face.
Poses are a bottomless pit… Having some experience with photography definitely helped. I will just give a few “common examples” here:
- Arms/Hands –
hand on face, hand on hip, hand in pocket, hand over head, arms crossed, finger on lip
- Looking –
looking over back/shoulder, looking into distance, looking to at the viewer
- Sitting –
sit with arched back, sit with legs crossed, sit with hand on chin
Just do a “portrait poses” search on the Internet and you will get a ton more… But Stable Diffusion may not understand all of them.
bald– No hair.
shaved– May sometimes give “buzzcut”. It is possible to get a bald girl, but you will need to increase the weight like
hairin the negative prompt.
- Better than the generic “short hair” –
mohawk, bowl cut, messy hair, curly hair, spiky hair.
dreadlocks– Applicable to both boys/girls.
- Hairstyles of the ladies –
pixiecut, bob hair, short hair with long locks, hair bun, braided hair, braided bun, hair drill, pigtail, twin tails, ponytail.
- Bangs and “extras” –
bangs, blunt bangs, crossed bangs, hair over one eye, forehead, ahoge.
Some “specific” hairstyles such as undercut, wolf cut, hime cut, and crew cut don’t really work… I guess it depends on the model that you are working with.
CLOTHING – TOPS & DRESSES
- For the guys… I guess girls can wear all “guy clothes”, so unisex –
t shirt, long sleeve shirt, sleeveless shirt, tank top, jacket, trench coat, high neck top, turtle neck, blazer, overall, coverall, jersey, suit.
- Tops for the girls –
tube top, halter top, crop top, corset top, sports bra, bralette.
- Dresses –
halter dress, babydoll dress, corset dress, gothic dress, victorian dress, kimono, yukata, hanfu, qipao, hanbok.
CLOTHING – BOTTOMS, ACCESSORIES, MATERIALS
- Bottoms –
shorts, pants, jeans, skirt, mini skirt
- “Modifiers” –
leather, denim, lace, frills, pleated, layered, mesh, cutout, embroidery. These will fundamentally change the “original clothes”, for example,
leather jacket, denim skirt, mesh shirt.
- Accessories –
necktie, bowtie, ribbon, choker, necklace, socks, stocking, garters, glasses, headress.
There are a lot more, but I will leave this as it is –
crown, laurel, circlet, watch, armband, hair clip, hair band, ring, piercing, bunny ears, cat ears, dog ears, headphones, smartphone
VIEWS & ANGLES
- Works –
top down view, side view, back view. Can’t seem to get it right, no matter the model –
bottom up view.
- Zoom & Angle –
close up, extreme close up, cowboy shot, mid shot, dutch angle shot, fisheye shot. For those who are not into photography and videography –
- “Cowboy shot” is pretty much from hip to head.
- “Mid shot” for “midrange”.
- “Dutch angle” is the fancy way of saying “tilted angle”.
- “Fisheye” is an ultra-wide angle shot, with a very distinct circular distortion.
bird's eye view, tilt shiftseems to work for landscape, for certain models only.
- Self-explanatory –
bokeh, blurred background.
- Gotchas –
body shotseems to center on the chest area, excluding the head. Good for “aesthetic shots”, but no good if you are doing a “head shot”.
head shotsometimes mistaken to be THE HEADSHOT.
upside downis weird. Just plain weird.
Lastly, we have “style hints” sitting at the end of the prompt. While these are completely optional, I will not recommend sleeping on them… As you can see, they can totally transform “normal anime” into “more like artwork”.
abstract art, graffiti, splatter, pop art, line art, pixel art, limited palette, flat color, polaroid, papercut, outline, film noir, monochrome, ukiyoe, watercolor, pastel color, chinese ink painting, traditional art style.
- There are a lot more, but I shall leave it once again –
cyberpunk, steampunk, atompunk, arcane, dystopian, sci-fi, apocalyptic.
masterpiece, key visualSeems to have some effect on the overall image quality.
best quality, sharp image, professional artworkSeemingly has “little effect”.
intricate, highly detailedAdds more details. But kind of a double-edged sword when the image becomes too busy.
EXTRA BITS & LINKS
That’s all for the guide, and here is a small section on some extras and links that may be useful to you.
So, how do we “put everything together” to create awesome anime pics? Here is an example.
- Prompt :
gorgeous woman, long black hair, blunt bangs, ponytail, red eyes, smile, (peace sign:1.2), intricate oriental dress, clouds, trees, (chinese ink painting, traditional art style:1.4), (limited palette:1.1), masterpiece, best quality, highly detailed
- Negative Prompt :
3D, realistic, (worst quality, low quality:1.4), mutated, missing fingers, extra fingers
Yep, that’s the fun part of AI image generation. Mix and match different styles and see what it creates – “monochrome with red splatters”, “traditional Ukiyoe with watercolor”, and “line art film noir”.
LINKS & REFERENCES
- Orange Mix – Hugging Face
- Female Poses – Digital Photography School
- Need Inspirations?
- Copy the tags from Safebooru (The SFW version of Danbooru)
- Copy prompts from other people – Lexica | Prompthero | MajinAI
Thank you for reading. We have come to the end of this guide, and I hope that it has helped you to level up. If you want to share anything with this guide, please feel free to comment below. Good luck and happy creating!