Wan 2.2 Video Generation
Wan 2.2 Prompting Guide
Wan 2.2 works best when you treat the prompt like a shot direction, not a still-image caption. Think in terms of subject, environment, motion, camera, and visual style.
Best Overall Formula
Subject + environment + action + camera + lighting + style + constraints
Why Wan 2.2 Feels Different
Wan 2.2 is a video model, so it needs motion language and time-based instructions instead of a pile of visual adjectives.
Single Biggest Rule
Do not prompt it like a still image.
Best Order to Write
Who + where + what moves + camera + look + negatives
Prompt Anatomy
1) Subject
Establish the main subject clearly so the model knows what matters most in the shot.
2) Environment
Anchor the subject in a specific place with enough detail to keep the scene coherent.
3) Action
Describe visible motion using verbs and simple time-based beats.
4) Camera
Specify framing, angle, and movement instead of leaving the shot logic vague.
Text-to-Video
For text-to-video, front-load the worldbuilding because there is no source frame to anchor the shot.
Image-to-Video
For image-to-video, focus more on what in the image starts moving, what remains stable, and what the camera should do.
Motion Guidance
Low Motion
- breathing
- blinking
- slight smile
- hair moving in breeze
- small head turn
Medium Motion
- walking slowly
- turning toward camera
- raising a hand
- looking back
- stepping forward
High Motion
- running
- fighting
- crowd chaos
- fast camera movement
- heavy environmental action
Camera Movement
Wan 2.2 usually behaves best when you keep the camera instructions simple and readable.
- slow push-in
- dolly in
- dolly out
- pan left / right
- tilt up / down
- tracking shot
- orbit around subject
- locked-off shot
Lighting and Style
Useful Negatives
- no flicker
- no jitter
- no distorted hands
- no duplicate person
- no warped face
- no extra fingers
Strong Master Template
[What the subject does].
[Secondary motion in the scene].
The camera [specific camera movement and framing].
[Lighting / color / lens / style].
[Constraints / negatives].
Bottom Line
What do we see, what moves, and how does the camera watch it?