Seedance 2.1 Prompt Guide

Master the art of prompting to create stunning AI-generated videos. This guide covers prompt techniques, multimodal references, and real-world examples for Seedance 2.1 (also applicable to Seedance 2.0 and Seedance 2.0 Fast).

01 General Tips

1.1 Basic Prompt Formula

Seedance follows natural-language instructions closely, so you can combine the following elements freely to suit your needs.

Subject (Required)

The logical foundation of your prompt: clearly define WHO is performing WHAT action.

Motion (Required)

Describe movement, pacing, and action flow so the model knows how the scene unfolds.

Environment (Optional)

Describe the spatial background, lighting details, or a specific visual style to set the overall tone.

Aesthetics (Optional)

Color palette, film grain, lens style, or art direction to shape the look and feel.

Camera (Optional)

Use camera choreography (dolly, orbit, tracking, crane) for immersive audiovisual output.

Audio (Optional)

Ambient sound, dialogue, music mood, or voiceover direction for richer results.
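
The six elements above can be assembled programmatically before a prompt is submitted. The following is a minimal sketch, not part of any Seedance API: `PromptParts` and `build_prompt` are hypothetical names, and joining the elements as separate sentences is an assumption about layout, since the guide only says the elements can be combined flexibly.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PromptParts:
    """Hypothetical container for the six elements of the basic formula."""
    subject: str                       # required: who is doing what
    motion: str                        # required: how the scene unfolds
    environment: Optional[str] = None  # optional: background, lighting, style
    aesthetics: Optional[str] = None   # optional: palette, grain, lens style
    camera: Optional[str] = None       # optional: dolly, orbit, tracking, crane
    audio: Optional[str] = None        # optional: ambience, dialogue, music


def build_prompt(parts: PromptParts) -> str:
    """Join the non-empty elements into one natural-language prompt."""
    if not parts.subject.strip() or not parts.motion.strip():
        raise ValueError("subject and motion are required")
    ordered = [parts.subject, parts.motion, parts.environment,
               parts.aesthetics, parts.camera, parts.audio]
    # Drop missing elements and trailing periods, then rejoin as sentences.
    sentences = [p.strip().rstrip(".") for p in ordered if p and p.strip()]
    return ". ".join(sentences) + "."


prompt = build_prompt(PromptParts(
    subject="A barista pours latte art into a ceramic cup",
    motion="The pour is slow and steady, ending with a gentle swirl",
    camera="The camera dollies in slowly to a close-up of the cup",
))
print(prompt)
```

Keeping the two required elements as mandatory fields means an incomplete prompt fails early, before any generation request is made.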

1.2 Multimodal Reference Control

Beyond text descriptions, you can also supply reference materials to pin down the look you want. Seedance accepts images, audio, and video as references.

  • Clearly specify references in your prompt — e.g., "use the composition from Image 1" or "follow the action from Video 2".
  • The model extracts the core features of each reference and combines them with your text, so the output stays faithful to the references while still following your description.
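
Because references are named in the prompt by label, it can help to check that every label mentioned actually corresponds to an uploaded file. The sketch below is an illustrative helper, not a Seedance feature: `find_references` and `missing_references` are hypothetical names, and the check assumes labels follow the "Image N" / "Video N" pattern used throughout this guide.

```python
import re

# Matches labels such as "Image 1" or "Video 2" as used in this guide.
REF_PATTERN = re.compile(r"\b(Image|Video)\s+(\d+)\b")


def find_references(prompt: str) -> set[tuple[str, int]]:
    """Return every (kind, number) label mentioned in the prompt."""
    return {(kind, int(n)) for kind, n in REF_PATTERN.findall(prompt)}


def missing_references(prompt: str, uploaded: dict[str, int]) -> list[str]:
    """List labels used in the prompt that have no matching upload.

    `uploaded` maps a kind ("Image" or "Video") to how many files of
    that kind were uploaded.
    """
    missing = []
    for kind, n in sorted(find_references(prompt)):
        if n < 1 or n > uploaded.get(kind, 0):
            missing.append(f"{kind} {n}")
    return missing


prompt = "Use the composition from Image 1 and follow the action from Video 2."
print(missing_references(prompt, {"Image": 1, "Video": 1}))  # ['Video 2']
```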

02 Text in Video

2.1 Slogan / Title Text

[Text Content] + [Appearance Timing] + [Position] + [Appearance Method] + [Text Style (color, font)]. If you omit the text style, Seedance can automatically match an appropriate one based on context.
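
The formula above can be sketched as a small template function. This is an illustration only: `slogan_clause` is a hypothetical helper, and the word order it produces is one reasonable phrasing rather than a required syntax.

```python
from typing import Optional


def slogan_clause(text: str, timing: str, position: str,
                  method: str = "appears", style: Optional[str] = None) -> str:
    """Build a sentence that places on-screen text according to the formula.

    text     -- the exact words to show
    timing   -- when the text shows up, e.g. "After the scene gradually blurs"
    position -- where it shows up, e.g. "in the center of the screen"
    method   -- how it shows up, e.g. "appears" or "fades in"
    style    -- optional color/font description; omitted if None
    """
    clause = f'{timing}, the text "{text}" {method} {position}'
    if style:
        clause += f", {style}"
    return clause + "."


print(slogan_clause(
    text="Joy is in Seedance",
    timing="After the scene gradually blurs",
    position="in the center of the screen",
))
```

The resulting sentence can be appended to a scene description such as the one in the example that follows.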

Animated Slogan with Product

Reference Input: Image 1 (Seedance-branded fried chicken box)

Prompt

Hand-drawn comic style, three people sitting together eating the fried chicken from Image 1, the atmosphere is friendly and joyful, then the scene gradually blurs, and the text "Joy is in Seedance" appears in the center of the screen.

2.2 Subtitles

Subtitles appear at the bottom of the screen with the content "...", synchronized with the audio rhythm.

Narrated Landscape with Subtitles

Reference Input: Image 1 (night sky and mountain landscape)

Prompt

Generate a video with voiceover narration. A deep, calm male voice says: "In the grand universe, our world is but a fleeting moment. Yet within it, life thrives against all odds." The scene should slowly transition from night to dawn, with stars gradually fading and the sun rising behind the mountains. Subtitles appear at the bottom of the screen following the narration.

Office Conversation with Subtitles

Reference Input: Image 1 (two people chatting at a cafe table)

Prompt

The two people in the image are chatting in an office. The woman speaks first, saying: "You always arrive just on time — do you enjoy that feeling of cutting it close?" The man laughs and responds: "I have my own rhythm." The dialogue is casual and natural, with subtitles appearing at the bottom of the screen matching each line.

2.3 Speech Bubbles

[Character] says: "...", speech bubbles appear around the character with the dialogue text.

Campus Running Scene with Bubbles

Reference Input: Image 1 (two people walking in a hallway)

Prompt

The two people from Image 1 are wearing sportswear and running on a school track. The girl looks at the boy and says confidently with a smile: "We can definitely do it!" Speech bubbles appear around each speaking character with the corresponding dialogue.

Strawberry Farm Scene with Bubble

Reference Input: Image 1 & Image 2 (character front and side portraits)

Prompt

Referencing the girl's appearance from Image 1 and Image 2, the girl is in a strawberry garden, picks a strawberry, takes a bite, and says with a smile: "This is the real deal!" A speech bubble appears around the girl with the dialogue text.

03 Image Reference

3.1 Multi-angle Subject Reference

Reference / Extract / Combine + [Image N]'s [Subject], generate [Scene Description], maintaining consistent [Subject] features.
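
The template above can be filled in mechanically when the same subject is photographed from several angles. The sketch below is a hypothetical helper (`join_labels` and `subject_reference_prompt` are not Seedance functions); it only shows how the bracketed slots map to a finished sentence.

```python
ALLOWED_VERBS = {"Reference", "Extract", "Combine"}


def join_labels(numbers: list[int]) -> str:
    """Turn [1, 2, 3] into 'Image 1, Image 2, and Image 3'."""
    labels = [f"Image {n}" for n in numbers]
    if not labels:
        raise ValueError("at least one image is required")
    if len(labels) == 1:
        return labels[0]
    if len(labels) == 2:
        return " and ".join(labels)
    return ", ".join(labels[:-1]) + ", and " + labels[-1]


def subject_reference_prompt(numbers: list[int], subject: str, scene: str,
                             verb: str = "Reference") -> str:
    """Fill the multi-angle subject reference template."""
    if verb not in ALLOWED_VERBS:
        raise ValueError(f"verb must be one of {sorted(ALLOWED_VERBS)}")
    return (f"{verb} the {subject} from {join_labels(numbers)}, "
            f"generate {scene}, keeping the {subject}'s features consistent.")


print(subject_reference_prompt(
    [1, 2, 3], "woman", "a scene of her eating cake at a coffee shop"))
```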

3C Digital Product

Reference Input: Image 1, 2, 3 (Seedance camera: front, angle, and rear views)

Prompt

Extract the camera from Image 1, Image 2, and Image 3, replace the background with white. The camera sits on a white table, the lens focuses on the camera in close-up, then slowly rotates around the camera as the main subject.

Household Items

Reference Input: Seedance thermos product images (standalone and hand-held views)

Prompt

The background is a warm-toned home scene. A medium shot shows the thermos bottle from the reference image. The camera smoothly pushes in to a close-up, then a hand naturally enters the frame from off-screen, gently grips the bottle and lifts it. The camera follows as the hand slightly rotates to showcase the product.

Character Reference

Reference Input: Image 1, 2, 3 (woman character: front, side, and back views)

Prompt

Reference the woman's appearance from Image 1, Image 2, and Image 3, generate a scene of her eating cake at a coffee shop.

3.2 Multi-image Reference

Reference / Extract / Combine / Follow / Generate + [Image N]'s [Referenced Element], generate [Scene Description], maintaining consistent features.

Logo Reference

Reference Input: Image 1 (Seedance logo) & Image 2 (cyberpunk character)

Prompt

The background is a neon-lit futuristic urban sky corridor. Reference the girl from Image 2, then the scene gradually blurs, and the Logo from Image 1 appears. Overall style is 3D cyberpunk sci-fi animation.

Multi-subject Reference

Reference Input: Image 1 & Image 2 (white cat and Samoyed dog photos)

Prompt

Reference the cat and dog from the images. In a cozy apartment, the dog is lying down eating dog food. The cat walks over and extends a paw to touch the dog. The dog stops eating when it sees the cat, and the cat snuggles up beside the dog. The scene uses warm color tones.

Multi-element Reference

Reference Input: Image 1–5 (character, outfit, boy, diner, and logo)

Prompt

The scene is set inside the restaurant from Image 4. The girl from Image 1 is wearing the outfit from Image 2. The boy from Image 3 is a customer who walks up to ask the girl for her contact information. The logo from Image 5 is always displayed in the bottom-right corner of the screen.

Multi-panel Storyboard

Reference Input: Storyboard Image (four-panel fight scene storyboard)

Prompt

Reference the storyboard in the image and generate an intense fight scene. Each panel's composition should appear in order, followed by an intense battle between the two characters.

Storyboard with Characters

Reference Input: Image 1 (girl), Image 2 (dad), Image 3 and Image 4 (dining and kitchen storyboard panels)

Prompt

Follow the storyboard composition from Image 3. A girl is waiting for her dad to finish cooking. She says: "Dad, I'm hungry! Is dinner ready?" The girl's appearance references Image 1. Then the camera pans right to switch to Image 4's scene and composition. The dad's appearance references Image 2. The dad replies: "Almost done, just wait a little!" Then the camera cuts back to a close-up of the daughter looking slightly disappointed, saying: "Still not ready? It smells so good..." Then switch to a close-up of the dad saying: "It's almost done for real. Stop rushing and go wash your hands first!"

04 Video Reference

4.1 Action Reference

Reference [Video N]'s [Action Description], generate [Scene Description], maintaining consistent action details.

Film / Action Scene

Reference Input: Video 1; Image 1 & Image 2 (anime girl and boy character portraits)

Prompt

Reference the character actions and camera language from Video 1, generate a fight scene between Image 2 and Image 1. Image 2 is the character on the left, Image 1 is the character on the right. With intense background music.

Marketing / Product Ad

Reference Input: Video 1

Prompt

Reference the running form of the horse from Video 1, generate a golden horse galloping on a grassland, then freeze-frame its magnificent running pose, transforming into a horse-shaped gold pendant.

4.2 Camera Movement Reference

Reference [Video N]'s [Camera Movement Description], generate [Scene Description], maintaining consistent camera movement.

Tech Park Concept Video

Reference Input: Video 1; Image 1 (futuristic Seedance tech park skyline)

Prompt

Reference the camera movement from Video 1 to create a concept video for a tech park. Use the high-rise building from Image 1 as the visual center, with the same first-person diving perspective, highlighting the tech aesthetic of the park in Image 1.

4.3 Effects Reference

Reference [Video N]'s [Effects Description], generate [Scene Description], maintaining consistent effects.

Film / Particle Effects

Reference Input: Video 1; Image 1 (woman in blue hanfu playing a flute on a mountain)

Prompt

Reference the golden particle effects from Video 1, have the character from Image 1 play a flute while surrounded by the same particle effects.

Fun / Wings Effect

Reference Input: Video 1 (wings effect); Image 1 (schoolgirl walking under cherry blossoms)

Prompt

Reference the effects from Video 1 to make the girl from Image 1 grow the same wings, with the wing generation trajectory matching exactly.

05 Video Editing

5.1 Add / Remove / Modify Elements

Add, remove, or replace elements in a video. Refer to uploaded videos by their upload order: Video 1, Video 2, …, Video N.
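
Since labels follow upload order, a small mapping from label to file makes it easier to keep prompts and uploads in sync. This is a minimal sketch under that assumption: `label_uploads` is a hypothetical helper and the file names are placeholders.

```python
def label_uploads(filenames: list[str], kind: str = "Video") -> dict[str, str]:
    """Map 'Video 1', 'Video 2', ... to files in their upload order."""
    return {f"{kind} {i}": name for i, name in enumerate(filenames, start=1)}


labels = label_uploads(["counter.mp4", "closeup.mp4"])
print(labels)

# Pick the label for the clip to edit, then write the instruction around it.
target = next(label for label, name in labels.items() if name == "counter.mp4")
print(f"Add fried chicken, pizza, and other snacks on the counter in {target}.")
```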

Add Elements

Reference Input: Video 1 (original)

Prompt

Add fried chicken, pizza, and other snacks on the counter in Video 1.

Remove Elements

Reference Input: Video 1 (original)

Prompt

Clear the other parts and tools from the desktop in Video 1, keep the desktop clean and tidy — only the items they're holding in their hands should remain.

Modify Elements

Reference Input: Video 1 (original); Image 1 (Seedance face cream product)

Prompt

Replace the perfume in Video 1 with the face cream from Image 1, keeping the motion and camera movement unchanged.

5.2 Video Extension

Extend [Video N] forward/backward + [Description]. The model uses the footage at the connection point so that the extension joins the original seamlessly.

Extend Backward

Reference Input: Video 1 (original)

Prompt

Generate the content after Video 1. Two late-arriving men run toward them, all five people finally meet and chat happily.

Extend Forward

Reference Input: Video 1 (original)

Prompt

Extend Video 1 forward with an over-the-shoulder shot of the man in white. The man in white says: "It's not that bad. You're just stressed. Everyone goes through this, you just need to keep going."

5.3 Track Completion

[Video 1] + [Transition Description] + connect to [Video 2]. Supports up to 3 video inputs with a total duration not exceeding 15 seconds.
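
The two limits stated above (at most 3 videos, at most 15 seconds in total) are easy to check before uploading. The sketch below is illustrative: `validate_track_inputs` and `track_completion_prompt` are hypothetical helpers, and it assumes clip durations are already known in seconds.

```python
MAX_VIDEOS = 3             # limit stated in this guide
MAX_TOTAL_SECONDS = 15.0   # limit stated in this guide


def validate_track_inputs(durations: list[float]) -> float:
    """Check clip count and total length; return the total duration."""
    if not durations:
        raise ValueError("at least one video is required")
    if len(durations) > MAX_VIDEOS:
        raise ValueError(f"at most {MAX_VIDEOS} videos are supported")
    if any(d <= 0 for d in durations):
        raise ValueError("durations must be positive")
    total = sum(durations)
    if total > MAX_TOTAL_SECONDS:
        raise ValueError(
            f"total duration {total}s exceeds {MAX_TOTAL_SECONDS}s")
    return total


def track_completion_prompt(transition: str, first: int = 1,
                            second: int = 2) -> str:
    """Fill the [Video 1] + [Transition] + connect to [Video 2] template."""
    return f"Video {first}, {transition}, then connect to Video {second}."


print(validate_track_inputs([5.0, 6.5]))  # 11.5
print(track_completion_prompt(
    "at the moment the leaf touches the ground, golden particle effects burst out"))
```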

Leaf Transition Between Scenes

Reference Input: Video 1, Video 2

Prompt

Video 1, at the moment the leaf touches the ground, golden particle effects burst out, a gust of wind blows, then connect to Video 2.