CivArchive
    Audio-Driven Video Generation | Wan2.2-S2V - v1.0
    NSFW
    Preview 97019974

    Try it out first to decide whether to install it. in case you are not satisfied.

    https://www.runninghub.ai/post/1961049452163305473?inviteCode=rh-v1213

    use my invitation code(rh-v1213) , you'll get 1000 points

    1. Upload a audio file and set duration

    2. Enter prompts

    You can find the associated model at Civitai.com

    [wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors]

    [wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors]

    [wan_2.1_vae.safetensors]

    Wan2.2-S2V is an AI video generation model that can convert static images and audio inputs into video content. The model can generate videos up to minute-level duration in a single generation, providing new solutions for video creation in digital human livestreaming, film production, and education industries.

    The model performs well in film and television application scenarios, capable of generating facial expressions, body movements, and camera language. It supports full-body and half-body character generation, able to complete various content creation needs such as dialogue, singing, and performance.

    Description

    FAQ

    Comments (5)

    oron123509Aug 28, 2025
    CivitAI

    Thanks. Which folder in Comfy is for the wav2vec2 english ?

    MoreColorsAug 28, 2025

    audio_encoders

    R240Aug 29, 2025

    dont have and audio encoders file

    manbut117487Aug 31, 2025

    you gotta make the folder.

    iharriAug 29, 2025
    CivitAI

    Any instructions on how to use this?

    Workflows
    Wan Video 14B t2v

    Details

    Downloads
    498
    Platform
    CivitAI
    Platform Status
    Available
    Created
    8/28/2025
    Updated
    6/11/2026
    Deleted
    -

    Files

    audioDrivenVideo_v10.zip

    Mirrors

    CivitAI (1 mirrors)