CivArchive
    Ace Step 1.5 Turbo and SFT - TEXT to AUDIO model with Ollama - v1.0
    Preview 120369307

    V1.2 Ace Step 1.5. Turbo and SFT model with Ollama Text to Audio/Song (examples below)

    • small update to GUI, system prompts and SFT sampler "engine"

    • bypass "ConditioningZeroOut" node in subgraph of "sampler SFT" in case you want to revert back to v1.0 "engine"


    V1.0 Ace Step 1.5. Turbo and SFT model with Ollama Text to Audio/Song

    Ace Step uses TAGS and LYRICS to create a song. These can be generated by Ollama or by own prompts.

    • Can use any Song, Artist as reference or any other description to generate tags and lyrics.

    • Will output up to two songs, one generated by Turbo model, the other by the SFT model (experimental).

    • Keyscales, bpm and song duration can be randomized.

    • able to use dynamic prompts.

    • creates suitable songtitle and filenames with Ollama.

    • Lora Loader included, hope to see some Loras soon!

    Important: Do not use sage attention in your comfyui starting parameters, avoid --lowvram setting, as this might force Texencoder to run very slow on CPU instead of GPU. Recommend to toggle Link visibility to hide the wires.


    Download Files:

    Ollama Models, required for tags, lyrics and songtitle, you can choose 1,2 or 3 different models, tags and lyrics might need a bigger model >7b, songtitle can use a smaller model:


    Update 9th of Feb 26: Alternative Turbo and SFT Models :


    Which models to start with ? => Turbo, SFT-Shift1 and Llama3-NeuralDaredevil for Ollama.

    • My current choice: Turbo-SFT merge_ta_0.5, SFT-Shift1 and Llama3-NeuralDaredevil for Ollama. Using these settings:

      • Turbo-SFT_merge model with 22 steps, sampler: er_sde, scheduler: beta57

      • SFT-Shift1 model with 100-140 steps, sampler euler, scheduler: normal


    Save Location:

    • 📂 ComfyUI/

    • ├── 📂 models/

    • │ ├── 📂 diffusion_models/

    • │ │ └── acestep_v1.5_turbo.safetensors

    • │ ├── 📂 text_encoders/

    • │ │ ├── qwen_0.6b_ace15.safetensors

    • │ │ └── qwen_4b_ace15.safetensors (or 1.7b)

    • │ └── 📂 vae/

    • │ └── ace_1.5_vae.safetensors


    Custom Nodes used:

    optional (use Beta57 scheduler for a bit more punch, requires RES4LYF): https://github.com/ClownsharkBatwing/RES4LYF


    Examples various styles:


    Ollama help:

    1. Install Ollama from https://ollama.com/

    2. download a model: Go to a model page, chose a model , then hit the copy button, i.e. https://ollama.com/huihui_ai/qwen3-vl-abliterated

    3. open terminal and paste the model name, i.e.: ollama run huihui_ai/qwen3-vl-abliterated

    4. model will be downloaded and can be selected in green comfy node "Ollama Connectivity". Hit "Reconnect" to refresh.

    Description

    Ace Step 1.5. Turbo and SFT model with Ollama for song-tags and lyrics

    Workflows
    Other

    Details

    Downloads
    326
    Platform
    CivitAI
    Platform Status
    Available
    Created
    2/8/2026
    Updated
    3/25/2026
    Deleted
    -

    Files

    aceStep15TurboAndSFTTEXT_v10.zip

    Mirrors