Best Music Generation

Models suited to generating musical audio, from simple melodic motifs to richer, layered soundscapes. Selected for coherent structure, usable musicality, and consistent output quality.

Launch model

Top Pick

Launch View details

Best rated

ACE-Step v1.5 XL SFT

ACE-Step v1.5 XL SFT is the supervised fine-tuned 4B DiT variant in the ACE-Step 1.5 XL line. It is positioned as the highest-quality XL option, combining 50-step CFG inference with stronger prompt adherence and refined audio quality for text-to-music, cover, and repaint workflows when final output quality matters more than speed or broader editing task coverage.

Featured Models

Top-performing models in this category, recommended by our community and performance benchmarks.

Launch View details

ACE-Step v1.5 XL Base

ACE-Step v1.5 XL Base is the 4B DiT variant of ACE-Step 1.5 for high-quality music generation and editing. It supports text-to-music, cover generation, repaint, extract, lego, and complete workflows, uses 50 inference steps with CFG, and is designed for longer-form audio generation up to 10 minutes with broad multilingual prompt support.

Launch View details

ACE-Step v1.5 XL Turbo

ACE-Step v1.5 XL Turbo is the accelerated 4B DiT variant of ACE-Step 1.5. It is optimized for faster music generation with 8-step distilled inference while retaining the higher-capacity XL architecture. It supports text-to-music, cover generation, and repaint workflows, making it suitable for rapid iteration when the 2B turbo model is not enough in audio quality.

Launch View details

MiniMax Music 2.6

by MiniMax

MiniMax Music 2.6 is MiniMax’s latest music generation model for full vocal songs and instrumentals from text prompts. It supports natural-language prompts or detailed production-style instructions, follows specified BPM and key with high reliability, and exposes fine-grained song structure control through section tags. The same Music API also supports instrumental generation, lyrics-assisted workflows, and synchronous or streaming delivery.

Launch View details

ACE-Step v1.5 Base

ACE-Step v1.5 Base is an open-source music generation foundation model built on a hybrid LLM planner and Diffusion Transformer architecture. It generates full tracks from text prompts with support for voice cloning, lyric editing, remixing, cover generation, and compositions up to 10 minutes. It supports over 50 languages and runs on consumer hardware with under 4GB VRAM.

Launch View details

ACE-Step v1.5 Turbo

ACE-Step v1.5 Turbo is a speed-optimized variant of the ACE-Step v1.5 music generation model. It delivers faster inference with fewer denoising steps while retaining the core capabilities of the Base model, including voice cloning, lyric editing, remixing, and multilingual support across 50+ languages.

Launch View details

Mirelo SFX 1.5

by Mirelo

Mirelo SFX 1.5 converts video into synchronized sound effects. It targets higher audio fidelity and wider scene coverage. It helps developers add context aware soundscapes to video pipelines with faster processing and flexible integration options.

Launch View details

MiniMax Music Cover

by MiniMax

MiniMax Music Cover is MiniMax’s song-to-song transformation model for reimagining an existing track in a new style. It preserves the original vocal melody while changing voice timbre, instrumentation, genre, and arrangement through a text prompt. It supports one-step generation from reference audio or a two-step workflow with preprocessing and optional lyric editing.

Best Music Generation

ACE-Step v1.5 XL SFT

Featured Models

ACE-Step v1.5 XL Base

ACE-Step v1.5 XL Turbo

MiniMax Music 2.6

ACE-Step v1.5 Base

ACE-Step v1.5 Turbo

Mirelo SFX 1.5

MiniMax Music Cover

Explore other collections