Comments

You must log in or register to comment.

Just_CurioussSss t1_j2rz8o2 wrote

Riffusion is an excellent way to be flexible with music production. An artist can go from heavy metal to pop. Not to mention, using stable diffusion and interpolation in latent space will create smooth transitions in the generated audio clips, which could help to create more coherent and pleasant-sounding music. I'm curious about one thing, though. What do you use for semantic analysis of the text inputs? Have you had issues with that?

3