fran@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 7 months ago

Are there any good open source text-to-music models, preferably with lyrical abilities?

4

18

Are there any good open source text-to-music models, preferably with lyrical abilities?

fran@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 7 months ago

4

Only recently did I discover the text-to-music AI companies (udio.com, suno.com) and I was surprised about how good the results are. Both are under lawsuit from RIAA.

I am curious if there are any local ones I can experiment with or train myself. I know there is facebook/musicgen-large on HuggingFace. That model is over 1 year old and there might be others by now. Also, based on the card I get the feeling that model is not going to be good at doing specific song lyrics (maybe the lyrics just were absent from the training data?). I am most interested in trying my hand at writing songs and fine-tuning a model on specific types of music to get the sounds I am looking for.

Chat

Fisch@discuss.tchncs.de
link
fedilink
English
arrow-up
1
arrow-down
1·
7 months ago
Maybe it would be possible to use a regular text-to-voice model and then use something similar to autotune

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
63 users / week
280 users / month
575 users / 6 months
9 local subscribers
2.66K subscribers
266 Posts
1.17K Comments
Modlog