Sökning: "Text to Music Audio Generation"
Hittade 2 uppsatser innehållade orden Text to Music Audio Generation.
1. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : In the emerging field of audio generation using diffusion models, this project pioneers the adaptation of the AudioLDM model framework, initially designed for text-to-daily sounds generation, towards text-to-music audio generation. This shift addresses a gap in the current scope of audio diffusion models, predominantly focused on everyday sounds. LÄS MER
2. Hotspot Detection for Automatic Podcast Trailer Generation
Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologiSammanfattning : With podcasts being a fast growing audio-only form of media, an effective way of promoting different podcast shows becomes more and more vital to all the stakeholders concerned, including the podcast creators, the podcast streaming platforms, and the podcast listeners. This thesis investigates the relatively little studied topic of automatic podcast trailer generation, with the purpose of en- hancing the overall visibility and publicity of different podcast contents and gen- erating more user engagement in podcast listening. LÄS MER