Sökning: "Text to Music Audio Generation"

Hittade 2 uppsatser innehållade orden Text to Music Audio Generation.

1. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)
Författare :Ernan Wang; [2023]
Nyckelord :Text to Music Audio Generation; Latent Diffusion; AudioLDM; Sampling Methods; Denoising Diffusion Probabilistic Model DDPM ; Denoising Diffusion Implicit Model DDIM ; Text till musik Ljudgenerering; Latent Diffusion; AudioLDM; Samplingsmetoder; DDPM; DDIM;

Sammanfattning : In the emerging field of audio generation using diffusion models, this project pioneers the adaptation of the AudioLDM model framework, initially designed for text-to-daily sounds generation, towards text-to-music audio generation. This shift addresses a gap in the current scope of audio diffusion models, predominantly focused on everyday sounds. LÄS MER
2. Hotspot Detection for Automatic Podcast Trailer Generation
Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi
Författare :Winstead Xingran Zhu; [2021]
Nyckelord :automatic podcast trailer generation; hotspot detection; speech emotion recognition; text emotion recognition; text arousal detection; pull-quote selection; music detection; laughter detection; affect analysis; affective computing; machine learning; neural network;

Sammanfattning : With podcasts being a fast growing audio-only form of media, an effective way of promoting different podcast shows becomes more and more vital to all the stakeholders concerned, including the podcast creators, the podcast streaming platforms, and the podcast listeners. This thesis investigates the relatively little studied topic of automatic podcast trailer generation, with the purpose of en- hancing the overall visibility and publicity of different podcast contents and gen- erating more user engagement in podcast listening. LÄS MER

Resultatsidor:

Sökning: "Text to Music Audio Generation"

1. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model

2. Hotspot Detection for Automatic Podcast Trailer Generation

Sökningar just nu

Populära sökningar

Uppsatser med många visningar igår (2024-04-27)