Sökning: "Giuseppe Della Corte"

Hittade 1 uppsats innehållade orden Giuseppe Della Corte.

  1. 1. Text and Speech Alignment Methods for Speech Translation Corpora Creation : Augmenting English LibriVox Recordings with Italian Textual Translations

    Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi

    Författare :Giuseppe Della Corte; [2020]
    Nyckelord :speech translation; parallel corpora; bilingual sentence alignment; sentence embeddings; cosine similarity; forced alignment; text collection; corpora creation; audio signal processing;

    Sammanfattning : The recent uprise of end-to-end speech translation models requires a new generation of parallel corpora, composed of a large amount of source language speech utterances aligned with their target language textual translations. We hereby show a pipeline and a set of methods to collect hundreds of hours of English audio-book recordings and align them with their Italian textual translations, using exclusively public domain resources gathered semi-automatically from the web. LÄS MER