Sökning: "Speech error"

Visar resultat 1 - 5 av 70 uppsatser innehållade orden Speech error.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    Kandidat-uppsats, Uppsala universitet/Statistiska institutionen

    Författare :Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Nyckelord :ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Sammanfattning : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. LÄS MER

  2. 2. Analysis of speaking time and content of the various debates of the presidential campaign : Automated AI analysis of speech time and content of presidential debates based on the audio using speaker detection and topic detection

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Axel Valentin Maza; [2023]
    Nyckelord :Artificial Intelligence; Speaker detection; Speaker recognition; Speaker diarization; Speaker identification; Debate; Politics; Deep Learning; Artificiell intelligens; talardetektion; talarigenkänning; talardiarisering; talaridentifiering; debatt; politik; djupinlärning;

    Sammanfattning : The field of artificial intelligence (AI) has grown rapidly in recent years and its applications are becoming more widespread in various fields, including politics. In particular, presidential debates have become a crucial aspect of election campaigns and it is important to analyze the information exchanged in these debates in an objective way to let voters choose without being influenced by biased data. LÄS MER

  3. 3. Automatic Voice Trading Surveillance : Achieving Speech and Named Entity Recognition in Voice Trade Calls Using Language Model Interpolation and Named Entity Abstraction

    Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Avdelningen Vi3

    Författare :Martin Sundberg; Mikael Ohlsson; [2023]
    Nyckelord :Automatic Speech Recognition; Natural Language Model; Named Entity Recognition; Voice Trading; Market Surveillance.;

    Sammanfattning : This master thesis explores the effectiveness of interpolating a larger generic speech recognition model with smaller domain-specific models to enable transcription of domain-specific conversations. The study uses a corpus within the financial domain collected from the web and processed by abstracting named entities such as financial instruments, numbers, as well as names of people and companies. LÄS MER

  4. 4. Text Normalization for Text-to-Speech

    Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi

    Författare :Zhaorui Zhang; [2023]
    Nyckelord :;

    Sammanfattning : Text normalization plays a crucial role in text-to-speech systems by ensuring that the input text is in an appropriate format and consists of standardized words prior to grapheme-to-phoneme conversion for text-to-speech. The aim of this study was to assess the performance of five text normalization systems based on different methods. LÄS MER

  5. 5. En undersökning av AI-verktyget Whisper som potentiell ersättare till det manuella arbetssättet inom undertextframtagning

    M1-uppsats, KTH/Hälsoinformatik och logistik

    Författare :Mailad Waled Kider Kaka; Yassin Oummadi; [2023]
    Nyckelord :Manual subtitling creation; AI; Whisper; speech recognition; speech translation; Word Error Rate; COMET-22; SubER; Manuell undertextframtagning; AI; Whisper; taligenkänning; talöversättning; Word Error Rate; COMET-22; SubER;

    Sammanfattning : Det manuella arbetssättet för undertextframtagning är en tidskrävande och kostsam process. Arbetet undersöker AI-verktyget Whisper och dess potential att ersätta processen som används idag. Processen innefattar både transkribering och översättning. LÄS MER