Sökning: "Speech Recognition Models"

Visar resultat 1 - 5 av 75 uppsatser innehållade orden Speech Recognition Models.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    Kandidat-uppsats, Uppsala universitet/Statistiska institutionen

    Författare :Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Nyckelord :ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Sammanfattning : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. LÄS MER

  2. 2. Analyzing the Influence of Synthetic andAugmented Data on Segmentation Model

    Uppsats för yrkesexamina på avancerad nivå, Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Författare :Alex Peschel; [2023]
    Nyckelord :Artificial Intelligence; Microorganisms; Segmentation; Synthesizing; Augmentation;

    Sammanfattning : The field of Artificial Intelligence (AI) has experienced unprecedented growth in recent years, thanks to the numerous applications related to speech recognition, natural language processing, and computer vision. However, one of the challenges facing AI is the requirement for large amounts of energy, time, and data to be effective and accurate. LÄS MER

  3. 3. Identification and Classification of TTS Intelligibility Errors Using ASR : A Method for Automatic Evaluation of Speech Intelligibility

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Erik Henriksson; [2023]
    Nyckelord :Automatic Speech Recognition; Natural Language Processing; Speech Technology; Speech Quality Assessment; Text-To-Speech; Taligenkänning; Språkteknologi; Talkvalitetsbedömning; Talsyntes;

    Sammanfattning : In recent years, applications using synthesized speech have become more numerous and publicly available. As the area grows, so does the need for delivering high-quality, intelligible speech, and subsequently the need for effective methods of assessing the intelligibility of synthesized speech. LÄS MER

  4. 4. Intelligible dialogue manager for social robots : An AI dialogue robot solution based on Rasa open-source framework and Pepper robot

    Master-uppsats, Umeå universitet/Institutionen för datavetenskap

    Författare :Jiangeng Sun; [2023]
    Nyckelord :Rasa; understandable robots; HRI;

    Sammanfattning : In the process of Human-Robot Interaction, improving the intelligibility of robots is crucial. Intelligibility refers to the degree to which humans can understand robot behavior and decision-making. When humans interact with low-intelligibility robots, it can lead to a series of problems, such as misunderstanding and trust issues. LÄS MER

  5. 5. Punctuation Restoration as Post-processing Step for Swedish Language Automatic Speech Recognition

    Magister-uppsats, Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Författare :Ishika Gupta; [2023]
    Nyckelord :Transformer; BERT; KB-BERT; NLP; punctuation restoration; deep learning; neural networks;

    Sammanfattning : This thesis focuses on the Swedish language, where punctuation restoration, especially as a postprocessing step for the output of Automatic Speech Recognition (ASR) applications, needs furtherresearch. I have collaborated with NewsMachine AB, a company that provides large-scale mediamonitoring services for its clients, for which it employs ASR technology to convert spoken contentinto text. LÄS MER