Sökning: "Audio Recognition"

Visar resultat 1 - 5 av 70 uppsatser innehållade orden Audio Recognition.

  1. 1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data

    Kandidat-uppsats, Uppsala universitet/Statistiska institutionen

    Författare :Max Fredriksson; Elise Ramsay Veljanovska; [2024]
    Nyckelord :ASR; Automatic Speech Recognition; Swedish Speech Recognition; Speech Recognition Models; Speech-to-Text; Whisper; VoxRex; Wav2Vec; Model Comparison; Transformer Models; Neural Networks; Machine Learning; WER; Word Error Rate; Transcription;

    Sammanfattning : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. LÄS MER

  2. 2. Analysis of speaking time and content of the various debates of the presidential campaign : Automated AI analysis of speech time and content of presidential debates based on the audio using speaker detection and topic detection

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Axel Valentin Maza; [2023]
    Nyckelord :Artificial Intelligence; Speaker detection; Speaker recognition; Speaker diarization; Speaker identification; Debate; Politics; Deep Learning; Artificiell intelligens; talardetektion; talarigenkänning; talardiarisering; talaridentifiering; debatt; politik; djupinlärning;

    Sammanfattning : The field of artificial intelligence (AI) has grown rapidly in recent years and its applications are becoming more widespread in various fields, including politics. In particular, presidential debates have become a crucial aspect of election campaigns and it is important to analyze the information exchanged in these debates in an objective way to let voters choose without being influenced by biased data. LÄS MER

  3. 3. Song Popularity Prediction with Deep Learning : Investigating predictive power of low level audio features

    Magister-uppsats, Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Författare :Gustaf Holst; Jan Niia; [2023]
    Nyckelord :machine learning; deep learning; audio;

    Sammanfattning : Today streaming services are the most popular way to consume music, and with this the field of Music Information Retrieval (MIR) has exploded. Tangy market is a music investment platform and they want to use MIR techniques to estimate the value of not yet released songs. LÄS MER

  4. 4. How do voiceprints age?

    Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi

    Författare :Maya Konstantinovna Nachesa; [2023]
    Nyckelord :Voiceprint; Speaker Emotion Recognition; Age; Speaker Verification;

    Sammanfattning : Voiceprints, like fingerprints, are a biometric. Where fingerprints record a person's unique pattern on their finger, voiceprints record what a person's voice "sounds like", abstracting away from what the person said. They have been used in speaker recognition, including verification and identification. LÄS MER

  5. 5. Dynamic Mixed Reality AssemblyGuidance Using Optical Recognition Methods

    Master-uppsats, KTH/Industriell produktion

    Författare :Harpa Hlíf Guðjónsdóttir; Gestur Andrei Ólafsson; [2022]
    Nyckelord :Manufacturing; Mixed Reality; Augmented Reality; Assembly Guidance; Object Recognition; HoloLens 2; Vuforia Engine; Unity; Mixed Reality Toolkit; Tillverkning; Blandad Verklighet; Förstärkt Verklighet; Monteringsvägledning; Objektigenkänning; HoloLens 2; Unity; Vuforia Engine; Mixed Reality Toolkit;

    Sammanfattning : Mixed Reality (MR) is an emerging paradigm in industry. While MR equipment and software have taken great technological strides in past years, standardized methods and workflows for developing MR systems for industry have not been widely adopted for many tasks. This thesis proposes a dynamic MR system for an assembly process. LÄS MER