Sökning: "word error rate"
Visar resultat 1 - 5 av 31 uppsatser innehållade orden word error rate.
1. A Comparative Analysis of Whisper and VoxRex on Swedish Speech Data
Kandidat-uppsats, Uppsala universitet/Statistiska institutionenSammanfattning : With the constant development of more advanced speech recognition models, the need to determine which models are better in specific areas and for specific purposes becomes increasingly crucial. Even more so for low-resource languages such as Swedish, dependent on the progress of models for the large international languages. LÄS MER
2. Automatic Voice Trading Surveillance : Achieving Speech and Named Entity Recognition in Voice Trade Calls Using Language Model Interpolation and Named Entity Abstraction
Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Avdelningen Vi3Sammanfattning : This master thesis explores the effectiveness of interpolating a larger generic speech recognition model with smaller domain-specific models to enable transcription of domain-specific conversations. The study uses a corpus within the financial domain collected from the web and processed by abstracting named entities such as financial instruments, numbers, as well as names of people and companies. LÄS MER
3. En undersökning av AI-verktyget Whisper som potentiell ersättare till det manuella arbetssättet inom undertextframtagning
M1-uppsats, KTH/Hälsoinformatik och logistikSammanfattning : Det manuella arbetssättet för undertextframtagning är en tidskrävande och kostsam process. Arbetet undersöker AI-verktyget Whisper och dess potential att ersätta processen som används idag. Processen innefattar både transkribering och översättning. LÄS MER
4. Live captioning and translation application for Android
Uppsats för yrkesexamina på grundnivå, Umeå universitet/Institutionen för tillämpad fysik och elektronikSammanfattning : Captioning has long been used in media to help D/deaf and hard-of-hearing persons. Captioning however is difficult and time-consuming manual work. With the rapid evolution of automated speech recognition (ASR) systems, live captioning of everyday speech will soon be a practical reality. LÄS MER
5. Generation of Control Logic from Ordinary Speech
Kandidat-uppsats, Högskolan i Halmstad/Akademin för informationsteknologiSammanfattning : Developments in automatic code generation are evolving remarkably fast, with companies and researchers competing to reach human-level accuracy and capability. Advancements in this field primarily focus on using machine learning models for end-to-end code generation. LÄS MER