Search: "Tal-och språkteknologi"

Found 3 theses containing the words Tal-och språkteknologi.

  1. Identification and Classification of TTS Intelligibility Errors Using ASR: A Method for Automatic Evaluation of Speech Intelligibility

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Erik Henriksson; [2023]
    Keywords: Automatic Speech Recognition; Natural Language Processing; Speech Technology; Speech Quality Assessment; Text-To-Speech; Taligenkänning; Språkteknologi; Talkvalitetsbedömning; Talsyntes

    Abstract: In recent years, applications using synthesized speech have become more numerous and publicly available. As the area grows, so does the need for delivering high-quality, intelligible speech, and subsequently the need for effective methods of assessing the intelligibility of synthesized speech.

  2. Automatic Podcast Chapter Segmentation: A Framework for Implementing and Evaluating Chapter Boundary Models for Transcribed Audio Documents

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Adam Feldstein Jacobs; [2022]
    Keywords: Machine Learning; Natural Language Processing; Speech Technology; Deep Learning; Podcast Segmentation; Maskininlärning; Språkteknologi; Djupinlärning

    Abstract: Podcasts are an exponentially growing audio medium where useful and relevant content should be served, which requires new methods of information sorting. This thesis is the first to look into the state-of-art problem of segmenting podcasts into chapters (structurally and topically coherent sections).

  3. Medical image captioning based on Deep Architectures

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Georgios Moschovis; [2022]
    Keywords: Artificial Neural Networks; Deep Learning; Speech and language technology; Natural Language Processing (NLP); Deep networks; Generative deep networks; Convolutional neural networks (CNN); Text generation; Information retrieval; Diagnostic captioning; Image captioning; concept prediction; classification; image encoders; transformers; Encoder-Decoder architecture; abstractive summarization; Neurala nätverk; Djup inlärning; Tal-och språkteknologi; naturlig språkbehandling; djup neurala nätverk; generativa djupa nätverk; konvolutionella neurala nätverk; Textgenerering; Informationssökning; Diagnostisk textning; Bildtextning; konceptförutsägelse; klassificering; bildkodare; transformatorer; kodaravkodararkitektur; abstrakt sammanfattning

    Abstract: Diagnostic Captioning is described as “the automatic generation of a diagnostic text from a set of medical images of a patient collected during an examination” [59] and it can assist inexperienced doctors and radiologists to reduce clinical errors or help experienced professionals increase their productivity. In this context, tools that would help medical doctors produce higher quality reports in less time could be of high interest for medical imaging departments, as well as significantly impact deep learning research within the biomedical domain, which makes it particularly interesting for people involved in industry and researchers all along.