Increasing Recall of Lengthening Detection via Semi-Automatic Classification

Detta är en Master-uppsats från Göteborgs universitet/Institutionen för filosofi, lingvistikoch vetenskapsteori

Författare: Jana Vosse; [2017-09-11]

Nyckelord: ;

Sammanfattning: Lengthening is the ideal hesitation strategy for synthetic speech and dialogue systems: it is unobtrusive and hard to notice, becauseit occurs frequently in everyday speech before phrase boundaries, in accentuation, and in hesitation. Despite its elusiveness,it allows valuable extra time for computing or information highlighting in incremental spoken dialogue systems.The elusiveness of the matter, however, poses a challenge for extracting lengthening instances from corpus data: we suspecta recall problem, as human annotators might not be able to consistently label lengthening instances. We address this issue by filtering corpus data for instances of lengthening, using a simple classification method, based on a threshold for normalizedphone duration. The output is then manually labelled for disfluency.This is compared to an existing, fully manual disfluency annotation, showing that recall is significantly higher withsemi-automatic classification. This shows that it is inevitable to use semi-automatic lengthening detection to gather enough datapoints for future analysis of lengthening. On the other hand, it is desirable to further increase the filter performance. We evaluatein detail human versus semi-automatic annotation and train another classifier on the resulting dataset to check the integrityof the disfluent - non-disfluent distinction.

  HÄR KAN DU HÄMTA UPPSATSEN I FULLTEXT. (följ länken till nästa sida)