Search: "Lexical normalization"

Found 4 theses containing the words Lexical normalization.

  1. A Rule-Based Normalization System for Greek Noisy User-Generated Text

    Master's thesis, Uppsala universitet/Institutionen för lingvistik och filologi

    Author: Marsida Toska; [2020]
    Keywords: nlp; noisy text preprocessing; rule-based; levenshtein; twitter; normalization; Greek;

    Abstract: The ever-growing usage of social media platforms generates daily vast amounts of textual data which could potentially serve as a great source of information. Therefore, mining user-generated data for commercial, academic, or other purposes has already attracted the interest of the research community.

  2. A Pipeline for Automatic Lexical Normalization of Swedish Student Writings

    Master's thesis, Uppsala universitet/Institutionen för lingvistik och filologi

    Author: Yuhan Liu; [2018]
    Keywords: Lexical normalization; Phonetic algorithm for Swedish;

    Abstract: In this thesis, we aim to explore the combination of different lexical normalization methods and provide a practical lexical normalization pipeline for Swedish student writings within the framework of SWEGRAM (Näsman et al., 2017).

  3. Classification of Fiction Genres : Text classification of fiction texts from Project Gutenberg

    Master's thesis, Högskolan i Borås/Akademin för bibliotek, information, pedagogik och IT

    Author: Rolf Bucher; [2018]
    Keywords: text; classification; genre; machine; learning; supervised; Gutenberg; fiction;

    Abstract: Stylometric analysis in text classification is most often used in authorship attribution studies. This thesis used a machine learning algorithm, the Naive Bayes Classifier, in a text classification task comparing stylometric and lexical features.

  4. Effekten av avståndsoperatorer samt expansion med synonymer med avseende på återvinningseffektiviteten

    Magister thesis, Högskolan i Borås/Institutionen Biblioteks- och informationsvetenskap / Bibliotekshögskolan

    Author: Emma Elofsson; [2006]
    Keywords: library and information science; biblioteks- och informationsvetenskap;

    Abstract: This thesis examines the effects of proximity operators and query expansion with synonyms on retrieval performance. The queries were expanded with synonyms and structured with proximity operators where the permitted distance between terms varied from 1 to 3. The expansion terms were selected from the online lexical reference system WordNet.