Sökning: "Part-of-Speech-Tagging"

Visar resultat 1 - 5 av 15 uppsatser innehållade ordet Part-of-Speech-Tagging.

  1. 1. IŻ SWÓJ JĘZYK MAJĄ! An exploration of the computational methods for identifying language variation in Polish

    Master-uppsats, Göteborgs universitet / Institutionen för filosofi, lingvistik och vetenskapsteori

    Författare :Maria Irena Szawerna; [2023-06-19]
    Nyckelord :language variation; Polish; diachronic linguistics; part-of-speech tagging; lemmatization; corpus linguistics;

    Sammanfattning : Computational approaches to language variation continue to contribute in a relevant way to various fields, including Natural Language Processing (NLP) and linguistics. Being able to accommodate variation within natural language increases the robustness of NLP models and their usefulness in real-life applications; simultaneously, detecting and describing variation and trends that govern it is one of the main goals of sociolinguistics and historical linguistics, meaning that some of the advances in NLP can contribute to these fields as well. LÄS MER

  2. 2. Cross-Lingual and Genre-Supervised Parsing and Tagging for Low-Resource Spoken Data

    Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi

    Författare :Iliana Fosteri; [2023]
    Nyckelord :dependency parsing; part-of-speech tagging; low-resource languages; transcribed speech; large language models; cross-lingual learning; transfer learning; multi-task learning; Universal Dependencies;

    Sammanfattning : Dealing with low-resource languages is a challenging task, because of the absence of sufficient data to train machine-learning models to make predictions on these languages. One way to deal with this problem is to use data from higher-resource languages, which enables the transfer of learning from these languages to the low-resource target ones. LÄS MER

  3. 3. Prerequisites for Extracting Entity Relations from Swedish Texts

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Erik Lenas; [2020]
    Nyckelord :Machine Learning; Natural Language Processing; Relation Extraction; Named Entity Recognition; Coreference resolution; BERT; Maskininlärning; Natural Language Processing; Relationsextrahering; Named Entity Recognition; Coreference resolution; BERT;

    Sammanfattning : Natural language processing (NLP) is a vibrant area of research with many practical applications today like sentiment analyses, text labeling, questioning an- swering, machine translation and automatic text summarizing. At the moment, research is mainly focused on the English language, although many other lan- guages are trying to catch up. LÄS MER

  4. 4. A topic model-based approach for ontology extension in the computational materials science domain

    Magister-uppsats, Linköpings universitet/Institutionen för datavetenskap

    Författare :Tong Zhang; [2020]
    Nyckelord :;

    Sammanfattning : With the continuous development and progress of human society, the demand for advanced materials in all walks of life is increasing day by day. No matter in the agrarian age or the information age, human beings have always been tireless in the study of materials science, and the field of computational materials science has been the exploration of computational methods in materials science. LÄS MER

  5. 5. A Rule-Based Normalization System for Greek Noisy User-Generated Text

    Master-uppsats, Uppsala universitet/Institutionen för lingvistik och filologi

    Författare :Marsida Toska; [2020]
    Nyckelord :nlp; noisy text preprocessing; rule-based; levenshtein; twitter; normalization; Greek;

    Sammanfattning : The ever-growing usage of social media platforms generates daily vast amounts of textual data which could potentially serve as a great source of information. Therefore, mining user-generated data for commercial, academic, or other purposes has already attracted the interest of the research community. LÄS MER