Sökning: "data wikipedia"

Visar resultat 1 - 5 av 44 uppsatser innehållade orden data wikipedia.

  1. 1. Multilingual Text Robots for Abstract Wikipedia – Using Grammatical Framework to generate multilingual articles on Swedish localities

    Kandidat-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknik

    Författare :Omar Diriye; Filip Folkesson; Erik NIlsson; Felix NIlsson; William NIlsson; Dylan Osolian; [2023-03-03]
    Nyckelord :Text robot; Natural Language Generation; Grammatical Framework; Multilingual Natural Language Generation; Abstract Wikipedia; Wikidata;

    Sammanfattning : The vast amount of Wikipedia articles and languages has resulted in a high cost of Wikipedia, i.e. the required time and dedication for making every article available in every language. LÄS MER

  2. 2. Mitigating Unintended Bias in Toxic Comment Detection using Entropy-based Attention Regularization

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Fabio Camerota; [2023]
    Nyckelord :XLNet; BERT; Toxic Comment Classification; Entropy-based Attention Regularization; XLNet; BERT; Toxisk Kommentar Klassificering; Entropibaserad uppmärksamhetsreglering;

    Sammanfattning : The proliferation of hate speech is a growing challenge for social media platforms, as toxic online comments can have dangerous consequences also in real life. There is a need for tools that can automatically and reliably detect hateful comments, and deep learning models have proven effective in solving this issue. LÄS MER

  3. 3. Generating Wikipedia Articles with Grammatical Framework : A Case Study

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Keivan Matinzadeh; [2023]
    Nyckelord :Grammatical Framework; Computational Linguistics; Natural Language Generation; Computer Science; Grammatical Framework; Beräkningslingvistik; Textgenerering; Datavetenskap;

    Sammanfattning : Natural language generation is a method used to produce understandable texts in human languages from data [1]. Grammatical Framework is a grammar formalism and a functional programming language using a nonstatistical approach to build natural language applications. LÄS MER

  4. 4. Topical Classification of Images in Wikipedia : Development of topical classification models followed by a study of the visual content of Wikipedia

    Master-uppsats, Linköpings universitet/Datorseende

    Författare :Matheus Vieira Bernat; [2023]
    Nyckelord :Wikipedia; Multilabel classification; Deep learning;

    Sammanfattning : With over 53 million articles and 11 million images, Wikipedia is the greatest encyclopedia in history. The number of users is equally significant, with daily views surpassing 1 billion. Such an enormous system needs automation of tasks to make it possible for the volunteers to maintain. LÄS MER

  5. 5. Synthetic data generation for domain adaptation of a retriever-reader Question Answering system for the Telecom domain : Comparing dense embeddings with BM25 for Open Domain Question Answering

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Filip Döringer Kana; [2023]
    Nyckelord :Natural Language Processing; Transformers; Deep Learning; Question Answering; Data Generation; Språkteknologi; Transformers; Djupinlärning; Frågebesvaring; Datagenerering;

    Sammanfattning : Having computer systems capable of answering questions has been a goal within Natural Language Processing research for many years. Machine Learning systems have recently become increasingly proficient at this task with large language models obtaining state-of-the-art performance. LÄS MER