Search: "Doc2Vec"

Showing results 1 - 5 of 14 theses containing the word Doc2Vec.

  1. Natural Language Processing for Improving Search Query Results : Applied on The Swedish Armed Force's Profession Guide

    Professional degree thesis (advanced level), Uppsala University/Computer Science

    Author: Andreas Harju Schnee; [2023]
    Keywords: natural language processing; NLP; machine learning; ML; artificial intelligence; AI; language model; information retrieval system; document embedding; text representation; text data augmentation;

    Abstract: Text has been the historical way of preserving and acquiring knowledge, and text data today is an increasingly growing part of the digital footprint together with the need to query this data for information. Seeking information is a constant ongoing process, and is a crucial part of many systems all around us.

  2. Using Semi-Supervised Learning for Email Classification

    Master's thesis, KTH/Mathematics (Div.)

    Author: Anders Inde; [2022]
    Keywords: applied mathematics; semi-supervised learning; self-training; doc2vec; classification;

    Abstract: In this thesis, we investigate the use of self-training, a semi-supervised learning method, to improve binary classification of text documents. This means making use of unlabeled samples, since labeled samples can be expensive to generate. More specifically, we want to classify emails that are retrieved by Skandinaviska Enskilda Banken (SEB).

  3. Object Classification using Language Models

    Professional degree thesis (advanced level), Uppsala University/Signals and Systems

    Author: Gustav From; [2022]
    Keywords: Classifier; BERT; machine learning; ML; language model; IMDB; word2Vec; doc2Vec; NLP;

    Abstract: In today’s modern digital world more and more emails and messengers must be sent, processed and handled. The categorizing and classification of these text pieces can take an incredibly long time and will cost the company a lot of time and money.

  4. Maskininlärning för dokumentklassificering av finansiella dokument med fokus på fakturor [Machine learning for document classification of financial documents with a focus on invoices]

    Professional degree thesis (advanced level), Örebro University/School of Science and Technology

    Author: Nawar Khalid Saeed; [2022]
    Keywords: Document classification; Text classification; Invoices; NLP; TF-IDF; Doc2vec; Machine Learning; Logistic Regression; Multinomial Naïve Bayes; Support Vector Machine;

    Abstract: Automated document classification is a process or method that aims to process and manage documents in digital form. Many companies seek a text classification methodology that can solve a variety of problems.

  5. Applying Natural Language Processing to document classification

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: David Kragbé; [2022]
    Keywords: Natural Language Processing; Document Classification; Embeddings; Classifiers;

    Abstract: In today's digital world, we produce and use more electronic documents than ever before. And this trend is far from slowing down. Particularly, more and more companies and businesses now need to treat a considerable amount of documents to deal with their clients' requests.