Search: "Bag-of-Words"

Showing results 16 - 20 of 53 theses containing the word Bag-of-Words.

  16. Preprocessing method comparison and model tuning for natural language data

    Master's thesis, Högskolan Dalarna/Mikrodataanalys

    Author: Peter Tempfli; [2020]
    Keywords: Natural language processing; sentiment analysis; machine learning;

    Abstract: Twitter and other microblogging services are a valuable source for almost real-time marketing, public opinion and brand-related consumer information mining. As such, collection and analysis of user-generated natural language content is in the focus of research regarding automated sentiment analysis.

  17. 'Sorry, I didn't understand that' : A comparison of methods for intent classification for social robotics applications

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Mikaela Åstrand; [2020]
    Keywords: none listed

    Abstract: An important feature in a social robot is the ability to understand natural language. One of the core components in a typical system for natural language understanding (NLU) is so-called intent classification; the action of classifying user utterances based on the underlying intents of the user.

  18. Predicting Swedish News Article Popularity

    Master's thesis, Linköpings universitet/Interaktiva och kognitiva system

    Author: Ludvig Noring; [2020]
    Keywords: Natural language processing; News popularity prediction; cold-start prediction; News media; Prediction; Popularity;

    Abstract: In this work, 132,229 articles from a Swedish news publisher are used to explore news article popularity prediction. Linear-, k-Nearest Neighbor- and Support Vector Regression are evaluated using the two different metrics root mean squared error and R2. The problem is then relaxed into only attempting to rank the articles relative to each other.

  19. Automatic fingerprinting of websites

    M1 thesis, KTH/Hälsoinformatik och logistik

    Authors: Alfred Berg; Norton Lamberg; [2020]
    Keywords: none listed

    Abstract: Fingerprinting a website is the process of identifying what technologies a website uses, such as their used web applications and JavaScript frameworks. Current fingerprinting methods use manually created fingerprints for each technology it looks for.

  20. Exploring Cross-lingual Sublanguage Classification with Multi-lingual Word Embeddings

    Master's thesis, Linköpings universitet/Statistik och maskininlärning

    Author: Min-Chun Shih; [2020]
    Keywords: none listed

    Abstract: Cross-lingual text classification is an important task due to the globalization and the increased availability of multilingual data. This thesis explores the method of implementing cross-lingual classification on Swedish and English medical corpora.