Sökning: "Document Preprocessing"

Visar resultat 6 - 10 av 11 uppsatser innehållade orden Document Preprocessing.

  1. 6. Automatic Patent Classification

    Master-uppsats,

    Författare :Nala Yehe; [2020]
    Nyckelord :XGBoost; support vector machine SVM ; random forest; decision tree; machine learning; text data mining; patent classification; IPC;

    Sammanfattning : Patents have a great research value and it is also beneficial to the community of industrial, commercial, legal and policymaking. Effective analysis of patent literature can reveal important technical details and relationships, and it can also explain business trends, propose novel industrial solutions, and make crucial investment decisions. LÄS MER

  2. 7. DETECTION of INFRASTRUCTURE ANOMALIES in BUILD LOGS USING MACHINE LEARNINGText classification on Continous Integration log files.

    Master-uppsats, Umeå universitet/Institutionen för datavetenskap

    Författare :Didrik Lindqvist; [2019]
    Nyckelord :;

    Sammanfattning : Continuous integration is a practice where software developers integrate their code to a bigger codebase multiple times per day. Before the integration, the code is built and tested by e.g open source build tools such as Jenkins, and the information produced during this process is stored in a log file. LÄS MER

  3. 8. How to explain graph-based semi-supervised learning for non-mathematicians?

    Kandidat-uppsats, Malmö universitet/Fakulteten för teknik och samhälle (TS)

    Författare :Mattias Jönsson; Lucas Borg; [2019]
    Nyckelord :Graph based SSL; Label Propagation; Naive Bayes’; KNN; RBF; Feature extraction; 20 newsgroup; preprocessing; graph construction;

    Sammanfattning : Den stora mängden tillgänglig data på internet kan användas för att förbättra förutsägelser genom maskininlärning. Problemet är att sådan data ofta är i ett obehandlat format och kräver att någon manuellt bestämmer etiketter på den insamlade datan innan den kan användas av algoritmen. LÄS MER

  4. 9. PDF document search within a very large database

    Kandidat-uppsats, KTH/Skolan för informations- och kommunikationsteknik (ICT)

    Författare :Lizhong Wang; [2017]
    Nyckelord :Portable Document Format; Search; Document Identification; Cosine Similarity; Document Preprocessing; Document Search; Optimization Method; Performance Analysis; Classification; Regression; Loredge.; Portable Document Format; Sökning; Dokument Identifiering; Cosine Similarity; Dokument Förhandling; Dokument Sökning; Optimering metod; Prestandaanalys; Klassificering; Regression; Loredge;

    Sammanfattning : Digital search engine, taking a search request from user and then returning a result responded to the request to the user, is indispensable for modern humans who are used to surfing the Internet. On the other hand, the digital document PDF is accepted by more and more people and becomes widely used in this day and age due to the convenience and effectiveness. LÄS MER

  5. 10. Development of a data analysis platform for characterizing functional connectivity networks in rodents

    Master-uppsats, KTH/Skolan för teknik och hälsa (STH)

    Författare :Cyril Gerard Valery Monnot; [2013]
    Nyckelord :functional magnetic resonance imaging; image processing;

    Sammanfattning : This document addresses the development and implementation of a routine for analyzing resting-state functional Magnetic Resonance Imaging (rs-fMRI) data in rodents. Even though resting-state connectivity is studied in humans already for several years with diverse applications in mental disorders or degenerative brain diseases, the interest for this modality is much more recent and less common in rodents. LÄS MER