Sökning: "Text Extrahering"

Visar resultat 1 - 5 av 11 uppsatser innehållade orden Text Extrahering.

  1. 1. Accurately extracting information from a finite set of different report categories and formats

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Jonatan Holmbäck; [2023]
    Nyckelord :Text Extraction; PDF; Excel; Text Parsing; Data Analysis; Text Extrahering; PDF; Excel; Text Parsing; Data Analys;

    Sammanfattning : POC Sports (hereafter simply POC) is a company that manufactures gear and accessories for winter sports as well as cycling. Their mission is to “Protect lives and reduce the consequences of accidents for athletes and anyone inspired to be one”. LÄS MER

  2. 2. Classification of invoices using a 2D NLP approach : A comparison between methods for invoice information extraction for the purpose of classification

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Linnéa Fredriksson; [2023]
    Nyckelord :Key-field extraction; Invoices; 2D NLP; Document Intelligence; Visually Rich Documents; LayoutLMv3; Nyckelfältsextraktion; Fakturor; 2-dimensionell naturligtspråkbehandling; LayoutLMv3;

    Sammanfattning : Many companies are handling a large number of invoices every year. To manually categorize them takes a lot of time and resources. For a model to automatically categorize invoices, the documents need to be properly read and processed by the model. LÄS MER

  3. 3. Generic Data Harvester

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :William Asp; Johannes Valck; [2022]
    Nyckelord :News; Articles; Newspapers; Web crawler; Web site parsing; Optimization; Web robot; Web spider; Web data extraction; HTML; Scrapy; Nyheter; Artiklar; Tidningar; Sökrobot; Analys av hemsida; Optimering; Webbrobot; Webbspindel; Data extrahering hemsidor; HTML; Scrapy;

    Sammanfattning : This report goes through the process of developing a generic article scraper which shall extract relevant information from an arbitrary web article. The extraction is implemented by searching and examining the HTML of the article, by using Python and XPath. LÄS MER

  4. 4. Extracting information about arms deals from news articles

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Fredrik Hernqvist; [2022]
    Nyckelord :Natural Language Processing; Machine Learning; Deep Learning; BERT; ALBERT; Arms Transfers; Information Extraction; Behandling av naturliga språk; maskininlärning; djupinlärning; BERT; ALBERT; vapenaffärer;

    Sammanfattning : The Stockholm International Peace Research Institute (SIPRI) maintains the most comprehensive publicly available database on international arms deals. Updating this database requires humans to sift through large amounts of news articles, only some of which contain information relevant to the database. LÄS MER

  5. 5. Extracting relevant answer phrases from text : For usage in reading comprehension question generation

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Filippa Kärrfelt; [2022]
    Nyckelord :Answer phrase extraction; Question generation; BERT; Reading comprehension; Neural networks; Extrahering av svarsfraser; Frågegenerering; BERT; Läsförståelse; Neurala nätverk;

    Sammanfattning : This report presents a method for extracting answer phrases, suitable as answers to reading comprehension questions, from Swedish text. All code used to produce the results is available on github*. The method is developed using a Swedish BERT, a pre-trained language model based on neural networks. LÄS MER