Sökning: "Dubblettdetektering"

Hittade 2 uppsatser innehållade ordet Dubblettdetektering.

  1. 1. Duplicate detection of multimodal and domain-specific trouble reports when having few samples : An evaluation of models using natural language processing, machine learning, and Siamese networks pre-trained on automatically labeled data

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Viktor Karlstrand; [2022]
    Nyckelord :Duplicate detection; Bug reports; Trouble reports; Natural language processing; Information retrieval; Machine learning; Siamese neural network; Transformers; Automated data labeling; Shapley values; Dubblettdetektering; Felrapporter; Buggrapporter; Naturlig språkbehandling; Informationssökning; Maskininlärning; Siamesiska neurala nätverk; Transformatorer; Automatiserad datamärkning; Shapley-värden;

    Sammanfattning : Trouble and bug reports are essential in software maintenance and for identifying faults—a challenging and time-consuming task. In cases when the fault and reports are similar or identical to previous and already resolved ones, the effort can be reduced significantly making the prospect of automatically detecting duplicates very compelling. LÄS MER

  2. 2. Finding duplicate offers in the online marketplace catalogue using transformer based methods : An exploration of transformer based methods for the task of entity resolution

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Robert-Andrei Damian; [2022]
    Nyckelord :Transformers; Language Models; Deep Neural Networks; Entity Resolution; Duplicate Detection; Entity Matching; Record Linkage; Contrastive Learning; e-commerce; Transformers; Modèles de langage; Apprentisage en profondeur; Résolution d’entité; Détection de doublons; Apprentisage contrastif; commerce électronique; Transformers; Språkmodeller; Djupinlärning; Entitetserkännande; Dubblettdetektering; Entitetsmatchning; Rekordkoppling; e-handel;

    Sammanfattning : The amount of data available on the web is constantly growing, and e-commerce websites are no exception. Considering the abundance of available information, finding offers for the same product in the catalogue of different retailers represents a challenge. This problem is an interesting one and addresses the needs of multiple actors. LÄS MER