Sökning: "Duplicate Detection"

Visar resultat 1 - 5 av 18 uppsatser innehållade orden Duplicate Detection.

  1. 1. A Case Study on the Limitations of Automated Duplicate Bug Report Detection

    Kandidat-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknik

    Författare :Malte Götharsson; Karl Stahre; [2023-09-26]
    Nyckelord :;

    Sammanfattning : Identifying duplicate bug reports is crucial in software development as it helps streamline the debugging process, reduce redundancy, and enhance overall efficiency. By addressing the challenges associated with existing automated techniques and leveraging testers’ expertise, the tool proposed in this study aims to improve the accuracy of duplicate detection, saving valuable time and resources while ensuring that potential duplicates are not overlooked. LÄS MER

  2. 2. Evaluation of Machine Learning techniques for Master Data Management

    Magister-uppsats, Högskolan i Skövde/Institutionen för informationsteknologi

    Författare :Fatime Toçi; [2023]
    Nyckelord :Master Data Management; Machine Learning; data quality; data duplicates;

    Sammanfattning : In organisations, duplicate customer master data present a recurring problem. Duplicate records can result in errors, complication, and inefficiency since they frequently result from dissimilar systems or inadequate data integration. LÄS MER

  3. 3. Duplicate detection of multimodal and domain-specific trouble reports when having few samples : An evaluation of models using natural language processing, machine learning, and Siamese networks pre-trained on automatically labeled data

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Viktor Karlstrand; [2022]
    Nyckelord :Duplicate detection; Bug reports; Trouble reports; Natural language processing; Information retrieval; Machine learning; Siamese neural network; Transformers; Automated data labeling; Shapley values; Dubblettdetektering; Felrapporter; Buggrapporter; Naturlig språkbehandling; Informationssökning; Maskininlärning; Siamesiska neurala nätverk; Transformatorer; Automatiserad datamärkning; Shapley-värden;

    Sammanfattning : Trouble and bug reports are essential in software maintenance and for identifying faults—a challenging and time-consuming task. In cases when the fault and reports are similar or identical to previous and already resolved ones, the effort can be reduced significantly making the prospect of automatically detecting duplicates very compelling. LÄS MER

  4. 4. Free-text Informed Duplicate Detection of COVID-19 Vaccine Adverse Event Reports

    Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Avdelningen för systemteknik

    Författare :Erik Turesson; [2022]
    Nyckelord :Duplicate detection; Deduplication; Record linkage; Adverse Event Reports; COVID-19 Vaccines; Uppsala Monitoring Centre; VigiBase; Machine Learning; Gradient Boosted Decision Trees; BERT; Natural Language Processing; Pharmacovigilance; Individual Case Safety Reports;

    Sammanfattning : To increase medicine safety, researchers use adverse event reports to assess causal relationships between drugs and suspected adverse reactions. VigiBase, the world's largest database of such reports, collects data from numerous sources, introducing the risk of several records referring to the same case. LÄS MER

  5. 5. Finding duplicate offers in the online marketplace catalogue using transformer based methods : An exploration of transformer based methods for the task of entity resolution

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Robert-Andrei Damian; [2022]
    Nyckelord :Transformers; Language Models; Deep Neural Networks; Entity Resolution; Duplicate Detection; Entity Matching; Record Linkage; Contrastive Learning; e-commerce; Transformers; Modèles de langage; Apprentisage en profondeur; Résolution d’entité; Détection de doublons; Apprentisage contrastif; commerce électronique; Transformers; Språkmodeller; Djupinlärning; Entitetserkännande; Dubblettdetektering; Entitetsmatchning; Rekordkoppling; e-handel;

    Sammanfattning : The amount of data available on the web is constantly growing, and e-commerce websites are no exception. Considering the abundance of available information, finding offers for the same product in the catalogue of different retailers represents a challenge. This problem is an interesting one and addresses the needs of multiple actors. LÄS MER