Showing results 1 - 5 of 8 theses matching the search criteria above.

  1. A lightweight deep learning architecture for text embedding : Comparison between the usage of Transformers and Mixers for textual embedding

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Corentin Royer; [2023]
    Keywords: Deep Learning; Entity Retrieval; Mixer; Transformer;

    Abstract: Text embedding is a widely used method for comparing pieces of text together by mapping them to a compact vector space. One such application is deduplication which consists in finding textual records that refer to the same underlying idea in order to merge them or delete one of them.

  2. Free-text Informed Duplicate Detection of COVID-19 Vaccine Adverse Event Reports

    Thesis for a professional degree at advanced level, Uppsala universitet/Avdelningen för systemteknik

    Author: Erik Turesson; [2022]
    Keywords: Duplicate detection; Deduplication; Record linkage; Adverse Event Reports; COVID-19 Vaccines; Uppsala Monitoring Centre; VigiBase; Machine Learning; Gradient Boosted Decision Trees; BERT; Natural Language Processing; Pharmacovigilance; Individual Case Safety Reports;

    Abstract: To increase medicine safety, researchers use adverse event reports to assess causal relationships between drugs and suspected adverse reactions. VigiBase, the world's largest database of such reports, collects data from numerous sources, introducing the risk of several records referring to the same case.

  3. The Cost of Confidentiality in Cloud Storage

    Master's thesis, Linköpings universitet/Databas och informationsteknik

    Author: Eric Henziger; [2018]
    Keywords: cloud storage; file synchronization; client side encryption; compression; deduplication; delta encoding; cpu utilization; memory utilization; performance; measurements; dropbox; google drive; onedrive; tresorit; spideroak; mega; sync.com; macOS; comparison;

    Abstract: Cloud storage services allow users to store and access data in a secure and flexible manner. In recent years, cloud storage services have seen rapid growth in popularity as well as in technological progress and hundreds of millions of users use these services to store thousands of petabytes of data.

  4. Study on Record Linkage regarding Accuracy and Scalability

    Bachelor's thesis, Umeå universitet/Institutionen för datavetenskap

    Author: Johannes Dannelöv; [2018]
    Keywords: (none listed)

    Abstract: The idea of record linkage is to find records that refer to the same entity across different data sources. There are multiple synonyms that refer to record linkage, such as data matching, entity resolution, entity disambiguation, or deduplication etc.

  5. Analysing Performance Effects of Deduplication on Virtual Machine Storage

    Bachelor's thesis, Högskolan i Skövde/Institutionen för informationsteknologi

    Author: Marcus Kauküla; [2017]
    Keywords: Virtualization; Virtual machine storage; Deduplication; ZFS; SDFS;

    Abstract: Virtualization is a widely used technology for running multiple operating systems on a single set of hardware. Virtual machines running the same operating system have been shown to have a large amount of identical data, in such cases deduplication have been shown to be very effective in eliminating duplicated data.