Search: "Text Mining Topic Model Polylingual PLT Named Entity Recgonition NER Statistics Machine Learning Duplicate Detection Litterature Fiction Books Book Natural Language Processing NLP"

Found 1 thesis containing the words Text Mining Topic Model Polylingual PLT Named Entity Recgonition NER Statistics Machine Learning Duplicate Detection Litterature Fiction Books Book Natural Language Processing NLP.

  1. Automatic Identification of Duplicates in Literature in Multiple Languages

    Master's thesis, Linköpings universitet/Statistics and Machine Learning

    Author: Emil Klasson Svensson; [2018]
    Keywords: Text Mining Topic Model Polylingual PLT Named Entity Recgonition NER Statistics Machine Learning Duplicate Detection Litterature Fiction Books Book Natural Language Processing NLP;

    Abstract: As the amount of books available online grows, the collections are growing larger at the same pace and more commonly span multiple languages. Many of these corpora contain duplicates in the form of various editions or translations of books.