Sökning: "Ping Yan"

Hittade 1 uppsats innehållade orden Ping Yan.

  1. 1. Anomaly Detection in Categorical Data with Interpretable Machine Learning : A random forest approach to classify imbalanced data

    Kandidat-uppsats, Linköpings universitet/Statistik och maskininlärning

    Författare :Ping Yan; [2019]
    Nyckelord :machine learning; decision tree; imbalanced data; anomaly detection; random forest; maskininlärning; beslut träd; obalanserat data; anomalitetsdetektering;

    Sammanfattning : Metadata refers to "data about data", which contains information needed to understand theprocess of data collection. In this thesis, we investigate if metadata features can be usedto detect broken data and how a tree-based interpretable machine learning algorithm canbe used for an effective classification. The goal of this thesis is two-fold. LÄS MER