Sökning: "high dimensional data"

Visar resultat 1 - 5 av 313 uppsatser innehållade orden high dimensional data.

  1. 1. Feature Selection for Microarray Data via Stochastic Approximation

    Master-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknik

    Författare :Erik Rosvall; [2024-03-18]
    Nyckelord :feature selection; feature ranking; microarray data; stochastic approximation; Barzilai and Borwein method; Machine Learning; AI;

    Sammanfattning : This thesis explores the challenge of feature selection (FS) in machine learning, which involves reducing the dimensionality of data. The selection of a relevant subset of features from a larger pool has demonstrated its effectiveness in enhancing the performance of various machine learning algorithms. LÄS MER

  2. 2. Geometry of high dimensional Gaussian data

    Kandidat-uppsats, Linköpings universitet/Tillämpad matematik; Linköpings universitet/Tekniska fakulteten

    Författare :Olof Samuel Mossberg; [2024]
    Nyckelord :HDLSS; high dimensional data; stochastic boundedness; asymptotic orthogonality; geometry; multivariate normal distribution; HDLSS; högdimensionell data; stokastisk begränsning; asymptotisk ortogonalitet; geometri; multivariat normalfördelning;

    Sammanfattning : Collected data may simultaneously be of low sample size and high dimension. Such data exhibit some geometric regularities consisting of a single observation being a rotation on a sphere, and a pair of observations being orthogonal. This thesis investigates these geometric properties in some detail. LÄS MER

  3. 3. Variational AutoEncoders and Differential Privacy : balancing data synthesis and privacy constraints

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Baptiste Bremond; [2024]
    Nyckelord :TVAE; Differential privacy; Tabular data; Synthetic data; DP-SGD; TVAE; differentiell integritet; tabelldata; syntetiska data; DP-SGD;

    Sammanfattning : This thesis investigates the effectiveness of Tabular Variational Auto Encoders (TVAEs) in generating high-quality synthetic tabular data and assesses their compliance with differential privacy principles. The study shows that while TVAEs are better than VAEs at generating synthetic data that faithfully reproduces the distribution of real data as measured by the Synthetic Data Vault (SDV) metrics, the latter does not guarantee that the synthetic data is up to the task in practical industrial applications. LÄS MER

  4. 4. An evaluation study of 3D imaging technology as a tool to estimate body weight and growth in dairy heifers

    Master-uppsats, SLU/Dept. of Animal Nutrition and Management

    Författare :Emelie Ahlberg; [2024]
    Nyckelord :body measurement; body weight; growth; heifer; three-dimensional imaging; young stock management;

    Sammanfattning : The aim of this thesis was to evaluate the use of a 3D camera as a tool to estimate body weight and growth in dairy heifers. Data collection lasted from October 2022 to January 2023 and was performed at the Swedish Livestock Research Centre in Uppsala, Sweden. LÄS MER

  5. 5. Regularization Methods and High Dimensional Data: A Comparative Study Based on Frequentist and Bayesian Methods

    Kandidat-uppsats, Lunds universitet/Statistiska institutionen

    Författare :Markus Gerholm; Johan Sörstadius; [2024]
    Nyckelord :Linear regression; high dimensional data; regularization; Bayesian methods; Mathematics and Statistics;

    Sammanfattning : As the amount of high dimensional data becomes increasingly accessible and common, the need for reliable methods to combat problems such as overfitting and multicollinearity increases. Models need to be able to manage large data sets where predictor variables often outnumber the amount of observations. LÄS MER