Sökning: "over-sampling"

Visar resultat 1 - 5 av 14 uppsatser innehållade ordet over-sampling.

  1. 1. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs

    Kandidat-uppsats, Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)

    Författare :Li Ang Hu; Long Ma; [2023]
    Nyckelord :Imbalanced datasets; Swedish text financial datasets; Accuracy; Matthews correlation coefficient; Recall; Multinomial Naive Bayes; SMOTE; TomekLinks; Performance optimization;

    Sammanfattning : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. LÄS MER

  2. 2. REINDEER GRAZING IN A NORTHERN BOREAL FOREST : Seasonal and reindeer-induced changes in nutrient availability and soil temperature

    Kandidat-uppsats, Umeå universitet/Institutionen för ekologi, miljö och geovetenskap

    Författare :Agnes Karlsson; [2023]
    Nyckelord :nutrient availability; soil temperature; reindeer grazing; seasonal nutrient cycling; boreal forest;

    Sammanfattning : Soil nutrient availability is a key component to understanding the boreal ecosystems, as it directly relates to plant productivity and ecosystem diversity. There is however little known about how the nutrient availability changes seasonally in the boreal forest. LÄS MER

  3. 3. FAULT DETECTION IN AIR HANDLING UNIT (AHU) USING MACHINE LEARNING

    Master-uppsats, Högskolan Dalarna/Mikrodataanalys

    Författare :Humphry Takang Bate; Wilkingson Igbinosun; [2022]
    Nyckelord :Fault Detection; AHU; Machine Learning;

    Sammanfattning : Fault in Air Handling Unit (AHU) of the Heating, Ventilation, and Air Conditioning (HVAC) systems in buildings is a challenge that building managements face. These faults cause buildings to waste 15 – 30% of the energy consumed by the AHU. LÄS MER

  4. 4. Predicting purchase intentions of customers by using web data : To identify potential customer groups during sales processes in the real estate sector

    Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Avdelningen för systemteknik

    Författare :Olle Kåhre Zäll; [2022]
    Nyckelord :Web Usage Mining; Classification; Purchase Intentions; Time On the Market;

    Sammanfattning : This master thesis aims to investigate the possibilities of predicting purchase intentions of customers during their sales processes in the real estate sector. Also, the web activity of customers on a real estate company’s web site is used as the basis for the forecasting. LÄS MER

  5. 5. Corporate default prediction: a comparison between Merton model and random forest in an environment of data scarcity

    Magister-uppsats, Lunds universitet/Nationalekonomiska institutionen

    Författare :Aitor Díaz García; Matiss Mirosnikovs; [2022]
    Nyckelord :Merton model; random forest; default prediction; SMOTE.; Business and Economics;

    Sammanfattning : The aim of this paper is to compare the performance of the Merton model to a machine learning technique (random forest), in a context where the number of predictors is low or the dataset is quite small. Since random forest is a data-intensive method, the main goal is to find the minimum number of explanatory variables and observations that is needed for it to perform at least as well as the Merton model, an approach developed in the 70s that gives the probability of the firm defaulting. LÄS MER