Sökning: "over-sampling"
Visar resultat 1 - 5 av 14 uppsatser innehållade ordet over-sampling.
1. Optimising Machine Learning Models for Imbalanced Swedish Text Financial Datasets: A Study on Receipt Classification : Exploring Balancing Methods, Naive Bayes Algorithms, and Performance Tradeoffs
Kandidat-uppsats, Linnéuniversitetet/Institutionen för datavetenskap och medieteknik (DM)Sammanfattning : This thesis investigates imbalanced Swedish text financial datasets, specifically receipt classification using machine learning models. The study explores the effectiveness of under-sampling and over-sampling methods for Naive Bayes algorithms, collaborating with Fortnox for a controlled experiment. LÄS MER
2. REINDEER GRAZING IN A NORTHERN BOREAL FOREST : Seasonal and reindeer-induced changes in nutrient availability and soil temperature
Kandidat-uppsats, Umeå universitet/Institutionen för ekologi, miljö och geovetenskapSammanfattning : Soil nutrient availability is a key component to understanding the boreal ecosystems, as it directly relates to plant productivity and ecosystem diversity. There is however little known about how the nutrient availability changes seasonally in the boreal forest. LÄS MER
3. FAULT DETECTION IN AIR HANDLING UNIT (AHU) USING MACHINE LEARNING
Master-uppsats, Högskolan Dalarna/MikrodataanalysSammanfattning : Fault in Air Handling Unit (AHU) of the Heating, Ventilation, and Air Conditioning (HVAC) systems in buildings is a challenge that building managements face. These faults cause buildings to waste 15 – 30% of the energy consumed by the AHU. LÄS MER
4. Predicting purchase intentions of customers by using web data : To identify potential customer groups during sales processes in the real estate sector
Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Avdelningen för systemteknikSammanfattning : This master thesis aims to investigate the possibilities of predicting purchase intentions of customers during their sales processes in the real estate sector. Also, the web activity of customers on a real estate company’s web site is used as the basis for the forecasting. LÄS MER
5. Corporate default prediction: a comparison between Merton model and random forest in an environment of data scarcity
Magister-uppsats, Lunds universitet/Nationalekonomiska institutionenSammanfattning : The aim of this paper is to compare the performance of the Merton model to a machine learning technique (random forest), in a context where the number of predictors is low or the dataset is quite small. Since random forest is a data-intensive method, the main goal is to find the minimum number of explanatory variables and observations that is needed for it to perform at least as well as the Merton model, an approach developed in the 70s that gives the probability of the firm defaulting. LÄS MER