Search: "Shidi Zhou"

Found 2 theses containing the words Shidi Zhou.

  1. Auto-Tuning Apache Spark Parameters for Processing Large Datasets

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Shidi Zhou; [2023]
    Keywords: Apache Spark; Cloud Environment; Spark Configuration Parameter; Resource Utilization; Ridge Regression; Elastic Net; Random Forest; Deep Neural Network; Bayesian Optimization; Particle Swarm Optimization

    Abstract: Apache Spark is a popular open-source distributed processing framework that enables efficient processing of large amounts of data. Apache Spark has a large number of configuration parameters that are strongly related to performance. Selecting an optimal configuration for an Apache Spark application deployed in a cloud environment is a complex task.

  2. A Web Scraper For Forums : Navigation and text extraction methods

    Bachelor's thesis, KTH/School of Information and Communication Technology (ICT)

    Authors: Michael Palma; Shidi Zhou; [2017]
    Keywords: Data mining; Web Scraper; Java; Web forums; Text-extraction; Link Duplicates

    Abstract: Web forums are a popular way of exchanging information and discussing various topics. These websites usually have a special structure, divided into boards, threads and posts. Although the structure might be consistent across forums, the layout of each forum is different.