Sökning: "HopsFS"

Visar resultat 1 - 5 av 10 uppsatser innehållade ordet HopsFS.

  1. 1. Data Build Tool (DBT) Jobs in Hopsworks

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Zidi Chen; [2022]
    Nyckelord :feature engineering; Structured Query Language SQL ; funktionsteknik; strukturerat frågespråk SQL ;

    Sammanfattning : Feature engineering at scale is always critical and challenging in the machine learning pipeline. Modern data warehouses enable data analysts to do feature engineering by transforming, validating and aggregating data in Structured Query Language (SQL). LÄS MER

  2. 2. Project based multi-tenant managed RStudio on Kubernetes for Hopsworks

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Gibson Chikafa; [2021]
    Nyckelord :Multi-tenancy; Cloud computing; Performance isolation; Security; Scaling; Docker; Kubernetes; Azure; GCP; Multitenans; Molntjänster; Prestandaisolering; Säkerhet; Skalning; Docker; Kubernetes; Azure; GCP;

    Sammanfattning : In order to fully benefit from cloud computing, services are designed following the “multi-tenant” architectural model which is aimed at maximizing resource sharing among users. However, multi-tenancy introduces challenges of security, performance isolation, scaling and customization. LÄS MER

  3. 3. Spark on Kubernetes using HopsFS as a backing store : Measuring performance of Spark with HopsFS for storing and retrieving shuffle files while running on Kubernetes

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Shivam Saini; [2020]
    Nyckelord :Spark; Kubernetes; HopsFS; Data processing; Distributed and Parallel processing;

    Sammanfattning : Data is a raw list of facts and details, such as numbers, words, measurements or observations that is not useful for us all by itself. Data processing is a technique that helps to process the data in order to get useful information out of it. Today, the world produces huge amounts of data that can not be processed using traditional methods. LÄS MER

  4. 4. Towards an S3-based, DataNode-less implementation of HDFS

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Franco Jesus Caceres Gutierrez; [2020]
    Nyckelord :Hadoop distributed file system; HDFS; HopsFS; S3;

    Sammanfattning : The relevance of data processing and analysis today cannot be overstated. The convergence of several technological advancements has fostered the proliferation of systems and infrastructure that together support the generation, transmission, and storage of nearly 15,000 exabytes of digital, analyzabledata. LÄS MER

  5. 5. S3-HopsFS: A Scalable Cloud-native Distributed File System

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Joel Stenkvist; [2019]
    Nyckelord :;

    Sammanfattning : Data has been regarded as the new oil in today’s modern world. Data is generated everywhere from how you do online shopping to where you travel. Companies rely on analyzing this data to make informed business decisions and improve their products and services. However, storing this massive amount of data can be very expensive. LÄS MER