Search: "Apache Hudi"

Found 2 theses containing the words Apache Hudi.

  1. Scaling Apache Hudi by boosting query performance with RonDB as a Global Index : Adopting a LATS data store for indexing

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Ralfs Zangis; [2022]
    Keywords: Apache Hudi; Lakehouse; RonDB; Performance; Index; Key-value store;

    Abstract: The storage and use of voluminous data are perplexing issues, the resolution of which has become more pressing with the exponential growth of information. Lakehouses are relatively new approaches that try to accomplish this while hiding the complexity from the user.

  2. Hudi on Hops : Incremental Processing and Fast Data Ingestion for Hops

    Master's thesis, KTH/School of Electrical Engineering and Computer Science (EECS)

    Author: Netsanet Gebretsadkan Kidane; [2019]
    Keywords: Hudi; Hadoop; Hops; Upsert; SQL; Spark; Kafka;

    Abstract: In the era of big data, data is flooding in from numerous sources, and many companies use different types of tools to load and process data from various sources in a data lake. The major challenge these companies face today is how to update data in an existing dataset without having to read the entire dataset and overwrite it to accommodate the changes, which has a negative impact on performance.