Search: "MapReduce"

Showing results 16 - 20 of 33 theses containing the word MapReduce.

  16. Data Analysis on Hadoop - finding tools and applications for Big Data challenges

    Master's thesis, Uppsala universitet/Institutionen för informationsteknologi

    Author: Juan De Dios Santos Rivera; [2015]
    Keywords:

    Abstract: With the increasing amount of data generated each day, recent developments in software provide the tools needed to tackle the challenges of the so-called Big Data era. This project introduces some of these platforms; in particular, it focuses on platforms for data analysis and query tools that work alongside Hadoop.

  17. NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection

    Bachelor's thesis, Uppsala universitet/Institutionen för informationsteknologi

    Author: Lukas Klingsbo; [2015]
    Keywords:

    Abstract: Collecting and analysing data of analytical value is important for many service providers today. Many make use of NoSQL databases for their larger software systems; what is less well known is how to effectively analyse and gather business intelligence from the data in these systems.

  18. Distributed Resource Management for YARN

    Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT)

    Author: Srijeyanthan Kuganesan; [2015]
    Keywords:

    Abstract: In the last year, Hadoop YARN has become the de facto standard resource management platform for data-intensive applications, with support for a wide range of data analytics platforms such as Apache Spark, MapReduce V2, MPI, Apache Flink, and Apache Giraph. The ResourceManager fulfills three main functions: it manages the set of active applications (Applications service), it schedules resources (CPU, memory) for applications (the FIFO/Capacity/Fair Scheduler), and it monitors the state of resources in the cluster (ResourceTracker service).

  19. Scaling YARN: A Distributed Resource Manager for Hadoop

    Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT)

    Author: Theofilos Kakantousis; [2014]
    Keywords:

    Abstract: In recent years, there has been a growing need for computer systems that are capable of handling unprecedented amounts of data. To this end, Hadoop HDFS and Hadoop YARN have become the de facto standard for meeting demanding storage requirements and for managing applications that can process this data.

  20. Indexing Genomic Data on Hadoop

    Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT)

    Author: Peter Büchler; [2014]
    Keywords:

    Abstract: In recent years, Hadoop has been used as a standard backend for big data applications. Its best-known application, MapReduce, provides a powerful parallel programming paradigm. Big companies storing petabytes of data, such as Facebook and Yahoo, deployed their own Hadoop distributions for data analytics, interactive services, etc.