Search: "MapReduce"
Showing results 16 - 20 of 33 theses containing the word MapReduce.
16. Data Analysis on Hadoop - finding tools and applications for Big Data challenges
Master's thesis, Uppsala universitet/Institutionen för informationsteknologi. Abstract: With the increasing amount of data generated each day, recent developments in software provide the tools needed to tackle the challenges of the so-called Big Data era. This project introduces some of these platforms; in particular, it focuses on platforms for data analysis and query tools that work alongside Hadoop.
17. NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
Bachelor's thesis, Uppsala universitet/Institutionen för informationsteknologi. Abstract: Collecting and analysing data of analytical value is important for many service providers today. Many make use of NoSQL databases for their larger software systems; what is less known is how to effectively analyse and gather business intelligence from the data in these systems.
18. Distributed Resource Management for YARN
Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT). Abstract: In the last year, Hadoop YARN has become the de facto standard resource management platform for data-intensive applications, with support for a wide range of data analytics platforms such as Apache Spark, MapReduce V2, MPI, Apache Flink, and Apache Giraph. The ResourceManager fulfills three main functions: it manages the set of active applications (Applications service), it schedules resources (CPU, memory) to applications (the FIFO/Capacity/Fair Scheduler), and it monitors the state of resources in the cluster (ResourceTracker service).
19. Scaling YARN: A Distributed Resource Manager for Hadoop
Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT). Abstract: In recent years, there has been a growing need for computer systems that are capable of handling unprecedented amounts of data. To this end, Hadoop HDFS and Hadoop YARN have become the de facto standard for meeting demanding storage requirements and for managing applications that can process this data.
20. Indexing Genomic Data on Hadoop
Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT). Abstract: In recent years, Hadoop has been used as a standard backend for big data applications. Its best-known application, MapReduce, provides a powerful parallel programming paradigm. Big companies storing petabytes of data, like Facebook and Yahoo, deployed their own Hadoop distributions for data analytics, interactive services, etc.