  1. Building a high throughput microscope simulator using the Apache Kafka streaming framework

    Professional degree thesis (advanced level), Uppsala universitet/Avdelningen för beräkningsvetenskap

    Author: Lovisa Lugnegård; [2018]
    Keywords: Data streaming; Cloud computing; Apache Kafka

    Abstract: Today microscopy imaging is a widely used and powerful method for investigating biological processes. The microscopes can produce large amounts of data in a short time. It is therefore impossible to analyse all the data thoroughly because of time and cost constraints.

  2. Investigating the scaleability of analyzing and processing RDBMS datasets with Apache Spark

    Bachelor's thesis, Uppsala universitet/Institutionen för informationsteknologi

    Author: Ferhat Bahceci; [2018]
    Abstract: At the Uppsala Monitoring Centre (UMC), individual case safety reports (ICSRs) are managed, analyzed and processed for publishing statistics of adverse drug reactions. On top of the UMC’s ICSR database there is a data processing tool used to analyze the data.

  3. A Quantative Study of Social Media Echo Chambers

    Professional degree thesis (advanced level), Uppsala universitet/Matematiska institutionen

    Author: Joakim Johansson; [2018]
    Abstract: The changing online environment - where the breadth of the information we are exposed to is algorithmically narrowed - has raised concerns about the creation of "echo chambers", in which individuals are exposed mainly to information already in alignment with their preconceived ideas and opinions. This thesis explores the role of Twitter as a social media and as an information network, and investigates if exposure to and participation in political discussions resembles echo chambers.

  4. Performance Evaluation of Cassandra in a Virtualized Environment

    Master's thesis, Blekinge Tekniska Högskola/Institutionen för datalogi och datorsystemteknik

    Author: Mohit Vellanki; [2017]
    Keywords: Cassandra; Virtualization; NoSQL databases

    Abstract: Context. Apache Cassandra is an open-source, scalable, NoSQL database that distributes the data over many commodity servers. It provides no single point of failure by copying and storing the data in different locations. Cassandra uses a ring design rather than the traditional master-slave design.

  5. Visual Debugging of Dataflow Systems

    Master's thesis, KTH/Skolan för informations- och kommunikationsteknik (ICT)

    Author: Fanti Machmount Al Samisti; [2017]
    Abstract: Big data processing has seen vast integration into the idea of data analysis in live streaming and batch environments. A plethora of tools have been developed to break down a problem into manageable tasks and to allocate both software and hardware resources in a distributed and fault tolerant manner.


