Sökning: "GPU performance"

Visar resultat 1 - 5 av 248 uppsatser innehållade orden GPU performance.

  1. 1. An evaluation of GPU virtualization

    Uppsats för yrkesexamina på avancerad nivå, Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Författare :Josef Vilestad; [2024]
    Nyckelord :gpu; virtualization; mig; siov;

    Sammanfattning : There has been extensive research and progress on virtualization on CPUs for a while. More recently the focus on GPU virtualization has increased as processing power doubles roughly every 2.5 years. Coupled with advances in memory management and the PCIe standard the first hardware assisted virtual solutions became available in the 2010s. LÄS MER

  2. 2. IDENTIFICATION OF ENVIRONMENTALLY RELEVANT BENTHIC FORAMINIFERA FROM THE SKAGERRAK FJORDS BY DEEP LEARNING IMAGE MODELING

    Master-uppsats, Göteborgs universitet / Institutionen för biologi och miljövetenskap

    Författare :Marko Plavetic; [2023-06-26]
    Nyckelord :benthic foraminifera; deep learning; environmental monitoring; YOLOv7;

    Sammanfattning : Over the several past decades, there has been increasing interest in using foraminifera as environmental indicators for coastal marine environments. As compared to macrofauna, which are currently used in environmental studies, foraminifera offer several distinct advantages as bioindicators, including short generation times, a high number of individuals per small sample volume, hard and durable tests with high preservation potential, and low cost of sample extraction. LÄS MER

  3. 3. EMONAS : Evolutionary Multi-objective Neuron Architecture Search of Deep Neural Network

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Jiayi Feng; [2023]
    Nyckelord :DNN Deep Neural Network ; NAS Neural Architecture Search ; EA Evolutionary Algorithm ; Multi-Objective Optimization; Binary One Optimization; Embedded Systems; DNN Deep Neural Network ; NAS Neural Architecture Search ; EA Evolutionary Algorithm ; Multi-Objective Optimization; Binary One Optimization; Inbyggda system;

    Sammanfattning : Customized Deep Neural Network (DNN) accelerators have been increasingly popular in various applications, from autonomous driving and natural language processing to healthcare and finance, etc. However, deploying them directly on embedded system peripherals within real-time operating systems (RTOS) is not easy due to the paradox of the complexity of DNNs and the simplicity of embedded system devices. LÄS MER

  4. 4. A comparative performance analysis of Fast Fourier Transformation and Gerstner waves

    Kandidat-uppsats, Blekinge Tekniska Högskola/Institutionen för datavetenskap

    Författare :Morgan Westerberg; Oliver Olguin Jönsson; [2023]
    Nyckelord :Water simulation; Procedural methods; Ocean waves; Fast Fourier Transformation; Gerstner waves;

    Sammanfattning : Background:  As time moves on hardware is able to tackle heavier and more complex computations in real-time systems. This means that more realistic and stylistic environments can be computed. One of these environments is the ocean. LÄS MER

  5. 5. A Conjugate Residual Solver with Kernel Fusion for massive MIMO Detection

    Master-uppsats, Högskolan i Halmstad/Centrum för forskning om tillämpade intelligenta system (CAISR)

    Författare :Ioannis Broumas; [2023]
    Nyckelord :MIMO; massive MIMO; GPU; CUDA; Software Defined Radio; SDR; MMSE; ZF; zero-forcing; parallel detection; iterative methods; conjugate residual; parallel computing; kernel fusion;

    Sammanfattning : This thesis presents a comparison of a GPU implementation of the Conjugate Residual method as a sequence of generic library kernels against implementations ofthe method with custom kernels to expose the performance gains of a keyoptimization strategy, kernel fusion, for memory-bound operations which is to makeefficient reuse of the processed data. For massive MIMO the iterative solver is to be employed at the linear detection stageto overcome the computational bottleneck of the matrix inversion required in theequalization process, which is 𝒪(𝑛3) for direct solvers. LÄS MER