Sökning: "GPU parallel computing"
Visar resultat 1 - 5 av 40 uppsatser innehållade orden GPU parallel computing.
1. Parallelising High OrderTransform of Point SpreadFunction and TemplateSubtraction for AstronomicImage Subtraction : The implementation of BACH
Master-uppsats, Linköpings universitet/InformationskodningSammanfattning : This thesis explores possible improvements, using parallel computing, to the PSF-alignment and image subtraction algorithm found in HOTPANTS. In time-domain astronomy the PSF-alignment and image subtraction algorithm OIS is used to identify transient events. LÄS MER
2. A Conjugate Residual Solver with Kernel Fusion for massive MIMO Detection
Master-uppsats, Högskolan i Halmstad/Centrum för forskning om tillämpade intelligenta system (CAISR)Sammanfattning : This thesis presents a comparison of a GPU implementation of the Conjugate Residual method as a sequence of generic library kernels against implementations ofthe method with custom kernels to expose the performance gains of a keyoptimization strategy, kernel fusion, for memory-bound operations which is to makeefficient reuse of the processed data. For massive MIMO the iterative solver is to be employed at the linear detection stageto overcome the computational bottleneck of the matrix inversion required in theequalization process, which is 𝒪(𝑛3) for direct solvers. LÄS MER
3. Real-time Soft Body Simulation using Extended Position-Based Dynamics and Tetrahedral Deformation
Kandidat-uppsats, Blekinge Tekniska Högskola/Institutionen för datavetenskapSammanfattning : Background. Several methods have been used to simulate soft body deformation, such as mass-spring systems and position-based dynamics. This has been done using tetrahedral mesh models for preservation of shape and volume. LÄS MER
4. Evaluation of FPGA-based High Performance Computing Platforms
Master-uppsats, Linköpings universitet/DatorteknikSammanfattning : High performance computing is a topic that has risen to the top in the era ofdigitalization, AI and automation. Therefore, the search for more cost and timeeffective ways to implement HPC work is always a subject extensively researched.One part of this is to have hardware that is capable to improve on these criteria. LÄS MER
5. Register Caching for Energy Efficient GPGPU Tensor Core Computing
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : The General-Purpose GPU (GPGPU) has emerged as the predominant computing device for extensive parallel workloads in the fields of Artificial Intelligence (AI) and Scientific Computing, primarily owing to its adoption of the Single Instruction Multiple Thread architecture, which not only provides a wealth of thread context but also effectively hide the latencies exposed in the single threads executions. As computational demands have evolved, modern GPGPUs have incorporated specialized matrix engines, e. LÄS MER