Search: "encoder"

Showing results 11-15 of 264 theses containing the word encoder.

  11. Constructing and representing a knowledge graph(KG) for Positive Energy Districts (PEDs)

    Master's thesis, Högskolan Dalarna/Institutionen för information och teknik

    Author: Mahtab Davari; [2023]
    Keywords: Knowledge graph; Positive Energy Districts PEDs; longest path; Questions and Answers; Community Detection; Node Embedding; t-SNE plots; Edge Prediction;

    Abstract: In recent years, knowledge graphs (KGs) have become essential tools for visualizing concepts and retrieving contextual information. However, constructing KGs for new and specialized domains like Positive Energy Districts (PEDs) presents unique challenges, particularly when dealing with unstructured texts and ambiguous concepts from academic articles.

  12. Visual Attention Guided Adaptive Quantization for x265 using Deep Learning

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Mikaela Gärde; [2023]
    Keywords: video encoding; deep learning; visual attention; adaptive quantization; videokodning; djupinlärning; visuellt fokus; adaptiv kvantisering;

    Abstract: Video-on-demand streaming is rising drastically in popularity, bringing new challenges to the video coding field. There is a need for new video coding techniques that improve performance and reduce bitrates.

  13. Text to Music Audio Generation using Latent Diffusion Model : A re-engineering of AudioLDM Model

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Ernan Wang; [2023]
    Keywords: Text to Music Audio Generation; Latent Diffusion; AudioLDM; Sampling Methods; Denoising Diffusion Probabilistic Model DDPM; Denoising Diffusion Implicit Model DDIM; Text till musik Ljudgenerering; Latent Diffusion; AudioLDM; Samplingsmetoder; DDPM; DDIM;

    Abstract: In the emerging field of audio generation using diffusion models, this project pioneers the adaptation of the AudioLDM model framework, initially designed for generating everyday sounds from text, towards text-to-music audio generation. This shift addresses a gap in the current scope of audio diffusion models, which focus predominantly on everyday sounds.

  14. Multi-Scale Task Dynamics in Transfer and Multi-Task Learning : Towards Efficient Perception for Autonomous Driving

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Simon Ekman von Huth; [2023]
    Keywords: Autonomous Driving; Computer Vision; Deep Learning; Machine Learning; Multi-Task Learning; Transfer Learning; Task Relationships; Task Dynamics; Python; Multi-Scale Representation Learning; Fuss-Free Network; Självkörande Fordon; Datorseende; Djupinlärning; Maskininlärning; Multiuppgiftsinlärning; Överföringsinlärning; Uppgiftsrelationer; Uppgiftsdynamik; Python; Flerskalig Representationsinlärning; Fuss-Free Nätverk;

    Abstract: Autonomous driving technology has the potential to revolutionize the way we think about transportation and its impact on society. Perceiving the environment is a key aspect of autonomous driving, which involves multiple computer vision tasks.

  15. Robust Multi-Modal Fusion for 3D Object Detection : Using multiple sensors of different types to robustly detect, classify, and position objects in three dimensions.

    Master's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Viktor Kårefjärd; [2023]
    Keywords: Computer Vision; 3D Object Detection; Multi-Modal Fusion; Deep Learning; Datorseenden; 3D-objektdetektion; Multimodal fusion; Djupinlärning;

    Abstract: The computer vision task of 3D object detection is fundamentally necessary for autonomous driving perception systems. These vehicles typically feature a multitude of sensors, such as cameras, radars, and light detection and ranging sensors.