Sökning: "Stor Språkmodell"

Visar resultat 1 - 5 av 15 uppsatser innehållade orden Stor Språkmodell.

  1. 1. Topological regularization and relative latent representations

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Alejandro García Castellanos; [2023]
    Nyckelord :Algebraic Topology; Large Language Models; Relative Representation; Representation Learning; Model Stitching; Topological DataAnalysis; Zero-shot; Algebraisk topologi; Stora språkmodeller; Relativ representation; Representationsinlärning; Modell sömmar; Topologisk dataanalys; Zero-shot;

    Sammanfattning : This Master's Thesis delves into the application of topological regularization techniques and relative latent representations within the realm of zero-shot model stitching. Building upon the prior work of Moschella et al. LÄS MER

  2. 2. Efficient Sentiment Analysis and Topic Modeling in NLP using Knowledge Distillation and Transfer Learning

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :George Malki; [2023]
    Nyckelord :Large Language Model; RoBERTa; Knowledge distillation; Transfer learning; Sentiment analysis; Topic modeling; Stor språkmodell; RoBERTa; Kunskapsdestillation; överföringsinlärning; Sentimentanalys; Ämnesmodellering;

    Sammanfattning : This abstract presents a study in which knowledge distillation techniques were applied to a Large Language Model (LLM) to create smaller, more efficient models without sacrificing performance. Three configurations of the RoBERTa model were selected as ”student” models to gain knowledge from a pre-trained ”teacher” model. LÄS MER

  3. 3. Contextual short-term memory for LLM-based chatbot

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Mikael Lauri Aleksi Törnwall; [2023]
    Nyckelord :Chatbot; Artificial Intelligence; Machine Learning; Language Model; Large Language Model; GPT-3; Natural Language Processing; Text Summarization; Dialogue Summarization; Prompt Design; Prompt Programming; Chatbot; Artificiell Intelligens; Maskininlärning; Språkmodell; Stor Språkmodell; GPT-3; Naturlig Ppråkbehandling; Textsammanfattning; Sammanfattning av Dialog; Design för Inmatningsprompt; Inmatningsprompt Programmering;

    Sammanfattning : The evolution of Language Models (LMs) has enabled building chatbot systems that are capable of human-like dialogues without the need for fine-tuning the chatbot for a specific task. LMs are stateless, which means that a LM-based chatbot does not have a recollection of the past conversation unless it is explicitly included in the input prompt. LÄS MER

  4. 4. Java Unit Testing with AI: An AI-Driven Prototype for Unit Test Generation

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Katrin Kahur; Jennifer Su; [2023]
    Nyckelord :Unit testing; Artificial intelligence; Large language model; GPT-3.5; Java; Quantitative method; Software development; Enhetstestning; Artificiell intelligens; Stor språkmodell; GPT-3.5; Java; Kvantitativ metod; Mjukvaruutveckling;

    Sammanfattning : In recent years, artificial intelligence (AI) has become increasingly popular. An area where AI technology is used and has received much attention during the past year is chatbots. They can simulate an understanding of human language and form text responses to questions asked. LÄS MER

  5. 5. KARTAL: Web Application Vulnerability Hunting Using Large Language Models : Novel method for detecting logical vulnerabilities in web applications with finetuned Large Language Models

    Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Sinan Sakaoglu; [2023]
    Nyckelord :Broken Access Control; Vulnerability; Large Language Models; Web Application; API; Detection; Scanner; DAST; Application Security; Brutet åtkomstkontroll; Sårbarhet; Stora språkmodeller; Webbapplikation; API; Upptäckt; Skanner; DAST; Applikationssäkerhet;

    Sammanfattning : Broken Access Control is the most serious web application security risk as published by Open Worldwide Application Security Project (OWASP). This category has highly complex vulnerabilities such as Broken Object Level Authorization (BOLA) and Exposure of Sensitive Information. LÄS MER