Sökning: "get corpus"
Visar resultat 1 - 5 av 32 uppsatser innehållade orden get corpus.
1. REDUPLICATED FORM OF ADJECTIVES IN SOMALI: A plural form or something else?
Master-uppsats, Göteborgs universitet/Institutionen för språk och litteraturerSammanfattning : Abstract The Somali adjectives gets reduplicated. This form of the adjective usually follows the nouns in the plural and the base form follows the nouns in the singular. The thesis aims to study the form and function of the reduplication of the adjectives (in contrast with the base form). LÄS MER
2. Analysing CSR reporting over the years, company size, region, and sector through dictionary-based text mining
Master-uppsats, Högskolan Dalarna/Institutionen för information och teknikSammanfattning : As Corporate Social Responsibility (CSR) reports become more prevalent and systematised, there is a strong need to develop approaches that seek to analyse the contents of these reports. In this thesis, we present two valuable contributions. LÄS MER
3. Synthetic data generation for domain adaptation of a retriever-reader Question Answering system for the Telecom domain : Comparing dense embeddings with BM25 for Open Domain Question Answering
Master-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)Sammanfattning : Having computer systems capable of answering questions has been a goal within Natural Language Processing research for many years. Machine Learning systems have recently become increasingly proficient at this task with large language models obtaining state-of-the-art performance. LÄS MER
4. Semantic Topic Modeling and Trend Analysis
Master-uppsats, Linköpings universitet/Statistik och maskininlärningSammanfattning : This thesis focuses on finding an end-to-end unsupervised solution to solve a two-step problem of extracting semantically meaningful topics and trend analysis of these topics from a large temporal text corpus. To achieve this, the focus is on using the latest develop- ments in Natural Language Processing (NLP) related to pre-trained language models like Google’s Bidirectional Encoder Representations for Transformers (BERT) and other BERT based models. LÄS MER
5. Análisis de errores gramaticales en el aula de ELE : Un estudio de la producción escrita y la producción oral en la escuela sueca
Kandidat-uppsats, Högskolan Dalarna/Institutionen för språk, litteratur och lärandeSammanfattning : The present study aimed to investigate and analyze the most frequent grammatical errors in both oral and written production, what causes the errors as well as the possible correlation that may exist between them by means of a quantitative and qualitative method.The sample selected to perform the investigation is composed of 23 students who study the eighth course in the primary Swedish school. LÄS MER