Sökning: "Website data extraction"

Visar resultat 1 - 5 av 13 uppsatser innehållade orden Website data extraction.

  1. 1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Rikard Johansson; [2023]
    Nyckelord :Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction; Webbskrapning; Webbsökrobotar; HTML; Scrapy; Optimering; Webbdataextraktion;

    Sammanfattning : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction. LÄS MER

  2. 2. Förutsättningar för en cirkulär möbelindustri : en fallstudie på ett nordiskt möbelföretag

    Master-uppsats, Linköpings universitet/Industriell miljöteknik

    Författare :Fritjof Axelsson; Tim Ericson; [2023]
    Nyckelord :Access-based; Barriers; Business models; Circular; Collaborative; Consumption; Economy; Enablers; Furniture; Implementation; Industry; PaaS; Product; PSS; Refurbishment; Remanufacturing; Service; Servitization; Strategy; Sweden; Systems;

    Sammanfattning : The furniture industry is an integral part of the European economy and is now facing economic, environmental, and regulatory challenges. Within the European Union (EU), a large amount of furniture every year goes to incineration or landfill, with only 10% being recycled. LÄS MER

  3. 3. Effectivisation of keywords extraction process : A supervised binary classification approach of scraped words from company websites

    Uppsats för yrkesexamina på avancerad nivå, Umeå universitet/Institutionen för matematik och matematisk statistik

    Författare :Josef Andersson; Max Fremling; [2023]
    Nyckelord :Machine learning; keyword classification; unbalanced data; word embedding;

    Sammanfattning : In today’s digital era, establishing an online presence and maintaining a well-structured website is vitalfor companies to remain competitive in their respective markets. A crucial aspect of online success liesin strategically selecting the right words to optimize customer engagement and search engine visibility. LÄS MER

  4. 4. Bioinformatics analysis on the drug design supporting systems

    Magister-uppsats, Högskolan i Skövde/Institutionen för biovetenskap

    Författare :Emilia Guszpit; [2023]
    Nyckelord :;

    Sammanfattning : This research project investigates the interactions of staurosporine, a potent kinase inhibitor, with 11 ligands, highlighting its role in drug design and bioinformatics. Focusing on the selectivity and promiscuity of staurosporine in binding to protein kinases, the study employs the MANORAA database for data extraction. LÄS MER

  5. 5. Automated Metadata Extraction for Job Advertisements

    Master-uppsats, Göteborgs universitet/Institutionen för data- och informationsteknik

    Författare :Evelina Strauss; Usama Safdar; [2022-06-20]
    Nyckelord :Machine learning; NLP; text classification; computer; science; computer science; project; thesis;

    Sammanfattning : This thesis is written in collaboration with the Swedish Public Employment Service and aims to investigate methods and techniques to automatically extract metadata from unstructured texts. The Swedish Public Employment Service collect job ads from different private job boards and these ads consist of a title and description and are thus of an unstructured format. LÄS MER