Sökning: "Web-scraper"

Visar resultat 1 - 5 av 13 uppsatser innehållade ordet Web-scraper.

  1. 1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Rikard Johansson; [2023]
    Nyckelord :Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction; Webbskrapning; Webbsökrobotar; HTML; Scrapy; Optimering; Webbdataextraktion;

    Sammanfattning : This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction. LÄS MER

  2. 2. Data Analysis of Discussions, Regarding Common Vulnerabilities and Exposures, and their Sentiment on Social Media

    Kandidat-uppsats, Linköpings universitet/Institutionen för datavetenskap

    Författare :Mustafa Rahmati; Danijel Grujicic; [2022]
    Nyckelord :Social media; Reddit; Twitter; sentiment analysis; computer science; information technology; CVE; information security; CVSS score; Flair; Vader; TextBlob; API; data collection; web scraper; data analysis; natural language processing; NLP; information retrieval;

    Sammanfattning : As common vulnerabilites and exposures are detected, they are also discussed in various social platforms. The problem is that only a few of the posts made about them, are getting enough attention. This leads to an unawareness of potential and critical threats against systems. LÄS MER

  3. 3. Automating the extraction of Financial data

    Kandidat-uppsats, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Författare :Nicolas Rollino; Rakin Ali; [2022]
    Nyckelord :Web scraper; Financial data; Textract; AWS; Node.JS; Puppeteer;

    Sammanfattning : It is hard for retail investors and data providing companies to attain financial data of European companies. The work of extracting financial data of European companies is most likely done manually, which is a time-consuming process. This would explain why European companies’ data is supplied slower than American companies. LÄS MER

  4. 4. Security smells in open-source infrastructure as code scripts : A replication study

    Kandidat-uppsats, Karlstads universitet/Handelshögskolan (from 2013)

    Författare :Andreas Hortlund; [2021]
    Nyckelord :infrastructure as code; security; Ansible; Puppet; static code analysis; security smells;

    Sammanfattning : With the rising number of servers used in productions, virtualization technology engineers needed a new a tool to help them manage the rising configuration workload. Infrastructure as code(IaC), a term that consists mainly of techniques and tools to define wanted configuration states of servers in machine readable code files, which aims at solving the high workload induced by the configuration of several servers. LÄS MER

  5. 5. Second-hand goods classification with CNNs : A proposal for a step towards a more sustainable fashion industry

    Uppsats för yrkesexamina på avancerad nivå, Uppsala universitet/Datalogi

    Författare :Torsten Malmgård; [2021]
    Nyckelord :Webscraper; CNN; Fashion; Second-Hand; Convolutional Neural Network;

    Sammanfattning : For some time now, the fashion industry has been a big contributor to humanity's carbon emissions. If we are to become a more sustainable society and cut down on our pollution, this industry needs to be reformed. The clothes we wear must be reused to a greater extent than today. LÄS MER