Search: "web scraper"

Showing results 1 - 5 of 16 theses containing the words web scraper.

  1. The One Spider To Rule Them All : Web Scraping Simplified: Improving Analyst Productivity and Reducing Development Time with A Generalized Spider

    Bachelor's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author: Rikard Johansson; [2023]
    Keywords: Web scraping; Web crawlers; HTML; Scrapy; Optimization; Web data extraction

    Abstract: This thesis addresses the process of developing a generalized spider for web scraping, which can be applied to multiple sources, thereby reducing the time and cost involved in creating and maintaining individual spiders for each website or URL. The project aims to improve analyst productivity, reduce development time for developers, and ensure high-quality and accurate data extraction.

  2. Generic Data Harvester

    Bachelor's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Authors: William Asp; Johannes Valck; [2022]
    Keywords: News; Articles; Newspapers; Web crawler; Web site parsing; Optimization; Web robot; Web spider; Web data extraction; HTML; Scrapy

    Abstract: This report goes through the process of developing a generic article scraper that extracts relevant information from an arbitrary web article. The extraction is implemented by searching and examining the HTML of the article using Python and XPath.

  3. Data Analysis of Discussions, Regarding Common Vulnerabilities and Exposures, and their Sentiment on Social Media

    Bachelor's thesis, Linköpings universitet/Institutionen för datavetenskap

    Authors: Mustafa Rahmati; Danijel Grujicic; [2022]
    Keywords: Social media; Reddit; Twitter; sentiment analysis; computer science; information technology; CVE; information security; CVSS score; Flair; Vader; TextBlob; API; data collection; web scraper; data analysis; natural language processing; NLP; information retrieval

    Abstract: As common vulnerabilities and exposures are detected, they are also discussed on various social platforms. The problem is that only a few of the posts made about them get enough attention. This leads to a lack of awareness of potential and critical threats against systems.

  4. Automating the extraction of Financial data

    Bachelor's thesis, KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Authors: Nicolas Rollino; Rakin Ali; [2022]
    Keywords: Web scraper; Financial data; Textract; AWS; Node.JS; Puppeteer

    Abstract: It is hard for retail investors and data-providing companies to obtain financial data on European companies. The work of extracting financial data on European companies is most likely done manually, which is a time-consuming process. This would explain why European companies' data is supplied more slowly than American companies' data.

  5. Security smells in open-source infrastructure as code scripts : A replication study

    Bachelor's thesis, Karlstads universitet/Handelshögskolan (from 2013)

    Author: Andreas Hortlund; [2021]
    Keywords: infrastructure as code; security; Ansible; Puppet; static code analysis; security smells

    Abstract: With the rising number of servers used in production, virtualization technology engineers needed a new tool to help them manage the rising configuration workload. Infrastructure as code (IaC), a term that mainly covers techniques and tools for defining the desired configuration states of servers in machine-readable code files, aims to solve the high workload induced by configuring several servers.