I am interested in consolidating data from several websites onto a spreadsheet that I can sort and rank. This data is intended for my personal use only. I have looked at a couple tutorials and begun writing a web scraper in Python using requests, Beautiful Soup, and mysql. However, it seems that in order to write what I want I need to learn several separate pieces of software and languages. Before I dive into that effort it occurs to me their may be a generic web scraper that I can use to gather the information I am seeking. My specific questions are:
1. Is there a generic web scraper available for Raspberry Pi 4b.
2. Are web scrapers written specifically to scrape a particular website or can one piece of software (written by me or someone else) be used to scrape similar information from several sites?
3. Are there any legal restrictions in scraping data from publicly available websites? I presume that, since I am physically able to point my browser at a particular website and receive data, i am also ale to obtain that same information in a different manner or format.
4. What is the difference between a web crawler and a web scraper?
5. Are the tools I selected the right way to go or is there a simpler approach?
1. Is there a generic web scraper available for Raspberry Pi 4b.
2. Are web scrapers written specifically to scrape a particular website or can one piece of software (written by me or someone else) be used to scrape similar information from several sites?
3. Are there any legal restrictions in scraping data from publicly available websites? I presume that, since I am physically able to point my browser at a particular website and receive data, i am also ale to obtain that same information in a different manner or format.
4. What is the difference between a web crawler and a web scraper?
5. Are the tools I selected the right way to go or is there a simpler approach?
Statistics: Posted by ras_oscar — Sat Feb 15, 2025 3:47 pm