Newest 'web-scraping+python-3.x' Questions

5 votes

2 answers

687 views

Readability and error handling improvements for Python web scraping class

Description I recently wrote a Python script to download files from the Library of Congress (LOC) based on a search query. The code fetches metadata, extracts file ...

IntegerEuler

253

asked Nov 19, 2024 at 0:04

3 votes

1 answer

99 views

Scraping website with Python and Selenium to collect data from dynamic website

Summary: The code scrapes the website and collects the data to store it in CSV. It also downloads selected information that is available for download in PDF format. The details and the entire code are ...

sangharsh

269

asked Apr 9, 2024 at 20:41

2 votes

1 answer

72 views

A selenium web scraper to package NBA data

I'm building a selenium web scraper for basketball-reference.com that takes a player name and returns data in either a JSON format or Pandas DataFrame object. The class in question is one of many that ...

BluffShove

21

asked Mar 24, 2024 at 0:25

2 votes

1 answer

116 views

Search Stack Overflow and GitHub for code in a specified language

This code is designed to scrape Stack Overflow and GitHub, pulling information based on a user-specified programming language and processing the data into a format for AI learning. It uses a number of ...

Robert3737

39

asked Mar 10, 2023 at 11:21

3 votes

1 answer

229 views

A simple web scraper for nature.com news articles

I have created a simple web scraper that fetches news article previews from nature.com and saves each article to a file containing the article preview text. I am learning independently, so I would ...

razzleDazzle

31

asked Mar 9, 2023 at 17:39

3 votes

2 answers

233 views

Saving Scraped Data to a File

When scraping and saving data into a file, Which method is more efficient when saving scraped data to a file? open the file first, scrape, and save the data all ...

Seraph776

201

asked Oct 5, 2022 at 16:21

3 votes

1 answer

57 views

request data and print results

On last test, the below code takes approximately 10 seconds to download then print the data from 10 url's. I wish to speed this up as much as possible as later on I plan to expand this further and use ...

Luca

67

asked Sep 21, 2022 at 15:53

3 votes

1 answer

219 views

Scrape PokeDex and display in tkinter

Hi I am new here and I just completed my first working version of a pokedex app with a GUI using tkinter. I used selenium to scrape the data from pokemondb.net, and then used pandas to clean up the ...

Salah Zahran

33

asked Jul 26, 2022 at 0:20

9 votes

2 answers

2k views

Python script to scrape and parse the Stanford Encyclopedia of Philosophy

I wrote the following script to parse an SEP article and call pandoc to convert it to EPUB. I'd love your feedback. There is no function but I didn't think it was worth adding. Also there is no test ...

user260644

asked Jul 11, 2022 at 11:17

1 vote

2 answers

122 views

Scraper to grab publicly available data

I am using Python scraper code to grab publicly available data from http://103.48.16.132/echalan/, but it takes almost 6gb of memory and more cpu. I need to run ...

Learner

119

asked Jul 11, 2022 at 4:40

3 votes

0 answers

821 views

A simple Python script that crawls information about Youtube playlists and your watch history

You will need to follow this guide. This is a Youtube crawler that crawls information about Youtube playlists, it uses Youtube Data API v3 and it crawls the title, url, description, count and videos ...

Ξένη Γήινος

3,702

asked Apr 29, 2022 at 9:17

3 votes

0 answers

106 views

Scraping housing dataset row by row from Estate Agent's website

I have written web scraping code with selenium with python3 for a dynamic site (it was my first project that is why I started with selenium). In my code I extracted the values and created DataFrame. ...

berkersenol

31

asked Mar 23, 2022 at 19:59

8 votes

1 answer

153 views

Scrape a URL for articles to save

Below is an exercise project I was doing on a certain educational site. It is supposed to parse a given (static address in this example) URL for html data, search articles of a given type there and ...

Gavin Greenhorn

145

asked Feb 11, 2022 at 19:00

4 votes

2 answers

632 views

Web scraping data.cdc.gov for COVID-19 Data with Selenium in Python

I'm attempting to scrape data.cdc.gov for their COVID-19 information on cases and deaths. The problem that I'm having is that the code seems to be very inefficient. It takes an extremely long time for ...

Nini

55

asked Dec 9, 2021 at 16:12

2 votes

0 answers

169 views

Web scraping articles using asyncio

I use asyncio to speed up web scraping. I collect only title, author, tags, datetime, total comments from list view from specific website. Also, i collect these from all pages. I would like to improve ...

AlexDotis

417

asked Nov 10, 2021 at 21:31

Stack Exchange Network

All Questions

Readability and error handling improvements for Python web scraping class

Scraping website with Python and Selenium to collect data from dynamic website

A selenium web scraper to package NBA data

Search Stack Overflow and GitHub for code in a specified language

A simple web scraper for nature.com news articles

Saving Scraped Data to a File

request data and print results

Scrape PokeDex and display in tkinter

Python script to scrape and parse the Stanford Encyclopedia of Philosophy

Scraper to grab publicly available data

A simple Python script that crawls information about Youtube playlists and your watch history

Scraping housing dataset row by row from Estate Agent's website

Scrape a URL for articles to save

Web scraping data.cdc.gov for COVID-19 Data with Selenium in Python

Web scraping articles using asyncio

Hot Network Questions