Newest 'regex+web-scraping' Questions

8 votes

3 answers

686 views

Function in Python to extract web Data

I developed this function, which I think can be improved quite a bit. It produces the desired result, but it took me many lines of code. Any ideas for optimizing it? ...

Raymont

215

asked Dec 18, 2020 at 19:33

3 votes

0 answers

627 views

Scraper Class with Regex

I already posted an incomplete version of this class, but that question was closed because it contained failing tests. Here is the full version; all 5 tests pass. Originally there was ...

user3568719

173

asked Jul 21, 2017 at 21:05

2 votes

1 answer

602 views

Scraping data unveiling a button from craigslist

I've written some code to parse the names and phone numbers from craigslist. It starts from the link in "m_url" then goes one layer deep to parse the name and then again another layer deep to parse ...

SIM

2,501

asked Jul 12, 2017 at 19:26

2 votes

1 answer

739 views

JavaScript Website-Content Grabber

In my firm, a few people have the following problem: a Content Management System is hosted externally. The contract doesn't include database access. In September the contract will expire. So they have to get ...

michael.zech

5,002

asked Jul 3, 2017 at 7:52

7 votes

2 answers

5k views

Using python and beautifulsoup to iterate through a list of websites to find a particular string

I'm attempting to find companies that mention a particular service on their homepage. To do this, I am iterating through a CSV file with two columns: ID and URL. I'm using BeautifulSoup to get the ...

Nicholas Johnson

71

asked May 18, 2017 at 1:59

2 votes

1 answer

78 views

Parsing HTML to download e-books

I'm currently writing a little tool to get into Go. As I'm not familiar with the language, I'm especially looking for feedback on conventional Go practices. utility.go feels wrong. Should I wrap the client and email/...

Nordiii

151

asked Mar 27, 2017 at 22:42

13 votes

1 answer

373 views

Regex-guided crawler that downloads regex-matching images up to a crawling level

This is a simple crawler that downloads images from websites. The URL of each website to be crawled must match the regex, as must the URL of each image to download. (Also, I know, I made my own thread ...

wallabra

789

asked Jul 2, 2016 at 21:16

2 votes

1 answer

82 views

Wikipedia indexer and shortest link finder

I have the following code, how can I make it more efficient? Also, it doesn't always find the shortest route. (See Cat -> Tree) ...

dangee1705

345

asked Mar 1, 2016 at 19:39

3 votes

2 answers

193 views

Parsing HTML from multiple webpages simultaneously

My friend wrote a scraper in Go that takes the results from a house listing webpage and finds listings for houses that he's interested in. The initial search returns listings, they are filtered by ...

Explosion Pills

133

asked Feb 27, 2016 at 13:10

7 votes

3 answers

302 views

IP and router connections

How can I make my code more Pythonic? I definitely think there is a way to make this code a lot more readable, clearer, and shorter, but I haven't found an effective way. Any techniques I can use to ...

Marie Anne

181

asked Oct 25, 2015 at 18:20

1 vote

1 answer

197 views

Formatting HTML for use in a locally hosted iframe

This formats HTML for use in a locally hosted iframe so that you can manipulate the content in the iframe freely, without running into cross domain issues. It uses Goutte to retrieve the HTML. I'd ...

zacbrac

43

asked May 13, 2015 at 16:28

5 votes

2 answers

288 views

Press any login button on any site

I'm working on a script that will be able to press the login button on any site for an app I'm working on. I have it working (still a few edge cases to work out such as multiple submit buttons and ...

Levi Fuller

163

asked Feb 1, 2015 at 18:44

2 votes

2 answers

959 views

Phone Number Extracting using RegEx And HtmlAgilityPack

I've written this code to extract cell numbers from a website. It extracts the numbers correctly but very slowly, and it also hangs my Form while extracting. ...

Shehryar Iqbal

21

asked Sep 14, 2014 at 16:27

3 votes

1 answer

74 views

Cheat Code Scraper

During breaks, I find myself playing Emerald version a lot and was tired of having to use the school's slow Wi-Fi to access the internet. I wrote a scraper to obtain cheat codes and send them to my PSP ...

user27606

asked Jul 2, 2014 at 15:57

15 votes

1 answer

60k views

Getting data correctly from <span> tag with beautifulsoup and regex

I am scraping an online shop page, trying to get the price mentioned in that page. In the following block the price is mentioned: ...

avi

993

asked Feb 2, 2014 at 7:53
