
Is there any way to find JavaScript links on a webpage with Python? I use mechanize and I can't find all the links I want. I want the URLs of the pictures on this site: http://500px.com/popular

Can you post a use case? – Pankaj Sharma

A sample page with expected output would be helpful. – Martijn Pieters

I want the URLs of the pictures on this site: 500px.com/popular – user3465589

1 Answer

With just BeautifulSoup this is quite easy:

js_links = soup.select('a[href^="javascript:"]')

This selects all <a> elements that have an href attribute whose value starts with javascript::

>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup('''\
... <html><body>
... <a href="http://stackoverflow.com">Not a javascript link</a>
... <a name="target">Not a link, no href</a>
... <a href="javascript:alert('P4wned');">Javascript link (with scary message)</a>
... <a href="javascript:return False">Another javascript link</a>
... </body></html>
... ''')
>>> for link in soup.select('a[href^="javascript:"]'):
...     print link['href'], link.get_text()
... 
javascript:alert('P4wned'); Javascript link (with scary message)
javascript:return False Another javascript link
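Since the asker wants image URLs, the same selector can be combined with a descendant selector to pull the src of any <img> nested inside a javascript: link. This is a minimal sketch using inline sample HTML, not the real 500px markup: 500px builds its gallery with JavaScript, so the static HTML you download may not contain the photos at all, and a browser driver such as Selenium (or the site's API) may be needed instead.

```python
# Sketch: collect image URLs found inside javascript: links.
# The HTML below is a stand-in; the real 500px page is rendered by JS.
from bs4 import BeautifulSoup

html = '''<html><body>
<a href="javascript:void(0)"><img src="http://example.com/photo1.jpg"></a>
<a href="javascript:void(0)"><img src="http://example.com/photo2.jpg"></a>
<a href="http://example.com/about">Plain link, no image</a>
</body></html>'''

soup = BeautifulSoup(html, 'html.parser')
# Descendant selector: <img> elements inside <a href="javascript:...">
urls = [img['src'] for img in soup.select('a[href^="javascript:"] img')]
print(urls)
```

Running this prints the two photo URLs and skips the plain link, because only <img> tags under matching anchors are selected.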
