How to parse javascript results with Python

Question

I'm having trouble with my Python Script. All i want to do is to parse a div element with an id value:value and to store all the changed values. The value of this element is generating by javascript. This means that the value of the element is depending on user's input. To be more specific the html element looks like that

<div id="value">...Here the frequently changed value generated by javascript...</div>

My python script is the following:

from bs4 import BeautifulSoup
import urllib
x=urllib.urlopen("http://example.com")
s = x.read()
soup = BeautifulSoup(s)

m = soup.find("div",{"id":"value"})
val = m.text
print val

The result is None but on the webpage the changes are obvious! Please help me to figure it out.

Your code looks fine. You can check x.getcode() to make sure you actually download the page (it should return 200). — Dzinx
– Dzinx, Commented Apr 18, 2014 at 13:17

alecxe · Accepted Answer · 2014-04-18 13:18:38Z

0

If the value is generated by javascript - the easiest solution would be to make use of a real browser to crawl the web page. This is where selenium would help. Here's a simple example:

from selenium import webdriver

browser = webdriver.Firefox()
browser.get('http://example.com')

element = browser.find_element_by_id('value')
print element.text

answered Apr 18, 2014 at 13:18

alecxe

476k127 gold badges1.1k silver badges1.2k bronze badges

i'm having some troubles with the installation of selenium module. Is Splinter something similar like Selenium?

Labrosb
– Labrosb

04/18/2014 16:06:25
Commented Apr 18, 2014 at 16:06
@ather0s splinter is simply an abstraction layer on top of the selenium and other libraries.

alecxe
– alecxe

04/18/2014 16:09:43
Commented Apr 18, 2014 at 16:09

Add a comment |

Collectives™ on Stack Overflow

How to parse javascript results with Python

1 Answer 1

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Related