I also made myself a little bot to scrape some sites. The only problem I ran into was that I had to drive a real browser (Firefox / Chrome) instead of a headless one, because the page I was scraping relied on JS.
Is anyone aware of any headless browsers for Python that support JS and are pretty simple to use? I'm having a hard time finding anything worthwhile.
JavaScript support in HtmlUnit is poor. Try pyvirtualdisplay (see Corey Goldberg's post "Python - Taking Browser Screenshots With No Display (Selenium/Xvfb)") or PyPhantomJS: http://dev.umaclan.com/projects/pyphantomjs (let me know if you have any success with that one). I don't think either could be described as "simple to use".
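For what it's worth, here is a rough sketch of the pyvirtualdisplay route: start an Xvfb display, run a normal Selenium-driven Firefox inside it, and grab the page source once the JS has run. It assumes Xvfb, Firefox, pyvirtualdisplay and selenium are all installed. The function name `fetch_rendered_html` and its two factory parameters are my own invention for illustration (they just make it possible to swap in stand-ins), not part of either library.

```python
def fetch_rendered_html(url, make_display=None, make_browser=None):
    """Load `url` in a real browser on a virtual display and return its HTML.

    `make_display` and `make_browser` are hypothetical hooks: callables that
    return a display object (start/stop) and a browser object (get/quit/
    page_source). When omitted, pyvirtualdisplay and Selenium are used.
    """
    if make_display is None:
        # Imported lazily so the function can be exercised without Xvfb.
        from pyvirtualdisplay import Display
        make_display = lambda: Display(visible=0, size=(1024, 768))
    if make_browser is None:
        from selenium import webdriver
        make_browser = webdriver.Firefox

    display = make_display()
    display.start()  # the browser started below renders into this display
    try:
        browser = make_browser()
        try:
            browser.get(url)
            # page_source reflects the DOM after scripts have modified it
            return browser.page_source
        finally:
            browser.quit()
    finally:
        display.stop()  # always tear down the virtual display
```

Something like `html = fetch_rendered_html("http://example.com")` should then work on a box with no monitor attached. It is still a full browser underneath, so it is not truly headless and not especially fast, which is probably why I wouldn't call it simple.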
