imacros help

wickedDUDE

New member
Jun 25, 2006
1,054
12
0
Out of a long list of urls, I need to know which ones contain a specific string of text on the page. The text is exactly the same on each page.

I've used imacros before to pull other info like meta keywords, but never for text that is mixed in with content.

Thoughts are welcome...
 


Code:
import requests
urls = (l.strip() for l in open('urls.txt'))

for url in urls:
    try:
        r = requests.get(url)
        print url, 'footprint' in r.text
    except:
        pass
 
...
load url
extract html
open tab
submit html to a web form
process said webform for the phrase with php and return what you need
extract html
close tab
....

Same exact principle for firing off a captcha to services really.