Web Scraping Tools - Please Advise



Hit up mattseh. He is both an expert web scraper and a tool.

Code:
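# magicrequests is not the standard requests package; this code relies on
# its responses having an .xpath() method.
import magicrequests as requests
import time
import string
import collections
import json

url = 'http://www.warriorforum.com/warrior-special-offers-forum/'

# Per-poster totals: thread views, and number of threads started.
views_counter = collections.Counter()
posts_counter = collections.Counter()

while True:
    print(url)
    try:
        r = requests.get(url)
    except Exception:
        # Pause before retrying so a failing page is not hit in a tight loop.
        time.sleep(5)
        continue
    # Keep only numeric cells from column 5 (skips cells with lowercase text).
    counts = [int(c.replace(',', '')) for c in r.xpath('//tr/td[5]/text()')
              if not any(char in c for char in string.ascii_lowercase)]
    posters = r.xpath('//tr/td[2]/div[2]/span/text()')
    # Add pair by pair: dict(zip(...)) keeps only the last thread per poster.
    for poster, count in zip(posters, counts):
        views_counter[poster] += count
    posts_counter.update(posters)

    next_links = r.xpath('//a[@rel="next"]/@href')
    if next_links:
        url = next_links[0]
        time.sleep(5)  # be polite between pages
    else:
        break

    print('most common: %s' % views_counter.most_common(10))

with open('wafo-views.json', 'w') as f:
    json.dump(views_counter, f)
with open('wafo-posts.json', 'w') as f:
    json.dump(posts_counter, f)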
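
One thing worth watching when tallying views per poster: if a poster has several threads on the same page, building a dict from zip(posters, counts) keeps only the last count for that name. A minimal sketch of the difference, using made-up names and numbers (tally_views is a hypothetical helper, not part of the script):

```python
import collections


def tally_views(posters, counts):
    """Sum view counts per poster, keeping repeated posters."""
    totals = collections.Counter()
    for poster, count in zip(posters, counts):
        totals[poster] += count
    return totals


# Hypothetical sample data, not taken from the forum.
posters = ['alice', 'bob', 'alice']
counts = [100, 50, 25]

totals = tally_views(posters, counts)
# dict(zip(...)) overwrites alice's first thread with her second one.
collapsed = collections.Counter(dict(zip(posters, counts)))

print(totals.most_common(2))     # [('alice', 125), ('bob', 50)]
print(collapsed.most_common(2))  # [('bob', 50), ('alice', 25)]
```

Adding pair by pair costs nothing extra and keeps the totals honest when names repeat.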
 
Waste of time bro, your service doesn't have a 7 in the price and doesn't promise the next one click shiny button to 100% guaranteed success and eternal happiness.
 
Just pay hehejo... His turnaround time is faster than you building anything.

Always did a great job for me.
 
I don't know how complicated a scraper you want, but check out import.io. It's a little buggy since it's in beta, but it's free and pretty easy to use.
 
Just hire a coder... seriously. Most little jobs like the one you're talking about can be done in an hour or two.

Sounds good in theory but when you have clients and something breaks, that coder isn't at your beck and call to fix it.