The best way I've found is to scrape two pages from that site that both use the same template. Take a blog, for example: every post has the same layout, right? Scrape two of them, then compare. The content that appears on both pages will be the menu, footer, and so on, and the content that is unique to each page will generally be the content you want. This is easy to do if you split each page's HTML into lines and put them into an array.
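Here's a rough sketch of that comparison in Python, in case it helps. The function name `unique_lines` and the two sample pages are made up for illustration, and it assumes the pages are already downloaded as strings and that the template produces the shared parts as identical lines on both pages.

```python
def unique_lines(page_html, other_html):
    """Return the lines of page_html that do not appear in other_html."""
    # Strip whitespace so indentation differences don't hide shared lines.
    other = {line.strip() for line in other_html.splitlines()}
    result = []
    for line in page_html.splitlines():
        stripped = line.strip()
        # Skip blank lines and anything the other page also contains.
        if stripped and stripped not in other:
            result.append(stripped)
    return result


# Two made-up pages sharing the same template (menu and footer).
page_a = """<html>
<nav>Home | About</nav>
<h1>First post</h1>
<p>Text of the first post.</p>
<footer>Copyright Example Blog</footer>
</html>"""

page_b = """<html>
<nav>Home | About</nav>
<h1>Second post</h1>
<p>Text of the second post.</p>
<footer>Copyright Example Blog</footer>
</html>"""

print(unique_lines(page_a, page_b))
# ['<h1>First post</h1>', '<p>Text of the first post.</p>']
```

On real pages the template often has small per-page differences (an "active" menu class, a timestamp, a CSRF token), so a few template lines will probably leak through as "unique" and may need extra filtering.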