1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Need recommendation for a site scraper / web page bulk downloader

Discussion in 'Content Management' started by Labcoat88, Mar 2, 2006.

  1. #1
    I'm looking for an application (preferably Windows) or a Unix script that can scrape a portion or entire website to local files. "Why I need this" is a long story, but I need to be able to slurp down HTML files, graphics, CSS, javascript, etc., and have the URL paths rewritten to work from a local hard drive.

    Any software recommendations for software that people have used? As for cost, 'free' would be nice, but I'm willing to pay some $ if the product is good and professional.

    thanks!
    LC
     
    Labcoat88, Mar 2, 2006 IP
    Barefootsies likes this.
  2. Labcoat88

    Labcoat88 Peon

    Messages:
    192
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    0
    #2
    hehe... replying to my own thread.

    I found a great one that does exactly what I needed: MetaProducts "Offline Explorer" ... http://www.metaproducts.com/

    LC
     
    Labcoat88, Mar 2, 2006 IP
  3. jimrthy

    jimrthy Guest

    Messages:
    283
    Likes Received:
    13
    Best Answers:
    0
    Trophy Points:
    0
    #3
    $70...ouch.

    I don't have much use for this sort of program, but, when I have had the situation come up, I just use wget. It takes a little bit (though not much) to learn, but I'm comfortable using command line tools. And it's free.

    I'm not sure about a windows version, though. If there's not, I bet it would work with cygwin.
     
    jimrthy, Mar 5, 2006 IP
  4. rehash

    rehash Well-Known Member

    Messages:
    1,502
    Likes Received:
    30
    Best Answers:
    0
    Trophy Points:
    150
    #4
    yeah, wget should do it most of times
     
    rehash, Mar 7, 2006 IP
  5. iatbm

    iatbm Prominent Member

    Messages:
    5,151
    Likes Received:
    352
    Best Answers:
    0
    Trophy Points:
    360
  6. xern

    xern Peon

    Messages:
    2
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #6
    Maybe you can consider to use FEAR::API, another site scraping framework based on Perl. You can use it to create your own specific scraping scripts.

    Best,
    Yung-chung Lin
     
    xern, May 16, 2006 IP