1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

Inquiry about a site with 100,000+ links to crawl..

Discussion in 'Google Sitemaps' started by jaiz, Sep 27, 2006.

  1. #1
    Hi I have had GSiteCrawler running for a few days now and it still hasn't finished crawling my site. I have a coppermine gallery which has thousands among thousands of links which is making it impossible for me to create full sitemaps like I want. Is there any sitemap creators that you would recommend that could crawl this many links within a few hours or at least 24 hours? Or is it hopeless?
     
    jaiz, Sep 27, 2006 IP
  2. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #2
    What slows down the crawlers may be if you have a "backend" DB which gets "hogged". I have experienced this with some forums as well. It may pay of to be very careful with crawler filters. (e.g. if two pages are very similar to each other, cut one of them)

    There is no way tell if e.g. my software may be faster before knowing the url of the website. For such a large website (100,000+ pages), I would in my program disable :
    • Track all links and redirects from and to all pages
    • Let website crawler collect external links
     
    websitetools, Sep 28, 2006 IP
  3. disgust

    disgust Guest

    Messages:
    2,417
    Likes Received:
    133
    Best Answers:
    0
    Trophy Points:
    0
    #3
    why is it so essential that it gets done in 24 hours?

    beyond that:

    make sure the pages are unique enough to warrant listing AND you also have enough incoming links for google to want to go that deep. having a sitemap won't instantly make google index your whole site, no matter how many or how few pages you have; links have played a huge factor in search engines for ages and will continue to do so for a long time.
     
    disgust, Sep 28, 2006 IP
  4. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    #4
    A sitemap won't help you get this many urls indexed. Make the site easy to crawl instead.
     
    mad4, Sep 28, 2006 IP
  5. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #5
    While I agree that the effect of sitemaps are not always fully clearcut, I tend to think they help quite well for bigger websites. Of course, it is always a good idea to have friendly urls (e.g. like no session IDs! and prefebly everything mod_rewrite) and incoming links... But to argue that Google XML sitemaps (which this forum is about) is no help for big websites... That kinda surprises :eek: me :D
     
    websitetools, Sep 28, 2006 IP