Dear MSNBot,
Take a chill pill. You've been pulling my RSS feed roughly every 10 minutes today.
I know you're late to the search game and even later to the RSS game, but seriously... WTF?!
Please take a lesson from the rest of the RSS aggregation and search services. Listen to a ping stream and think about checking once an hour or so.
Thanks,
Jeremy
Posted by jzawodn at August 18, 2005 05:53 PM
One of my servers has had 2,397 hits from msnbot today (as of 8pm).
You folks aren't kidding, msnbot is on crank. Crawling more often does not equal a better crawl!
This happened to my feed yesterday to the point where it was hitting it once every minute. It's still going today, but it looks like it has slowed. From looking through the logs, it looks like MSN is trying to determine the refresh rate of my feed (and ignoring a Last-Modified: header). I see it hitting more often during some hours, and then slowing, and now it seems as if it has settled on a constant once-every-five-minutes.
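For anyone wondering what honoring Last-Modified would look like, here is a minimal sketch of the server-side check, assuming the crawler sends back the feed's Last-Modified value in an If-Modified-Since header. The function name is hypothetical and this is not a description of how MSNBot or any particular server works.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def is_not_modified(if_modified_since, last_modified):
    """Return True when a 304 Not Modified reply would be enough.

    if_modified_since: raw header value from the request, or None.
    last_modified: timezone-aware datetime of the feed's last change.
    """
    if not if_modified_since:
        return False  # unconditional request, send the full feed
    try:
        since = parsedate_to_datetime(if_modified_since)
    except (TypeError, ValueError):
        return False  # unparseable date, fall back to a full response
    if since.tzinfo is None:
        # Treat a date without zone information as UTC.
        since = since.replace(tzinfo=timezone.utc)
    return last_modified <= since
```

A crawler that sends the header gets a tiny 304 reply whenever the feed has not changed, so even frequent polling costs very little bandwidth.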
Dear Jeremy,
Sorry about that. I will try to lay off the meth.
Sincerely,
MSNBot
It's been grabbing my feed about every five minutes since Wednesday evening. And it doesn't ask for gzip compression to save bandwidth.
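As a rough sketch of what asking for gzip involves on the crawler side, using only the Python standard library: the URL in the usage note and both function names are made up for illustration.

```python
import gzip
import urllib.request


def build_feed_request(url):
    """Build a request that tells the server gzip responses are accepted."""
    return urllib.request.Request(url, headers={"Accept-Encoding": "gzip"})


def decode_body(body, content_encoding):
    """Decompress the response body if the server actually used gzip."""
    if (content_encoding or "").lower() == "gzip":
        return gzip.decompress(body)
    return body
```

The request would be passed to urllib.request.urlopen, for example with build_feed_request("http://example.com/rss.xml"), and the response's Content-Encoding header handed to decode_body along with the raw bytes.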
I am not on the A list like Jeremy, but I am puzzled by how much crawling Google seems to do vs. other spiders. I am new to this, so I may be reading my stats wrong:
http://blog.eronj.com/awstats/awstats.pl
Robots/Spiders visitors (18 different robots)

Robot          Hits    Bandwidth  Last visit
Googlebot      18562   569.74 MB  19 Aug 2005 - 01:19
MSNBot          1436     6.78 MB  19 Aug 2005 - 00:48
Inktomi Slurp    713    10.16 MB  19 Aug 2005 - 01:02
So adding this to your robots.txt doesn't help?
User-agent: msnbot
Crawl-delay: a bazillion
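Joking aside, Crawl-delay is a non-standard robots.txt extension, and parsers generally expect a number of seconds. Here is a small sketch using Python's urllib.robotparser to show how one parser treats the directive. The helper name and the 3600-second value are illustrative assumptions, and whether msnbot honors the setting is up to the crawler.

```python
from urllib.robotparser import RobotFileParser


def delay_for(robots_lines, user_agent):
    """Return the Crawl-delay (in seconds) this parser reads for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.crawl_delay(user_agent)


# Ask for at most one fetch per hour (3600 is an assumed example value).
hourly = delay_for(["User-agent: msnbot", "Crawl-delay: 3600"], "msnbot")

# A non-numeric value is ignored by this parser, so no delay is reported.
joke = delay_for(["User-agent: msnbot", "Crawl-delay: a bazillion"], "msnbot")
```

In this sketch hourly comes back as 3600 and joke comes back as None, so a real value needs to be a plain integer.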
Ron K. Jeffries: Those Googlebot crawls are just insane for a site like yours.
I wonder if the MSN bot can read the sitemap.xml file; perhaps that would cool it off. I know MS said they fixed it, or were fixing it, but it still seems to be all over my site.