Interesting. Dan reports that Google is watching more closely than he expected.
I made an offhand comment on IRC with the URL for the real image (I hardlink it on the server, rather than playing Apache redirect games) and three minutes later... Pow! There was googlebot, looking at it. Turns out that one of the folks on the channel'd looked at it, and they run Opera with the google stuff on it so presumably that's how google got the URL. I will admit to being very impressed with the speed that the crawler struck out with (and it was the crawler, at least according to the log data and the PTR record for the IP address) but still...
Of course, it could have been the experimental IRC sniffer bot they've been playing with.
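For anyone who wants to repeat the check Dan describes (log data plus the PTR record for the IP address), here is a rough sketch of one way to do it in Python. It looks up the PTR record for an address taken from the access log, checks that the hostname sits under a Google crawler domain, and then resolves that hostname forward again to confirm it maps back to the same address. The function names and the domain suffix list are my own assumptions for illustration, not anything Dan says he used.

```python
import socket

# Assumption: crawler PTR names end in one of these domains.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_googlebot_hostname(hostname):
    """Return True if a PTR hostname falls under an assumed Google crawler domain."""
    host = hostname.rstrip(".").lower()
    return host.endswith(GOOGLE_SUFFIXES)


def verify_crawler_ip(ip, reverse=socket.gethostbyaddr,
                      forward=socket.gethostbyname_ex):
    """Reverse-resolve ip, check the domain, then forward-confirm the hostname.

    The reverse and forward resolvers are parameters (a hypothetical
    convenience) so the logic can be exercised without touching DNS.
    """
    try:
        hostname = reverse(ip)[0]        # PTR lookup
    except OSError:                      # no PTR record or lookup failure
        return False
    if not is_googlebot_hostname(hostname):
        return False
    try:
        addresses = forward(hostname)[2]  # A-record lookup
    except OSError:
        return False
    # A PTR record alone can be set by whoever controls the address block,
    # so only trust it if the name resolves back to the same address.
    return ip in addresses
```

Calling `verify_crawler_ip(ip)` with an address copied out of the Apache log does real DNS lookups, so the answer depends on what the resolver returns at that moment. The forward-confirmation step is the part that matters: a matching hostname on its own proves very little.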
Posted by jzawodn at November 10, 2003 09:16 PM
True, it could have been the bot, but I've noticed that pages with AdSense get crawled quite often. If Google doesn't already have the page in its cache, it will usually send out Googlebot within 30 minutes to grab the page for AdSense. A bonus for site owners is that the page gets stuffed into the search database at the same time. I'm not familiar with the backend, so it might all be the same database, but still, it's pretty cool.
If it was Google monitoring the channel then it means they were tied into the IRC network. #parrot is pretty small (~30 folks normally) and everyone's got op privs, so there wasn't a bot hiding and watching. (Well, not a google bot at least, as we've got a half-dozen special-purpose bots hanging around)
While I'm not thrilled with the idea of a google toolbar watching the URLs you type, I like it a lot better than the thought that they've just got another server tied into the IRC net I'm on and are watching everything...
It's definitely the plugin. Did some testing, and put up the results on the blog.
What about a redirect with robots.txt, in conjunction with the Google toolbar and the URLs?