Google Now Fills Out Forms & Crawls Results (searchengineland.com)
34 points by raghus on April 11, 2008 | hide | past | favorite | 18 comments


I noticed this happening in my blog's logs months ago, for my blog's search form. I thought it was fascinating. The keywords it chose to search were generally meaningful words that occurred frequently on the site. I'm not sure if they were correlated with the Google search terms that hit my blog.

The corresponding SEO tip is to make sure that your site's search feature works, delivers good results for the keywords and delivers the results to which you want Google to pay attention.


Wow, this is kind of a big deal. Crawling the deep web seemed like the one real feature Google's vaporware competitors could tout.


The deep web isn't limited to GET requests and forms that don't require passwords.


Yes, but this is a start... give it a couple years.

And this is still more than the vaporware competitors that are still vaporware.


I propose a new sport - Google Crawler hunting. Kind of like Snipe hunting, only you lay traps. Object is to see how long you can keep the Google bot there by generating new pages programmatically, and how many unique things you can get the bot to do.
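A minimal version of such a trap, as a sketch: every page the bot requests is generated on the fly and links to child pages derived from the current path, so the crawler never runs out of "new" URLs. The function name and fanout are hypothetical, and a real trap would sit behind a web server route.

```python
import hashlib

def trap_page(path: str, fanout: int = 3) -> str:
    """Generate an HTML page whose links lead to ever-deeper,
    programmatically derived child pages -- a classic spider trap."""
    links = []
    for i in range(fanout):
        # Derive a stable child URL from the current path, so each
        # page offers fresh-looking links for the bot to follow next.
        child = hashlib.md5(f"{path}/{i}".encode()).hexdigest()[:8]
        links.append(f'<a href="{path}/{child}">page {child}</a>')
    return "<html><body>" + " ".join(links) + "</body></html>"

# Each request yields links the crawler has never seen before.
print(trap_page("/trap"))
```

Scoring "how long you keep the bot there" is then just counting requests to the trap prefix in your access logs.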


Here's Matt Cutts explaining it a little more: http://www.mattcutts.com/blog/solved-another-common-site-rev...


So it's OK for them to do this? But if you do the same against Google (scraping results), presumably Bad Things happen?


Is their search url listed in their robots.txt?


Seems strange that you didn't just look when asking that question (fewer characters to type, at least). Lots of things are disallowed.

http://google.com/robots.txt
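For anyone curious how a well-behaved crawler would actually apply those rules, the stdlib can parse them. This sketch uses a hypothetical excerpt in the style of google.com/robots.txt (the live file is much longer and may differ):

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt, not the actual current file.
ROBOTS_TXT = """\
User-agent: *
Allow: /search/about
Disallow: /search
"""

rp = RobotFileParser()
rp.parse(ROBOTS_TXT.splitlines())

# /search is disallowed for all user agents...
print(rp.can_fetch("*", "https://google.com/search?q=test"))
# ...but the explicitly allowed sub-path is fine.
print(rp.can_fetch("*", "https://google.com/search/about"))
```

Note the Allow line is listed before the Disallow line because Python's parser applies the first matching rule rather than the longest match.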


I think it was a rhetorical question.


Yes, it was. But I should have been clearer for the kind of literal-minded argument starters who tend to hang around here ;)

I wonder whether Google always honours robots.txt.


I hope so because otherwise they could get stuck in one of those:

http://en.wikipedia.org/wiki/Spider_trap


What an interesting problem... so, if you were to build a searchbot, what would it type into those form fields it found?

It has to be very difficult, especially since you can't do too many submits. Could you somehow infer what to put in there based on searches in your own database already?
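One plausible heuristic, in line with the observation upthread that the bot searched for frequent on-site words (a sketch, not Google's actual method; the function and thresholds are made up): rank the meaningful terms already seen on the site's pages and use the top few as probe queries.

```python
import re
from collections import Counter

STOPWORDS = {"the", "and", "a", "of", "to", "in", "is", "it", "that"}

def probe_terms(pages: list[str], k: int = 3) -> list[str]:
    """Pick the k most frequent meaningful words across already-crawled
    pages -- candidate values to type into the site's search box."""
    counts = Counter()
    for page in pages:
        for word in re.findall(r"[a-z]+", page.lower()):
            # Skip stopwords and very short tokens.
            if word not in STOPWORDS and len(word) > 3:
                counts[word] += 1
    return [w for w, _ in counts.most_common(k)]

pages = [
    "Scaling a web crawler is mostly about politeness and scheduling.",
    "The crawler fetches pages and the scheduler decides what is next.",
]
print(probe_terms(pages))  # "crawler" ranks first
```

This also respects the "can't do too many submits" constraint: a handful of high-frequency terms likely covers most of what the site's search can reveal.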


What happens when google starts editing wiki pages and filing new bug reports/tickets?


It only submits GET forms, and you shouldn't use GET for actions like those anyway.
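That distinction is mechanical to check: GET is supposed to be safe and idempotent, so a crawler can filter forms by method before submitting anything. A sketch with the stdlib HTML parser (the example form actions are hypothetical):

```python
from html.parser import HTMLParser

class FormFinder(HTMLParser):
    """Collect (method, action) for every <form>. A crawler should
    only auto-submit method="get" forms, since GET must not mutate
    state -- editing a wiki or filing a ticket belongs behind POST."""
    def __init__(self):
        super().__init__()
        self.forms = []

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            a = dict(attrs)
            # HTML defaults the form method to GET when omitted.
            self.forms.append((a.get("method", "get").lower(),
                               a.get("action", "")))

html = """
<form action="/site-search" method="get"><input name="q"></form>
<form action="/wiki/edit" method="post"><textarea name="body"></textarea></form>
"""
finder = FormFinder()
finder.feed(html)
safe = [action for method, action in finder.forms if method == "get"]
print(safe)  # only the search form is safe to probe
```

A site that edits pages via GET forms has a bigger problem than Googlebot; any prefetching proxy would trip it too.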


I was sort of hoping they had a valid credit card number :-)







