Like a good portion of the blogosphere (it sounds like) I have been playing with the new Google blog search – which technically speaking should be referred to as FeedSearch since feed content is the basis of the indexing – but no matter. Danny Sullivan has been playing as well apparently.
The index contains about 2/3 months worth of posts, and Google claims that it will be backfilled as time goes. Since many feeds only list the latest n posts (10, 20, 50 ?), I am not sure how they will be able to do so – besides scrapping blog pages or extracting posts from the cache of their main index (?).
A few things caught my attention:
Tags and categories seem to be ignored in the indexing or the ranking algorithm. The content of the title field for both the blog and posts seems to have a high (if not disproportionate) degree of importance in the relevancy … Read more »