Can Bit.ly solve real-time search?
One of the themes of my last post was that for all this talk of new methods of social distribution the methods of filtering information haven’t substantially changed, instead we’re merely able to better identify smaller data sets to bring to bear on the issue. This got me thinking about the problems of real-time search.
Real-time search has been something of a bugbear. Google recently declared that real-time search is their biggest challenge and a variety of companies have come out of the gate trying to compete in this space. Thus far, all of them do so by reducing the data set to something more manageable. Summize (now Twitter Search), the most famous example, searches Twitter and others like One Riot search Twitter, Digg and other social sites. The issue with these is that while they are manifestly able to beat the likes of Google in terms of providing real-time results, finding relevant information is still difficult. Eric Wiesen points out that ‘“Now” is actually a pretty bad filter for a tremendous proportion of content’ so even when searching for information on stories like the plane in the Hudson, where Summize is generally perceived to have excelled, one still finds oneself wading through an awful lot of hay to find the needle.
However, I think that bit.ly has the potential to approach this space from a slightly different angle and provide real value. Bit.ly is now processing more than 100 million decodes a week. This gives them in essence a real-time global heatmap of what is most relevant to people at this moment in time. Some of the data they surface may not have been published as recently as the last post on twitter, but it is likely to be relevant to the real-time needs of the user doing the search, and while a lot of bit.ly links come from twitter it’s not constrained to any particular service in the way that other real-time search engines are.
The mere fact that someone has chosen to share this information is indicative of a certain level of relevance, but bit.ly’s ability to measure the virality of a given link enables them to rank the relative relevance of those links too. So what they have is a different pathway to achieving something very close to what Google can do with Pagerank over the limited dataset that real-time search requires. It will be interesting to see if bit.ly chooses to go down this path or not, but I think it’s reasonable to suggest that bit.ly has the potential to be a serious challenger in this space and it is certainly much much more than just ten lines of code.
