microsearch

Inside Yahoo I’m often the first one to tell people about the wonders of the Semantic Web. Typically, one of the first questions I get is: “but just how much of this metadata is out there?” And more often than I like to admit, my answer is not better than “tons”.

Why is this question difficult to answer? First, we don’t have a quantitative result: the crawls of existing Semantic Web search engines are likely to be partial, and certainly don’t include the bulk of metadata that is embedded inside webpages using microformats and RDFa. Second, the questioner most likely would like a qualitative answer: how much metadata is out there that people would actually care about?

And so the idea of microsearch was born. The idea was to create a search page that instead of hiding metadata, brings it to the front, thereby showing the user just how much metadata is out there for any given query. After a few months of hacking and tweaking and some more time spent on getting permissions to share with the world, I’m now happy to say that our demo is finally online, and you can try microsearch for yourself. Here are some examples: 1,2,3,4 and a screenshot for the first example:

microsearch screenshot

Here is how it works in brief. You type in a query (say, your name). We gather the search results from our search engine and strip the metadata, including microformats of three known flavors (hCard, hCalendar, hReview), linked RDF and RDFa. The metadata is shown inside the abstracts as well as on a map and a timeline.

The map and the timeline work as aggregators, for example, in the third example above we show all events related to the query “san francisco conferences” on a single timeline. Further, in case pages can be related through the metadata we group them together. You can see this on the first example (shown on the screenshot). For the technologically minded, I’m using Java Servlets/JSP, Sesame, Elmo, the Simile Fresnel API, the Simile Timeline, Yahoo! Ajax Maps API and of course Yahoo! Search technology.

This is hardly the end of this story, but we are really curious about your feedback. So respond, comment here or drop me an email!

Disclaimer: this is not an indication if or how microformats and other forms of metadata would feature in Yahoo! products or services in the future. Also, it is a research prototype: use it long enough and it will break.

About these ads

8 comments so far

  1. David Karger on

    This looks neat, but I think we could have even more fun with it if you also offered a jsonp feed of the search results with all that metadata your own visualization is using—then we could use exhibit (http://simile.mit.edu/exhibit/) to create visualizations that are similar to yours but are personalized or task-specific.

  2. masaka on

    Hi, very interesting. It would be much nicer if i18n chars are displayed normally. Is GRDDL support difficult ?

  3. David on
  4. microsearch « Aman’s Blog on

    […] February 22, 2008 at 6:06 am · Filed under Projects, RDFa, microformat microsearch « Tripletalk […]

  5. […] Пост одного из авторов « [ИIИ] Nine Inch Nails следуют примеру Radiohead Идея: hMedia. » […]

  6. […] Microsearch (Yahoo) (source: Planet […]

  7. Adrienne Jebb on

    Just want to say your article is as astounding. The clarity in your post is simply nice and i can assume you’re an expert on this subject. Well with your permission let me to grab your feed to keep updated with forthcoming post. Thanks a million and please continue the enjoyable work.

  8. Rojer2389 on

    Ззаказал вновь навес могу сказать весь доволен приобретением, советую всем


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

%d bloggers like this: