recent comments

recent articles

  • How long would it take to read Wikipedia?

    Almer S. Tigelaar 21 / 02 / 2012

    Wikipedia has become the de facto encyclopedia on the Internet. A traditional encyclopedia spans many textbook volumes which would take any normal person ages to read. Few people would likely engage in such an endeavor. However, since Wikipedia is readily accessible: should you take up the challenge?

    read more 0 comments
  • Life in a Day

    Almer S. Tigelaar 09 / 02 / 2012

    The premise behind the YouTube documentary “Life in a Day” is interesting: invite everyone around the world to shoot video on one specific day: July 24th 2010. Have people upload their raw footage and edit it so it becomes a short, ninety minute, documentary that chronicles a single day on our planet. Does this extreme form of crowdsourcing actually work?

    read more 0 comments
  • Top 8 Prejudices about Americans

    Almer S. Tigelaar 07 / 02 / 2012

    When travelling abroad it is difficult to go with an open mind. Despite our best efforts we bring with us an excess of prejudice shaped by our own culture and view of the destination country. So to it was for me when I visited the United States. When coming back, people at home are very insistent that you play into their prejudice regarding where you’ve been as well, perhaps as a means of reinforcing their own identity.

    read more 0 comments

Category: Workshops

Query-Load Balancing by Caching Search Results in Peer-to-Peer Information Retrieval Networks

Almer S. Tigelaar 04 / 02 / 2011, 12:15

Query-Load Balancing by Caching Search Results in Peer-to-Peer Information Retrieval Networks
Tigelaar, A. S. & Hiemstra, D.
In Proceedings of DIR 2011, Amsterdam, The Netherlands (pp. 28-31).

View in Repository

Abstract
For peer-to-peer web search engines it is important to keep the delay between receiving a query and providing search results within an acceptable range for the end user. How to achieve this remains an open challenge. One way to reduce delays is by caching search results for queries and allowing peers to access each others cache. In this paper we explore the limitations of search result caching in large-scale peer-to-peer information retrieval networks by simulating such networks with increasing levels of realism. We find that cache hit ratios of at least thirty-three percent are attainable.

Presented at Dutch-Belgian Information Retrieval 2011 Workshop, February 4th 2011, Amsterdam, The Netherlands.

University of Amsterdam

DIR2011 Workshop Logo

read more 0 comments

Query-Based Sampling using Snippets

Almer S. Tigelaar 23 / 07 / 2010, 11:11

Query-Based Sampling using Snippets
Tigelaar, A. S. & Hiemstra, D.

In Proceedings of LSDS-IR 2010, Geneva, Switzerland (pp. 9-14).

View in Repository

Abstract
Query-based sampling is a commonly used approach to model the content of servers. Conventionally, queries are sent to a server and the documents in the search results returned are downloaded in full as representation of the server’s content. We present an approach that uses the document snippets in the search results as samples instead of downloading the entire documents. We show this yields equal or better modeling performance for the same bandwidth consumption depending on collection characteristics, like document length distribution and homogeneity. Query-based sampling using snippets is a useful approach for real-world systems, since it requires no extra operations beyond exchanging queries and search results.

Presented at the Large-Scale Distributed Systems for Information Retrieval Workshop on July 23rd in Geneva, Switzerland.

sigir2010-13
More Photos

Large-Scale and Distributed Systems for Information Retrieval Workshop Logo

read more 0 comments

Query-Based Sampling using Only Snippets (Poster)

Almer S. Tigelaar 29 / 09 / 2009, 15:00

Query-Based Sampling using Only Snippets
Tigelaar, A. S. & Hiemstra, D.
Poster NWO SIREN 2009, Enschede, The Netherlands.

Depicts the Query-Based Sampling Process

read more 0 comments

Matching Queries to Frequently Asked Questions: Search Functionality for the MRSA Web-Portal

Almer S. Tigelaar 02 / 02 / 2009, 17:00

Matching Queries to Frequently Asked Questions: Search Functionality for the MRSA Web-Portal
Tigelaar, A. S. & Akker, R. op den & Verhoeven, F.
In Proceedings of DIR 2009, Enschede, The Netherlands (pp. 26-33).

View in Repository

Abstract
As part of the long-term EUREGIO MRSA-net project a system was developed which enables health care workers and the general public to quickly find answers to their questions regarding the MRSA pathogen. This paper focuses on how these questions can be answered using Information Retrieval (IR) and Natural Language Processing (NLP) techniques on a Frequently-Asked-Questions-style (FAQ) database.

Presented at the Dutch-Belgian Information Retrieval Workshop 2009

read more 0 comments