Evaluating Verbose Query Processing Techniques
Sam Huston (University of Massachusetts Amherst)
CSIRO ICTDATE: 2010-07-13
TIME: 16:00:00 - 17:00:00
LOCATION: CSIT Seminar Room, N101
CONTACT: JavaScript must be enabled to display this email address.
ABSTRACT:
Verbose or long queries are a small but significant part of the query stream in web search, and are common in other applications such as collaborative question answering (CQA). Current search engines perform well with keyword queries but are not, in general, effective for verbose queries. In this paper, we examine query processing techniques which can be applied to verbose queries prior to submission to a search engine in order to improve the search engine's results.
We focus on verbose queries that have sentence-like structure, but are not simple `wh-' questions, and assume the search engine is a `black box.' We evaluated the output of two search engines using queries from a CQA service and our results show that, among a broad range of techniques, the most effective approach is to simply reduce the length of the query. This can be achieved effectively by removing `stop structure' instead of only stop words. We show that the process of learning and removing stop structure from a query can be effectively automated.
BIO:
Samuel Huston is a PhD Student currently studying at the
University of Massachusetts Amherst within the Center for
Intelligent Information Retrieval. His studies focus on
issues related to effectively answering long queries
within information retrieval systems.


