ANU Computer Science Technical Reports
TR-CS-95-08
David Hawking.
The design and implementation of a parallel document retrieval
engine.
December 1995.
Abstract: Document retrieval as traditionally
formulated is an inherently parallel task because the document collection can
be divided into N sub-collections each of which may be searched
independently. Document retrieval software can potentially exploit the power
and capacity of a large-scale parallel machine to improve speed, to extend
the size of the largest collection which can be processed, to respond quickly
to changes in the document collection and/or to increase the power and
expressivity of the retrieval query language. This paper includes discussion
of the issues involved in the design of a practical parallel document
retrieval engine for a distributed-memory multicomputer and a description of
the implementation of PADRE, a retrieval engine for the Fujitsu AP1000.
Performance results are presented and the scope of applicability of the
techniques is discussed.
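
The partitioning idea in the abstract can be illustrated with a minimal sketch. This is not PADRE's implementation and does not target the AP1000; it assumes a toy in-memory collection and uses a Python thread pool to stand in for the independent processors. The names partition, search_partition and parallel_search are hypothetical, chosen only for this illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(docs, n):
    """Split docs into n sub-collections in round-robin order."""
    return [docs[i::n] for i in range(n)]


def search_partition(sub_collection, term):
    """Return ids of documents in one sub-collection that contain the term."""
    term = term.lower()
    return [doc_id for doc_id, text in sub_collection
            if term in text.lower().split()]


def parallel_search(docs, term, n=4):
    """Search every sub-collection independently, then merge the hits."""
    parts = partition(docs, n)
    with ThreadPoolExecutor(max_workers=n) as pool:
        # Each worker sees only its own sub-collection (assumed toy setup).
        per_part = list(pool.map(search_partition, parts, [term] * n))
    return sorted(hit for hits in per_part for hit in hits)


# Hypothetical four-document collection of (id, text) pairs.
docs = [
    (0, "parallel document retrieval"),
    (1, "distributed memory multicomputer"),
    (2, "retrieval query language"),
    (3, "performance results"),
]
hits = parallel_search(docs, "retrieval", n=2)
```

Because no sub-collection depends on another, the merged result is the same for any choice of n; only the amount of work per worker changes.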