ANU Computer Science Technical Reports
TR-CS-96-04
Peter Bailey and David Hawking.
A parallel architecture for query processing over a terabyte of
text.
June 1996.
[POSTSCRIPT (148809 bytes)] [PDF (279492 bytes)]
Abstract: The Parallel Document Retrieval Engine
(PADRE) has previously demonstrated that full text scanning methods supported
by parallel hardware permit powerful query constructors and rapid response to
changing document collections. Extensions to PADRE have been designed and
implemented which make use of parallel secondary storage to allow each
procesing node to handle data up to 32 times the size of its primary memory.
Using the largest purchasable machine on which PADRE currently runs, these
increase the maximum possible collection size to one terabyte. This paper
addresses the practicality of achieving this limit and the extent to which
the performance, responsiveness, functionality and scalability of the full
text scanning PADRE are preserved in the extended version.
Technical Reports <Technical-DOT-Reports-AT-cs-DOT-anu.edu.au>
Last modified: Tue May 31 12:55:59 EST 2011