Signature files are widely used in information retrieval and database. They act as search filters for content-based retrieval. In a large database server, a parallel device is utilized to achieve concurrency access. Efficient allocation of signature files on parallel devices minimizes the query response time and is important in the design of access methods for large scale index servers. We have developed an algorithm to organize the storage of signatures in parallel secondary storage to reduce the query response time. First, signature file is clustered into signature pages. Then, the clustered signature pages are distributed among the disks using the parity check matrix of error correcting code in coding theory. Through the construction of error correcting code, the least frequently simultaneously accessed pages are allocated on the same disk. Performance analysis shows that this algorithm improves the efficiency of access.
Journal of Information Science and Engineering, 15(2), 199-215