Abstract
In this paper, we study three methods for implementing the tf × idf ranking strategy with inverted files, where tf stands for term frequency and idf stands for inverse document frequency. The first method sorts the postings lists of the query terms by increasing length; it is the traditional sorting method used in the upperbound search algorithm. The second sorts query terms based on two parameters: the maximum tf of the postings list and the list length. The third requires each postings list to first be sorted by decreasing tf value. It then sorts disk pages, rather than postings lists, based on three parameters: the maximum tf of the disk page, the length of the postings list, and the number of document identifiers in the disk page. We show that the second and third methods can identify a large portion of the top documents without a large number of disk page accesses, and that they outperform the first method by a large margin. The performance of these methods is demonstrated by experimental runs on four test collections made available with the SMART system.
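
As an illustration of the first two orderings, the following is a minimal sketch rather than the paper's implementation. The toy index, the function names, the logarithmic idf form, and in particular the way the second ordering combines maximum tf with list length (maximum tf times idf) are all assumptions made for this example; the abstract does not give the exact formulas. The page-level third method is not covered.

```python
import math

# Hypothetical toy inverted file: term -> postings list of (doc_id, tf) pairs.
INDEX = {
    "ranking": [(1, 3), (2, 1), (4, 2)],
    "inverted": [(2, 1), (3, 1)],
    "file": [(1, 1), (2, 1), (3, 2), (4, 1), (5, 1)],
}
NUM_DOCS = 5


def idf(term):
    # One common idf form (assumed here): log(N / list length).
    return math.log(NUM_DOCS / len(INDEX[term]))


def order_by_length(terms):
    # First method: process the shortest postings list first.
    return sorted(terms, key=lambda t: len(INDEX[t]))


def order_by_max_contribution(terms):
    # Assumed stand-in for the second method: rank each list by the largest
    # tf * idf contribution it could make, using its maximum tf and its length.
    def bound(term):
        return max(tf for _, tf in INDEX[term]) * idf(term)

    return sorted(terms, key=bound, reverse=True)


def score(terms):
    # Plain tf * idf accumulation over the postings lists of the query terms.
    scores = {}
    for term in terms:
        weight = idf(term)
        for doc_id, tf in INDEX[term]:
            scores[doc_id] = scores.get(doc_id, 0.0) + tf * weight
    return scores


query = ["ranking", "inverted", "file"]
by_length = order_by_length(query)
by_bound = order_by_max_contribution(query)
top_doc = max(score(query).items(), key=lambda item: item[1])[0]
```

On this toy data the two orderings differ: the length-based ordering starts with the shortest list, while the assumed bound-based ordering starts with the list whose maximum tf gives the largest possible contribution.
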
| Original language | English |
|---|---|
| Title of host publication | Advances in Computing and Information – ICCI 1991 - International Conference on Computing and Information, Proceedings |
| Editors | Waldemar W. Koczkodaj, Frank Dehne, Frantisek Fiala |
| Publisher | Springer Verlag |
| Pages | 231-239 |
| Number of pages | 9 |
| ISBN (Print) | 9783540540298 |
| Publication status | Published - 1991 |
| Externally published | Yes |
| Event | 3rd International Conference on Computing and Information, ICCI 1991 - Ottawa, Canada; Duration: 27 May 1991 → 29 May 1991 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 497 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 3rd International Conference on Computing and Information, ICCI 1991 |
|---|---|
| Country/Territory | Canada |
| City | Ottawa |
| Period | 27/05/91 → 29/05/91 |
Bibliographical note
Publisher Copyright: © Springer-Verlag Berlin Heidelberg 1991.