TY - GEN
T1 - XClean
T2 - 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
AU - Lu, Yifei
AU - Wang, Wei
AU - Li, Jianxin
AU - Liu, Chengfei
PY - 2011
Y1 - 2011
N2 - An important facility to aid keyword search on XML data is suggesting alternative queries when user queries contain typographical errors. Query suggestion thus can improve users' search experience by avoiding returning empty result or results of poor qualities. In this paper, we study the problem of effectively and efficiently providing quality query suggestions for keyword queries on an XML document. We illustrate certain biases in previous work and propose a principled and general framework, XClean, based on the state-of-the-art language model. Compared with previous methods, XClean can accommodate different error models and XML keyword query semantics without losing rigor. Algorithms have been developed that compute the top-k suggestions efficiently. We performed an extensive experiment study using two large-scale real datasets. The experiment results demonstrate the effectiveness and efficiency of the proposed methods.
AB - An important facility to aid keyword search on XML data is suggesting alternative queries when user queries contain typographical errors. Query suggestion thus can improve users' search experience by avoiding returning empty result or results of poor qualities. In this paper, we study the problem of effectively and efficiently providing quality query suggestions for keyword queries on an XML document. We illustrate certain biases in previous work and propose a principled and general framework, XClean, based on the state-of-the-art language model. Compared with previous methods, XClean can accommodate different error models and XML keyword query semantics without losing rigor. Algorithms have been developed that compute the top-k suggestions efficiently. We performed an extensive experiment study using two large-scale real datasets. The experiment results demonstrate the effectiveness and efficiency of the proposed methods.
UR - https://www.scopus.com/pages/publications/79957835960
U2 - 10.1109/ICDE.2011.5767847
DO - 10.1109/ICDE.2011.5767847
M3 - Conference Paper published in a book
AN - SCOPUS:79957835960
SN - 9781424489589
T3 - Proceedings - International Conference on Data Engineering
SP - 661
EP - 672
BT - 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
Y2 - 11 April 2011 through 16 April 2011
ER -