Set and string similarity queries: A survey

Xue Min Lin*, Wei Wang

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

17 Citations (Scopus)

Abstract

Similarity queries are an important topic in many domains in computer science, including databases, data integration, World Wide Web, data mining, and bioinformatics. In this paper, we focus on the subfield of similarity queries for sets and strings. There has been a wide body of development in this area since 2000. We survey major results and present our analyses and categorization of existing approaches. Finally, we conclude the paper by giving a list of future research directions.

Original languageEnglish
Pages (from-to)1853-1862
Number of pages10
JournalJisuanji Xuebao/Chinese Journal of Computers
Volume34
Issue number10
DOIs
Publication statusPublished - Oct 2011
Externally publishedYes

Keywords

  • Edit distance
  • Jaccard
  • Prefix filtering
  • Similarity join
  • Similarity query

Fingerprint

Dive into the research topics of 'Set and string similarity queries: A survey'. Together they form a unique fingerprint.

Cite this