Abstract
Similarity queries are an important topic in many domains in computer science, including databases, data integration, World Wide Web, data mining, and bioinformatics. In this paper, we focus on the subfield of similarity queries for sets and strings. There has been a wide body of development in this area since 2000. We survey major results and present our analyses and categorization of existing approaches. Finally, we conclude the paper by giving a list of future research directions.
| Original language | English |
|---|---|
| Pages (from-to) | 1853-1862 |
| Number of pages | 10 |
| Journal | Jisuanji Xuebao/Chinese Journal of Computers |
| Volume | 34 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - Oct 2011 |
| Externally published | Yes |
Keywords
- Edit distance
- Jaccard
- Prefix filtering
- Similarity join
- Similarity query