共 16 条
- [1] Arasu A., Ganti V., Kaushik R., Efficient exact set-similarity joins, Proc. of the VLDB, pp. 918-929, (2006)
- [2] Theobald M., Siddharth J., Paepcke A., SpotSigs: Robust and efficient near duplicate detection in large Web collections, Proc. of the SIGIR, pp. 563-570, (2008)
- [3] Bayardo R.J., Ma Y.M., Srikant R., Scaling up all pairs similarity search, Proc. of the WWW, pp. 131-140, (2007)
- [4] Chaudhuri S., Ganti V., Kaushik R., A primitive operator for similarity joins in data cleaning, Proc. of the ICDE, pp. 5-17, (2006)
- [5] Elmagarmid A.K., Ipeirotis P.G., Verykios V.S., Duplicate record detection: A survey, IEEE Trans. on Knowledge and Data Engineering, 19, 1, pp. 1-16, (2007)
- [6] Sarawagi S., Kirpal A., Efficient set joins on similarity predicates, Proc. of the SIGMOD Conf., pp. 743-754, (2004)
- [7] Xiao C., Wang W., Lin X.M., Yu J.X., Efficient similarity joins for near duplicate detection, Proc. of the WWW, pp. 131-140, (2008)
- [8] Gao Q.S., Gao X.Y., Hu Y., A uniform definition of fuzzy set theory and fundamental of probability theory, Journal of Dalian University of Technology, 46, 1, pp. 141-150, (2006)
- [9] Soliman M.A., Ilyas I.F., Chang K.C., Top-k query processing in uncertain databases, Proc. of the ICDE, pp. 896-905, (2007)
- [10] Bernecker T., Kriegel H.P., Renz M., Verhein F., Zufle A., Probabilistic frequent itemset mining in uncertain databases, Proc. of the KDD, pp. 119-128, (2009)