ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data
Cited by: 6
Author:
Davashi, Razieh [1,2]
Affiliations:
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
Top-K Uncertain Frequent Pattern (UFP) mining is an interesting topic in data mining. The existing TUFP algorithm supports only static mining of Top-K UFPs; in practice, however, users need to change the K threshold repeatedly to extract information that fits the requirements of their application. In such interactive environments, the TUFP algorithm must re-scan the database and rebuild the UP-Lists and CUP-Lists from scratch each time K changes, which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store pattern information efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists instead of creating the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle: the UP-Lists and IMCUP-Lists are created only once and reused for mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experiments on sparse and dense uncertain data show that the proposed method is very efficient for interactive mining of Top-K UFPs.
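The "build once, mine many" idea described in the abstract can be illustrated with a small sketch. It is not the ITUFP algorithm and does not reproduce the UP-List or IMCUP-List structures, whose layout the abstract does not give. It assumes the common expected-support model, in which items within a transaction are independent and the expected support of a pattern is the sum, over transactions, of the product of its item probabilities. The class `InteractiveTopK`, its method `top_k`, and the counter `joins` are hypothetical names introduced only for this illustration; plain dictionaries stand in for the paper's lists.

```python
import heapq


class InteractiveTopK:
    """Illustrative sketch only: caches per-pattern probability lists so that
    repeated top-K queries with different K reuse earlier work."""

    def __init__(self, db):
        # Single database scan: build one tid -> probability map per item.
        # (A plain dict stands in for the paper's UP-List; an assumption.)
        self.lists = {}
        for tid, transaction in enumerate(db):
            for item, prob in transaction.items():
                self.lists.setdefault((item,), {})[tid] = prob
        self.items = sorted(pattern[0] for pattern in self.lists)
        self.joins = 0  # number of list joins computed so far

    def _list(self, pattern):
        # Return the cached list for a pattern, joining prefix and last item
        # only the first time the pattern is requested.
        if pattern not in self.lists:
            prefix = self._list(pattern[:-1])
            last = self.lists[(pattern[-1],)]
            self.joins += 1
            self.lists[pattern] = {
                tid: p * last[tid] for tid, p in prefix.items() if tid in last
            }
        return self.lists[pattern]

    def top_k(self, k):
        # Best-first search: expected support never increases when a pattern
        # is extended, so patterns leave the heap in non-increasing order.
        heap = [(-sum(self.lists[(i,)].values()), (i,)) for i in self.items]
        heapq.heapify(heap)
        result = []
        while heap and len(result) < k:
            neg_support, pattern = heapq.heappop(heap)
            result.append((pattern, -neg_support))
            if len(result) == k:
                break
            # Extend only with later items so each pattern is generated once.
            for item in self.items:
                if item > pattern[-1]:
                    extended = pattern + (item,)
                    support = sum(self._list(extended).values())
                    if support > 0:
                        heapq.heappush(heap, (-support, extended))
        return result


# Toy uncertain database: each transaction maps an item to its probability.
db = [
    {"a": 0.9, "b": 0.8},
    {"a": 0.5, "c": 0.4},
    {"a": 0.6, "b": 0.5, "c": 1.0},
]
miner = InteractiveTopK(db)   # lists are built once
first = miner.top_k(2)        # mine with K = 2
second = miner.top_k(4)       # raise K; cached lists are reused
```

In this sketch, raising K computes only the joins that earlier queries never reached, and repeating a query computes none, which is the behaviour the abstract attributes to storing the lists rather than rebuilding them.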