Efficient algorithms for dualizing large-scale hypergraphs

被引:46
作者
Murakami, Keisuke [1 ]
Uno, Takeaki [2 ]
机构
[1] Aoyama Gakuin Univ, Chuo Ku, Sagamihara, Kanagawa 2525258, Japan
[2] Res Org Informat & Syst, Natl Inst Informat, Chiyoda Ku, Tokyo 1018430, Japan
关键词
Dualization; Hypergraph transversal; Minimal hitting set; Minimal set covering; Enumeration; Experimental efficiency; Real-world data; QUASI-POLYNOMIAL ALGORITHM; MAXIMAL FREQUENT; IMPLEMENTATION; TRANSVERSALS; STRATEGIES; COMPLEXITY; SETS;
D O I
10.1016/j.dam.2014.01.012
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
A hypergraph F is a set family defined on vertex set V. The dual of F is the set of minimal subsets H of V such that F boolean AND H not equal phi for any F is an element of F. The computation of the dual is equivalent to several problems, such as minimal hitting set enumeration of a subset family, generation problem for maximal frequent and minimal infrequent sets, and so on. Although many algorithms have been proposed for the problem, to the best of our knowledge, none of them can work on large-scale input with a large number of minimal hitting sets. In this paper, we propose efficient algorithms for checking the minimality and pruning methods. These algorithms enable us to construct time effective and polynomial space dualization algorithms. The computational experiments show that our algorithms are quite fast even for large-scale input for which existing algorithms do not terminate in a practical time. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:83 / 94
页数:12
相关论文
共 25 条
[1]   Reverse search for enumeration [J].
Avis, D ;
Fukuda, K .
DISCRETE APPLIED MATHEMATICS, 1996, 65 (1-3) :21-46
[2]  
Bailey J, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P485
[3]  
Berge C., 1989, HYPERGRAPHS, V45
[4]  
Boros E, 2003, LECT NOTES COMPUT SC, V2832, P556
[5]   Dual-bounded generating problems: weighted transversals of a hypergraph [J].
Boros, E ;
Gurvich, VA ;
Khachiyan, L ;
Makino, K .
DISCRETE APPLIED MATHEMATICS, 2004, 142 (1-3) :1-15
[6]   Generating dual-bounded hypergraphs [J].
Boros, E ;
Elbassioni, K ;
Gurvich, V ;
Khachiyan, L .
OPTIMIZATION METHODS & SOFTWARE, 2002, 17 (05) :749-781
[7]   On maximal frequent and minimal infrequent sets in binary matrices [J].
Boros, E ;
Gurvich, V ;
Khachiyan, L ;
Makino, K .
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2003, 39 (03) :211-221
[8]   Mining border descriptions of emerging patterns from dataset pairs [J].
Dong, GZ ;
Li, JY .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) :178-202
[9]   Computational aspects of monotone dualization: A brief survey [J].
Eiter, Thomas ;
Makino, Kazuhisa ;
Gottlob, Georg .
DISCRETE APPLIED MATHEMATICS, 2008, 156 (11) :2035-2049
[10]   On the complexity of dualization of monotone disjunctive normal forms [J].
Fredman, ML ;
Khachiyan, L .
JOURNAL OF ALGORITHMS, 1996, 21 (03) :618-628