Exploring Different Paradigms to Extract Proper Implications From High Dimensional Formal Contexts

被引:2
作者
Neves, Julio C., V [1 ]
Batista Ruas Da Silveira, Pedro Henrique [1 ]
Missaoui, Rokia [2 ]
Dias, Sergio M. [1 ,3 ]
Zarate, Luis E. [1 ]
Song, Mark A. J. [1 ]
机构
[1] Pontifical Catholic Univ Minas Gerais PUC Minas, Dept Comp Sci, BR-30535901 Belo Horizonte, MG, Brazil
[2] Univ Quebec Outaouais, Dept Comp Sci & Engn, Gatineau, PQ J8X 3X7, Canada
[3] Fed Serv Data Proc SERPRO, BR-31035536 Belo Horizonte, MG, Brazil
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Data mining; Lattices; Formal concept analysis; Binary decision diagrams; LinkedIn; Heuristic algorithms; proper implications; binary decision diagram; RELATIONAL CONCEPT ANALYSIS; FREQUENT ITEMSETS; CONCEPT LATTICES; ALGORITHMS; FCA;
D O I
10.1109/ACCESS.2020.3010482
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Formal Concept Analysis (FCA) is an applied mathematical technique for data analysis, in which the relations between objects and attributes are identified. It introduces the notion of concepts and their hierarchical structure, from which we can obtain a set of implications between attributes that characterize a knowledge domain. The volume of information to be processed makes the use of FCA difficult in domains with a high number of dimensions, creating a demand for new solutions and algorithms for FCA applications. This article explores different approaches to extract proper implications from high dimensional contexts based on constraints to obtain the set of implications rules. We propose algorithms that use a data structure called Binary Decision Diagram (BDD) to represent the formal context, which reduces its size and, due to this, operates more efficiently. We also propose a heuristic to obtain proper implications by reducing the unnecessary generation of premises. In addition, we implemented a parallel computing model for generating and obtaining different implications. To analyze the proposed algorithms, we used different synthetic contexts with a varying number of objects, attributes, and density. The results obtained presented speed gains of up to 22 times when compared to the solutions proposed in the literature such as Impec and PropIm.
引用
收藏
页码:134161 / 134175
页数:15
相关论文
共 45 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], 1999, FORMAL CONCEPT ANAL
[3]  
Armstrong W.W., 1974, IFIP C
[4]   The multiple facets of the canonical direct unit implicational basis [J].
Bertet, K. ;
Monjardet, B. .
THEORETICAL COMPUTER SCIENCE, 2010, 411 (22-24) :2155-2166
[5]  
Brin S., 1997, P 1997 ACM SIGMOD IN, P265
[6]  
BRYANT RE, 1986, IEEE T COMPUT, V35, P677, DOI 10.1109/TC.1986.1676819
[7]  
Burmeister P., 2003, FORMAL CONCEPT ANAL
[8]   Scalable formal concept analysis algorithms for large datasets using Spark [J].
Chunduri, Raghavendra K. ;
Cherukuri, Aswani Kumar .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (11) :4283-4303
[9]   A step forward for Topic Detection in Twitter: An FCA-based approach [J].
Cigarran, Juan ;
Castellanos, Angel ;
Garcia-Serrano, Ana .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 57 :21-36
[10]   Graphical norms via conceptual graphs [J].
Croitoru, Madalina ;
Oren, Nir ;
Miles, Simon ;
Luck, Michael .
KNOWLEDGE-BASED SYSTEMS, 2012, 29 :31-43