Towards an Automated Classification of Spreadsheets

被引:0
作者
Mendes, Jorge [1 ,2 ]
Do, Kha N. [3 ]
Saraiva, Joao [1 ,2 ]
机构
[1] INESC TEC, HASLab, Oporto, Portugal
[2] Univ Minho, HASLab, Braga, Portugal
[3] Vietnam Natl Univ, Univ Sci, Ho Chi Minh, Vietnam
来源
SOFTWARE TECHNOLOGIES: APPLICATIONS AND FOUNDATIONS (STAF 2016) | 2016年 / 9946卷
关键词
Spreadsheets; Data mining; Classification;
D O I
10.1007/978-3-319-50230-4_26
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many spreadsheets in the wild do not have documentation nor categorization associated with them. This makes difficult to apply spreadsheet research that targets specific spreadsheet domains such as financial or database. We introduce with this paper a methodology to automatically classify spreadsheets into different domains. We exploit existing data mining classification algorithms using spreadsheet-specific features. The algorithms were trained and validated with cross-validation using the EUSES corpus, with an up to 89% accuracy. The best algorithm was applied to the larger Enron corpus in order to get some insight from it and to demonstrate the usefulness of this work.
引用
收藏
页码:346 / 355
页数:10
相关论文
共 50 条
[41]   AUTOMATED MORPHOLOGICAL CLASSIFICATION OF APM GALAXIES [J].
NAIM, A .
ASTROPHYSICAL LETTERS & COMMUNICATIONS, 1995, 31 (1-6) :87-90
[42]   Automated Classification of an Environmental Sensitivity Index [J].
Helmut Schiller ;
Carlo Van Bernem ;
Hansjörg L. Krasemann .
Environmental Monitoring and Assessment, 2005, 110 :291-299
[43]   Automated classification of Croatian traditional music [J].
Strizrep, Ivan ;
Krzic, Ana Sovic ;
Sersic, Damir .
2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, :1028-1033
[44]   Towards a natural classification of Botryosphaeriales [J].
Jian-Kui Liu ;
Rungtiwa Phookamsak ;
Mingkhuan Doilom ;
Saowanee Wikee ;
Yan-Mei Li ;
Hiran Ariyawansha ;
Saranyaphat Boonmee ;
Putarak Chomnunti ;
Dong-Qin Dai ;
Jayarama D. Bhat ;
Andrea I. Romero ;
Wen-Ying Zhuang ;
Jutamart Monkai ;
E. B. Gareth Jones ;
Ekachai Chukeatirote ;
Thida Win Ko Ko ;
Yong-Chang Zhao ;
Yong Wang ;
Kevin D. Hyde .
Fungal Diversity, 2012, 57 :149-210
[45]   Tubal disease: towards a classification [J].
Akande, Valentine A. .
REPRODUCTIVE BIOMEDICINE ONLINE, 2007, 15 (04) :369-375
[46]   Towards Fair and Robust Classification [J].
Sun, Haipei ;
Wu, Kun ;
Wang, Ting ;
Wang, Wendy Hui .
2022 IEEE 7TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2022), 2022, :356-376
[47]   Towards a New Classification of Cardiomyopathies [J].
Perry Elliott .
Current Cardiology Reports, 2023, 25 :229-233
[48]   Towards Understanding Classification and Identification [J].
Fumagalli, Mattia ;
Bella, Gabor ;
Giunchiglia, Fausto .
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 :71-84
[49]   Towards a natural classification of Botryosphaeriales [J].
Liu, Jian-Kui ;
Phookamsak, Rungtiwa ;
Doilom, Mingkhuan ;
Wikee, Saowanee ;
Li, Yan-Mei ;
Ariyawansha, Hiran ;
Boonmee, Saranyaphat ;
Chomnunti, Putarak ;
Dai, Dong-Qin ;
Bhat, Jayarama D. ;
Romero, Andrea I. ;
Zhuang, Wen-Ying ;
Monkai, Jutamart ;
Jones, E. B. Gareth ;
Chukeatirote, Ekachai ;
Ko, Thida Win Ko ;
Zhao, Yong-Chang ;
Wang, Yong ;
Hyde, Kevin D. .
FUNGAL DIVERSITY, 2012, 57 (01) :149-210
[50]   Towards a New Classification of Cardiomyopathies [J].
Elliott, Perry .
CURRENT CARDIOLOGY REPORTS, 2023, 25 (04) :229-233