Towards Generating Web-accessible STEM Documents from PDF

被引:7
作者
Sorge, Volker [1 ]
Bansal, Akashdeep [2 ]
Jadhav, Neha M. [2 ]
Garg, Himanshu [2 ]
Verma, Ayushi [2 ]
Balakrishnan, M. [2 ]
机构
[1] Univ Birmingham, Birmingham, W Midlands, England
[2] IIT Delhi, Delhi, India
来源
17TH INTERNATIONAL WEB FOR ALL CONFERENCE (WEB4ALL) | 2020年
关键词
STEM Accessibility; PDF; Web;
D O I
10.1145/3371300.3383351
中图分类号
学科分类号
摘要
PDF is still a very popular format that is widely used to exchange and archive electronic documents. And although considerable efforts have been made to ensure accessibility of PDF documents, they are still far from ideal when complex content like formulas, diagrams or tables is present. Unfortunately, many publications in scientific subjects are available in PDF format only and are therefore, if at all, only partially accessible. In this paper, we present a fully automated web-based technology to convert PDF documents into an accessible single file format. We concentrate on presenting working solutions for mathematical formulas and tables while also discussing some of the open problems in this context and how we aim to solve them in the future.
引用
收藏
页数:5
相关论文
共 14 条
[1]  
Adobe Acrobat, AD ACR
[2]   Axessibility: a LaTeX Package for Mathematical Formulae Accessibility in PDF Documents [J].
Ahmetovic, Dragan ;
Armano, Tiziana ;
Bernareggi, Cristian ;
Berra, Michele ;
Capietto, Anna ;
Coriasco, Sandro ;
Murru, Nadir ;
Ruighi, Alice ;
Taranto, Eugenia .
ASSETS'18: PROCEEDINGS OF THE 20TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2018, :352-354
[3]  
[Anonymous], arXiv
[4]  
Apache PDFBox, ABOUT US
[5]  
Baker Josef B., 2012, 11th International Conference, AISC 2012 19th Symposium, Calculemus 2012. Proceedings 5th International Workshop, DML 2012. 11th International Conference, MKM 2012. Systems and Projects, Held as Part of CICM 2012, P422, DOI 10.1007/978-3-642-31374-5_29
[6]  
Baker JB, 2009, LECT NOTES COMPUT SC, V5625, P201, DOI 10.1007/978-3-642-02614-0_19
[7]  
Bigham Jeffrey P., 2016, P 2016 CHI C HUM FAC, P621, DOI DOI 10.1145/2851581.2892588
[8]  
Caldwell B, 2012, PDF TECHNIQUES WCAG
[9]  
Ester M, 1996, KDD 96, P226, DOI DOI 10.5555/3001460.3001507
[10]   Document representation and its application to page decomposition [J].
Jain, AK ;
Yu, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) :294-308