Visually Lossless HTML']HTML Compression

被引:0
|
作者
Skibinski, Przemyslaw [1 ]
机构
[1] Univ Wroclaw, Inst Comp Sci, PL-50383 Wroclaw, Poland
来源
WEB INFORMATION SYSTEMS ENGINEERING - WISE 2009, PROCEEDINGS | 2009年 / 5802卷
关键词
!text type='HTML']HTML[!/text] compression; !text type='HTML']HTML[!/text] transform; semi-structural data compression;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The verbosity of the Hypertext Markup Language (HTML) remains one of its main weaknesses. This problem can be solved with the aid of HTML specialized compression algorithms. In this work, we describe a visually lossless HTML transform that, combined with generally used compression algorithms, allows to attain high compression ratios. Its core is a transform featuring substitution of words in an HTML document using a static English dictionary, effective encoding of dictionary indexes, numbers, and specific patterns. Visually lossless compression means that the HTML document layout will be modified, but the document displayed in a browser will provide the exact fidelity with the original. The experimental results show that the proposed transform improves the HTML compression efficiency of general purpose compressors on average by 21% in the case of gzip, achieving comparable processing speed. Moreover, we show that the compression ratio of gzip can be improved by up to 32% for the price of higher memory requirements and much slower processing.
引用
收藏
页码:195 / 202
页数:8
相关论文
共 50 条
  • [1] Improving HTML']HTML compression
    Skibinski, Przemyslaw
    DCC: 2008 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, : 545 - 545
  • [2] Improving HTML']HTML Compression
    Skibinski, Przemyslaw
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2009, 33 (03): : 363 - 373
  • [3] An HTML']HTML interface for visually handicapped people
    Hadjadj, D
    Bouzidi, A
    Burger, D
    IMPROVING THE QUALITY OF LIFE FOR THE EUROPEAN CITIZEN: TECHNOLOGY FOR INCLUSIVE DESIGN AND EQUALITY, 1998, 4 : 38 - 41
  • [4] WEB ACCESSIBILITY TOOL FOR VISUALLY IMPAIRED ACTIVATED THROUGH HTML']HTML TAGS
    Peraza Garzon, Juan Francisco
    Estrada Lizarraga, Rogelio
    Olivarria Gonzalez, Monica del Carmen
    Zaragoza Gonzalez, Jose Nicolas
    Mendoza Zatarain, Rafael
    Ortega Carrillo, Jose Antonio
    Cobian Campos, Jose Alfredo
    INTED2015: 9TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2015, : 7549 - 7552
  • [5] Rec.HTML']HTML: Declarative HTML']HTML
    Reynders, Bob
    Choi, Kwanghoon
    COMPANION PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON THE ART, SCIENCE, AND ENGINEERING OF PROGRAMMING (PROGRAMMING 2021 COMPANION), 2021, : 1 - 5
  • [6] SAS® and HTML']HTML -: HTML']HTML publishing using SAS
    Bahler, C
    Muller, S
    Doolittle, D
    Barrios, A
    PROCEEDINGS OF THE TWENTY-THIRD ANNUAL SAS USERS GROUP INTERNATIONAL CONFERENCE, 1998, : 229 - 237
  • [7] Mastering HTML']HTML and XHTML']HTML
    Staples, J
    TECHNICAL COMMUNICATION, 2004, 51 (01) : 126 - 128
  • [8] IMPROVING HTML']HTML DATA TABLES NAVIGATION A Method to obtain Information for Visually Impaired People
    Fernandez, Juan Manuel
    Soler, Vicenc
    Roig, Jordi
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL HCI: HUMAN-COMPUTER INTERACTION, 2008, : 397 - 400
  • [9] Dynamic HTML']HTML: The HTML']HTML developer's guide.
    Gillespie, T
    LIBRARY JOURNAL, 1999, 124 (13) : 132 - 132
  • [10] HTML']HTML & XHTML']HTML: The definitive guide
    Robertson, A
    TECHNICAL COMMUNICATION, 2001, 48 (04) : 498 - 500