Fully Quantized Transformer for Machine Translation

Authors
Prato, Gabriele [1]
Charlaix, Ella [2]
Rezagholizadeh, Mehdi [2]
Affiliations
[1] Univ Montreal, Mila, Montreal, PQ, Canada
[2] Huawei Noah's Ark Lab, Montreal, PQ, Canada
Source
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020 | 2020
Keywords
NEURAL-NETWORKS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
State-of-the-art neural machine translation methods employ massive amounts of parameters. Drastically reducing computational costs of such methods without affecting performance has been up to this point unsuccessful. To this end, we propose FullyQT: an all-inclusive quantization strategy for the Transformer. To the best of our knowledge, we are the first to show that it is possible to avoid any loss in translation quality with a fully quantized Transformer. Indeed, compared to full-precision, our 8-bit models score greater or equal BLEU on most tasks. Comparing ourselves to all previously proposed methods, we achieve state-of-the-art quantization results.
Pages: 1-14
Page count: 14