Taming Binarized Neural Networks and Mixed-Integer Programs

Cited by: 0
Authors
Aspman, Johannes [1]
Korpas, Georgios [1, 2]
Marecek, Jakub [1]
Affiliations
[1] Czech Tech Univ, Dept Comp Sci, Prague, Czech Republic
[2] HSBC Holdings, HSBC Lab, Innovat & Ventures, London, England
Keywords
FIELD;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There has been a great deal of recent interest in binarized neural networks, especially because of their explainability. At the same time, automatic differentiation algorithms such as backpropagation fail for binarized neural networks, which limits their applicability. We show that binarized neural networks admit a tame representation by reformulating the problem of training binarized neural networks as a subadditive dual of a mixed-integer program, which we show to have nice properties. This makes it possible to use the framework of Bolte et al. for implicit differentiation, which offers the possibility for practical implementation of backpropagation in the context of binarized neural networks. This approach could also be used for a broader class of mixed-integer programs, beyond the training of binarized neural networks, as encountered in symbolic approaches to AI and beyond.
Pages: 10935-10943
Page count: 9