Neural architecture search for image saliency fusion

被引:20
作者
Bianco, Simone [1 ]
Buzzelli, Marco [1 ]
Ciocca, Gianluigi [1 ]
Schettini, Raimondo [1 ]
机构
[1] Univ Milano Bicocca, Dept Comp Sci Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
关键词
Saliency fusion; Evolutionary algorithms; Neural architecture search; OBJECT DETECTION; NETWORK; DRIVEN; MODEL;
D O I
10.1016/j.inffus.2019.12.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Saliency detection methods proposed in the literature exploit different rationales, visual clues, and assumptions, but there is no single best saliency detection algorithm that is able to achieve good results on all the different benchmark datasets. In this paper we show that fusing different saliency detection algorithms together by exploiting neural network architectures makes it possible to obtain better results. Designing the best architecture for a given task is still an open problem since the existing techniques have some limits with respect to the problem formulation, to the search space, and require very high computational resources. To overcome these problems, in this paper we propose a three-step fusion approach. In the first step, genetic programming techniques are exploited to combine the outputs of existing saliency algorithms using a set of provided operations. Having a discrete search space allows us a fast generation of the candidate solutions. In the second step, the obtained solutions are converted into backbone Convolutional Neural Networks (CNNs) where operations are all implemented with differentiable functions, allowing an efficient optimization of the corresponding parameters (in a continuous space) by backpropagation. In the last step, to enrich the expressiveness of the initial architectures, the networks are further extended with additional operations on intermediate levels of the processing that are once again efficiently optimized through backpropagation. Extensive experimental evaluations show that the proposed saliency fusion approach outperforms the state-of-the-art on the MSRAB dataset and it is able to generalize to unseen data of different benchmark datasets.
引用
收藏
页码:89 / 101
页数:13
相关论文
共 80 条
  • [1] [Anonymous], 2006, P IEEE CS C COMP VIS
  • [2] Seam carving for content-aware image resizing
    Avidan, Shai
    Shamir, Ariel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03):
  • [3] Aytekin Ç, 2015, IEEE IMAGE PROC, P1692, DOI 10.1109/ICIP.2015.7351089
  • [4] Context proposals for saliency detection
    Azaza, Aymen
    van de Weijer, Joost
    Douik, Ali
    Masana, Marc
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 174 : 1 - 11
  • [5] Baker Bowen, 2016, INT C LEARN REPR
  • [6] BANITALEBIDEHKO.A, 2015, P 23 EUR C SIGN PROC, P1541
  • [7] Optimizing feedforward artificial neural network architecture
    Benardos, P. G.
    Vosniakos, G. -C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (03) : 365 - 382
  • [8] Multiscale fully convolutional network for image saliency
    Bianco, Simone
    Buzzelli, Marco
    Schettini, Raimondo
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [9] How Far Can You Get by Combining Change Detection Algorithms?
    Bianco, Simone
    Ciocca, Gianluigi
    Schettini, Raimondo
    [J]. IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 96 - 107
  • [10] Combination of Video Change Detection Algorithms by Genetic Programming
    Bianco, Simone
    Ciocca, Gianluigi
    Schettini, Raimondo
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (06) : 914 - 928