FAMINet: Learning Real-Time Semisupervised Video Object Segmentation With Steepest Optimized Optical Flow

被引:0
|
作者
Liu, Ziyang [1 ]
Liu, Jingmeng [1 ]
Chen, Weihai [2 ]
Wu, Xingming [1 ]
Li, Zhengguo [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[3] Inst Infocomm Res, SRO Dept, Singapore 138632, Singapore
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Optical imaging; Integrated optics; Motion segmentation; Feature extraction; Adaptive optics; Optical network units; Streaming media; Online memorizing; optical flow; real time; relaxed steepest descent; semisupervised video object segmentation (VOS);
D O I
10.1109/TIM.2021.3133003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semisupervised video object segmentation (VOS) aims to segment a few moving objects in a video sequence, where these objects are specified by annotation of the first frame. The optical flow has been considered in many existing semisupervised VOS methods to improve the segmentation accuracy. However, the optical flow-based semisupervised VOS methods cannot run in real time due to high complexity of optical flow estimation. A FAMINet, which consists of a feature extraction network (F), an appearance network (A), a motion network (M), and an integration network (I), is proposed in this study to address the above-mentioned problem. The appearance network outputs an initial segmentation result based on static appearances of objects. The motion network estimates the optical flow via very few parameters, which are optimized rapidly by an online memorizing algorithm named relaxed steepest descent. The integration network refines the initial segmentation result using the optical flow. Extensive experiments demonstrate that the FAMINet outperforms other state-of-the-art semisupervised VOS methods on the DAVIS and YouTube-VOS benchmarks and achieves a good trade-off between accuracy and efficiency. Our code is available at https://github.com/liuziyang123/FAMINet.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Adaptive foreground object extraction for real-time video surveillance with lighting variations
    Zeng, Hui-Chi
    Lai, Shang-Hong
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 1201 - 1204
  • [42] Task-Oriented Video Compressive Streaming for Real-Time Semantic Segmentation
    Xiao, Xuedou
    Zuo, Yingying
    Yan, Mingxuan
    Wang, Wei
    He, Jianhua
    Zhang, Qian
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 14396 - 14413
  • [43] EFFICIENT REAL-TIME LOCAL OPTICAL FLOW ESTIMATION BY MEANS OF INTEGRAL PROJECTIONS
    Senst, Tobias
    Eiselein, Volker
    Paetzold, Michael
    Sikora, Thomas
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [44] rStaple: A Robust Complementary Learning Method for Real-Time Object Tracking
    He, Wangpeng
    Li, Heyi
    Liu, Wei
    Li, Cheng
    Guo, Baolong
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [45] Novel video stabilization for real-time optical character recognition applications
    Lee, Yun Gu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 44 : 148 - 155
  • [46] SPARSE OPTICAL FLOW REGULARIZATION FOR REAL-TIME VISUAL TRACKING
    Spruyt, Vincent
    Ledda, Alessandro
    Philips, Wilfried
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [47] Real-time optical flow measurement based on GPU architecture
    Minami, Shogo
    Yamaguchi, Teruo
    Harada, Hiroshi
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2012, : 305 - 307
  • [48] Clustering Motion for Real-Time Optical Flow based Tracking
    Senst, Tobias
    Evangelio, Ruben Heras
    Keller, Ivo
    Sikora, Thomas
    2012 IEEE NINTH INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL-BASED SURVEILLANCE (AVSS), 2012, : 410 - 415
  • [49] Event-based Real-time Optical Flow Estimation
    Lee, Alex Junho
    Kim, Ayoung
    2017 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2017, : 787 - 791
  • [50] Real-time video object detection and classification using hybrid texture feature extraction
    Venkatesvara Rao N.
    Venkatavara Prasad D.
    Sugumaran M.
    International Journal of Computers and Applications, 2021, 43 (02) : 119 - 126