MMSS: Multi-modal story-oriented video summarization

被引:7
|
作者
Pan, JY [1 ]
Yang, H [1 ]
Faloutsos, C [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
来源
FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2004年
关键词
D O I
10.1109/ICDM.2004.10033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose multi-modal story-oriented video summarization (MMSS) which, unlike previous works that use fine-tuned, domain-specific heuristics, provides a domain-independent, graph-based framework. MMSS uncovers correlation between information of different modalities which gives meaningful story-oriented news video summaries. MMSS can also be applied for video retrieval, giving performance that matches the best traditional retrieval techniques (OKAPI and LSI), with no fine-tuned heuristics such as tf/idf.
引用
收藏
页码:491 / 494
页数:4
相关论文
共 50 条
  • [31] Multi-modal Laughter Recognition in Video Conversations
    Escalera, Sergio
    Puertas, Eloi
    Radeva, Petia
    Pujol, Oriol
    2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 869 - 874
  • [32] Multi-modal tracking of faces for video communications
    Crowley, JL
    Berard, F
    1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, : 640 - 645
  • [33] Multi-modal humor segment prediction in video
    Zekun Yang
    Yuta Nakashima
    Haruo Takemura
    Multimedia Systems, 2023, 29 : 2389 - 2398
  • [34] The Multi-Modal Video Reasoning and Analyzing Competition
    Peng, Haoran
    Huang, He
    Xu, Li
    Li, Tianjiao
    Liu, Jun
    Rahmani, Hossein
    Ke, Qiuhong
    Guo, Zhicheng
    Wu, Cong
    Li, Rongchang
    Ye, Mang
    Wang, Jiahao
    Zhang, Jiaxu
    Liu, Yuanzhong
    He, Tao
    Zhang, Fuwei
    Liu, Xianbin
    Lin, Tao
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 806 - 813
  • [35] A cinemetric approach to sentimental processing on story-oriented contents
    Seung-Bo Park
    Eunsoon You
    Jason J. Jung
    Quality & Quantity, 2014, 48 : 49 - 62
  • [36] A cinemetric approach to sentimental processing on story-oriented contents
    Park, Seung-Bo
    You, Eunsoon
    Jung, Jason J.
    QUALITY & QUANTITY, 2014, 48 (01) : 49 - 62
  • [37] News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003
    Hsu, W
    Kennedy, L
    Huang, CW
    Chang, SF
    Lin, CY
    Iyengar, G
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 645 - 648
  • [38] Multi-modal fusion for associated news story retrieval
    Younessian, Ehsan
    Rajan, Deepu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (08) : 2563 - 2585
  • [39] Multi-modal Solution for Unconstrained News Story Retrieval
    Younessian, Ehsan
    Rajan, Deepu
    ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 186 - 195
  • [40] Multi-modal fusion for associated news story retrieval
    Ehsan Younessian
    Deepu Rajan
    Multimedia Tools and Applications, 2015, 74 : 2563 - 2585