Subjective and Objective Analysis of Indian Social Media Video Quality

被引:0
作者
Mishra, Sandeep [1 ]
Jha, Mukul [2 ]
Bovik, Alan C. [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] ShareChat Inc, Bengaluru 560103, India
基金
美国国家科学基金会;
关键词
Databases; Quality assessment; Video recording; Predictive models; Feature extraction; Social networking (online); Cultural differences; Smart phones; Data models; Visualization; No-reference video quality assessment; user-generated culturally-oriented mobile video quality database;
D O I
10.1109/TIP.2024.3512376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We conducted a large-scale subjective study of the perceptual quality of User-Generated Mobile Video Content on a set of mobile-originated videos obtained from ShareChat, a social media platform widely used across India. The content viewed by volunteer human subjects under controlled laboratory conditions has the benefit of culturally diversifying the existing corpus of User-Generated Content (UGC) video quality datasets. There is a great need for large and diverse UGC-VQA datasets, given the explosive global growth of the visual internet and social media platforms. This is particularly true in regard to videos obtained by smartphones, especially in rapidly emerging economies like India. ShareChat provides a safe and cultural community oriented space for users to generate and share content in their preferred Indian languages and dialects. Our subjective quality study, which is based on this data, supplies much needed cultural, visual, and language diversification to the overall shareable corpus of video quality data. We expect that this new data resource will also allow for the development of systems that can predict the perceived visual quality of Indian social media videos, and in this context, control scaling and compression protocols for streaming, provide better user recommendations, and guide content analysis and processing. We demonstrate the value of the new data resource by conducting a study of leading No-Reference Video Quality Assessment (NR-VQA) models on it, including a simple new model, called MoEVA, which deploys a mixture of experts to predict video quality. Both the new LIVE-ShareChat Database and sample source code for MoEVA are being made freely available to the research community at https://github.com/sandeep-sm/LIVE-SC.
引用
收藏
页码:140 / 153
页数:14
相关论文
共 57 条
[1]   Fast and reliable structure-oriented video noise estimation [J].
Amer, A ;
Dubois, E .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (01) :113-118
[2]  
Bosse S, 2016, IEEE IMAGE PROC, P3773, DOI 10.1109/ICIP.2016.7533065
[3]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[4]  
Chen T., 2020, PMLR, P1597
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]  
DIGITAL, 2023, INDIA
[7]  
Ebenezer JP, 2023, Arxiv, DOI arXiv:2304.13162
[8]  
economictimes, 2022, India Set To Surpass 1 Billion Internet Users, $400 Bn Online Spending By 2030
[9]   Measurement of ringing artifacts in JPEG images [J].
Feng, XJ ;
Allebach, JP .
DIGITAL PUBLISHING, 2006, 6076
[10]  
Ghadiyaram D, 2017, Live in the Wild Image Quality Challenge Database