Joint Coding of Local and Global Deep Features in Videos for Visual Search

Cited by: 15

Authors:

Ding, Lin ^{[1]}

Tian, Yonghong ^{[1,2]}

Fan, Hongfei ^{[3]}

Chen, Changhuai ^{[4]}

Huang, Tiejun ^{[1]}

Affiliations:

[1] Peking Univ, Sch Elect Engn & Comp Sci, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

[3] Kingsoft Cloud Co, Beijing 100085, Peoples R China

[4] Hikvision Co, Hangzhou 310012, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020 / Vol. 29

Funding:

National Natural Science Foundation of China;

Keywords:

Local deep feature; joint coding; visual search; inter-feature correlation; EFFICIENT APPROACH; QUANTIZATION; DESCRIPTORS; RETRIEVAL;

DOI:

10.1109/TIP.2020.2965306

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In practice, it is more feasible to collect compact visual features, rather than the video streams themselves, from hundreds of thousands of cameras into the cloud for big data analysis and retrieval. The problem then becomes which kinds of features should be extracted, compressed and transmitted to meet the requirements of various visual tasks. Recently, many studies have indicated that the activations from the convolutional layers of convolutional neural networks (CNNs) can be treated as local deep features describing particular details inside an image region, which can then be aggregated (e.g., using Fisher Vectors) into a powerful global descriptor. A combination of local and global features can satisfy these various needs effectively. It has also been validated that, if only the local deep features are coded and transmitted to the cloud while the global features are recovered from the decoded local features, the aggregated global features are lossy and consequently degrade the overall performance. Therefore, this paper proposes a joint coding framework for local and global deep features (DFJC) extracted from videos. In this framework, we introduce a coding scheme for real-valued local and global deep features with intra-frame lossy coding and inter-frame reference coding. A theoretical analysis is performed to understand how the number of inliers varies with the number of local features. Moreover, the inter-feature correlations are exploited in our framework: local feature coding can be accelerated by making use of the frame types determined with the global features, while the lossy global features aggregated from the decoded local features can be used as a reference for global feature coding. Extensive experimental results under three metrics show that our DFJC framework can significantly reduce the bitrate of local and global deep features from videos while maintaining the retrieval performance.

Citation

Pages: 3734 - 3749

Page count: 16
