Fine-scale observations of spatio-spectro-temporal dynamics of bird vocalizations using robot audition techniques

被引:10
作者
Sumitani, Shinji [1 ]
Suzuki, Reiji [1 ]
Matsubayashi, Shiho [2 ]
Arita, Takaya [1 ]
Nakadai, Kazuhiro [3 ,4 ]
Okuno, Hiroshi G. [5 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi, Japan
[2] Osaka Univ, Grad Sch Engn, Suita, Osaka, Japan
[3] Tokyo Inst Technol, Sch Engn, Dept Syst & Control Engn, Tokyo, Japan
[4] Honda Res Inst Japan Co, Wako, Saitama, Japan
[5] Waseda Univ, Grad Sch Fundamental Sci & Engn, Shinjuku Ku, Tokyo, Japan
关键词
Bird songs; ecoacoustics; robot audition; sound source localization; soundscape; t-SNE; ACOUSTIC INTERACTIONS; MICROPHONE ARRAYS; LOCALIZATION; COMMUNITIES; LOCATION; ECOLOGY; SENSOR;
D O I
10.1002/rse2.152
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Ecoacoustics needs sophisticated acoustic monitoring tools to extract a wide level of features from an observed mixture of sounds. We have developed a portable acoustic monitoring system called 'HARKBird' which consists of a laptop PC and an inexpensive commercial microphone array with the robot audition software HARK. HARKBird can extract acoustic events in a recording, and we can obtain the begin and end timings, the spatial information (e.g., position or direction from the microphone array), and the spectrogram of the sound separated from the original recording. In this study, we report how robot audition techniques contribute to monitoring spatio-spectro-temporal dynamics of bird behaviors, using an extended and minimal system based on multiple microphone arrays. The dimension reduction of separated sounds is important to integrate the information from multiple microphone arrays. As a dimension reduction algorithm, we use t-SNE to help manual annotation of each sound and to generate the vocalization distribution automatically. We conduct playback experiments to Spotted Towhee (Pipilo maculatus) to simulate different cases of territorial intrusions (song/call/no playback). Our hypothesis in playback experiments is that playback of conspecific vocalizations would invoke aggressive responses of males against song playbacks and the effects would be more prominent than those of call playbacks. Our primary aim is to test whether our system can extract the necessary information on the aggressiveness of target individuals to examine our hypothesis. We show the system with manual annotation of vocalizations can extract their different spatio-spectro-temporal dynamics in different conditions, which supported our hypothesis. We also consider the spectral affinity-based automatic matching of localized sounds from different microphone arrays. The relative number of localized songs depending on the playback conditions reflected a similar trend to those in the manual approach, implying that we can grasp the long-term dynamics of vocalizations without costly annotations.
引用
收藏
页码:18 / 35
页数:18
相关论文
共 44 条
[31]   Detailed temporal structure of communication networks in groups of songbirds [J].
Stowell, Dan ;
Gill, Lisa ;
Clayton, David .
JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2016, 13 (119)
[32]  
Sumitani S, 2019, INT CONF ACOUST SPEE, P8246, DOI 10.1109/ICASSP.2019.8683743
[33]  
Sumitani S, 2018, IEEE INT C INT ROBOT, P2485, DOI 10.1109/IROS.2018.8594130
[34]  
Suzuki R., 2018, J. Ecoacoustics, V2, pEYAJ46, DOI [10.22261/jea.eyaj46, DOI 10.22261/JEA.EYAJ46, 10.22261/JEA.EYAJ46]
[35]   Complex systems approaches to temporal soundspace partitioning in bird communities as a self-organizing phenomenon based on behavioral plasticity [J].
Suzuki, Reiji ;
Cody, Martin L. .
ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (04) :439-444
[36]   A spatiotemporal analysis of acoustic interactions between great reed warblers (Acrocephalus arundinaceus) using microphone arrays and robot audition software HARK [J].
Suzuki, Reiji ;
Matsubayashi, Shiho ;
Saito, Fumiyuki ;
Murate, Tatsuyoshi ;
Masuda, Tomohisa ;
Yamamoto, Koichi ;
Kojima, Ryosuke ;
Nakadai, Kazuhiro ;
Okuno, Hiroshi G. .
ECOLOGY AND EVOLUTION, 2018, 8 (01) :812-825
[37]  
Suzuki R, 2017, J ROBOT MECHATRON, V29, P213, DOI 10.20965/jrm.2017.p0213
[38]  
Tan M., 2017, BIRD SOUNDS 2D VISUA
[39]  
Thielk M., 2019, 870311 BIORXIV
[40]   SoundCompass: A Distributed MEMS Microphone Array-Based Sensor for Sound Source Localization [J].
Tiete, Jelmer ;
Dominguez, Federico ;
da Silva, Bruno ;
Segers, Laurent ;
Steenhaut, Kris ;
Touhafi, Abdellah .
SENSORS, 2014, 14 (02) :1918-1949