共 25 条
[1]
Smith J., Live event detection using audio signals, Proc. IEEE Int. Conf. Signal Process, pp. 1-5, (2023)
[2]
Brown A., Et al., Speech command dataset for audio event detection, (2018)
[3]
Lee C., Kim D., Deep learning innovations in video classification, IEEE Trans. Pattern Anal. Mach. Intell, 46, 3, pp. 1234-1245, (2024)
[4]
Johnson E., Et al., A survey on feature fusion for multi-modal deep learning, Proc. IEEE Int. Conf. Comput. Vis, pp. 567-572, (2020)
[5]
Garcia F., NLP for social event classification, Proc. Int. Conf. Nat. Lang. Process, pp. 89-94, (2014)
[6]
Zhang H., Liu Y., Video event detection using audio-visual fusion, Proc. IEEE Int. Conf. Multimedia Expo, pp. 1-6, (2014)
[7]
Chen M., Et al., Audio-visual grouplet for temporal event correlation, Proc. IEEE Conf. Comput. Vis. Pattern Recognit, pp. 123-128, (2011)
[8]
Patel R., Deep learning for video event detection, Proc. IEEE Int. Symp. Multimedia, pp. 45-50, (2015)
[9]
Kumar S., Nguyen T., Spatiotemporal event detection using CNN-LSTM, Proc. IEEE Int. Conf. Image Process, pp. 789-794, (2017)
[10]
Wang L., Et al., Multi-modal fusion for aggression detection in public transport, Proc. IEEE Int. Conf. Adv. Video Signal Based Surveill, pp. 1-6, (2019)