Towards Unified Deep Learning Model for NSFW Image and Video Captioning

被引：0

作者：

Ko, Jong-Won ^{[1
]}

Hwang, Dong-Hyun ^{[1
]}

机构：

[1] Enumnet Co Ltd, Res & Dev Ctr, Seoul Si, South Korea

来源：

ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018 | 2019年 / 518卷

关键词：

Deep learning; CNN; RNN; NSFW image and video captioning;

D O I：

10.1007/978-981-13-1328-8_8

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The deep learning model is an evolution of an artificial intelligence model called the Artificial Neural Network. And the internal layer of an artificial neural network consisting of layers is a multi-stage structure, the latest deep learning model has a larger number of internal layers, which can result in up to billions of nodes. In addition, learning models combining CNN and RNN to comment on pictures or video are currently being studied. The images or videos are input by CNN, summarized, and the results are input into RNN for printing out meaningful sentences, the so-called image and video captioning. This paper proposes unified deep learning model for NSFW image and video captioning. As noted above, traditional studies on image and video captioning have been approached via a combination of CNN and RNN models. In contrast, in this paper, the classification for safety judgement, object detection, and captioning can all be handled through one dataset definition.

引用

页码：57 / 63

页数：7

共 50 条

[21] Image/video captioning
Ushiku Y.
Ushiku, Yoshitaka, 2018, Inst. of Image Information and Television Engineers (72): : 650 - 654
[22] Deep learning and knowledge graph for image/video captioning: A review of datasets, evaluation metrics, and methods
Wajid, Mohammad Saif
Terashima-Marin, Hugo
Najafirad, Peyman
Wajid, Mohd Anas
ENGINEERING REPORTS, 2024, 6 (01)
[23] Generative image captioning in Urdu using deep learning
Afzal M.K.
Shardlow M.
Tuarob S.
Zaman F.
Sarwar R.
Ali M.
Aljohani N.R.
Lytras M.D.
Nawaz R.
Hassan S.-U.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 7719 - 7731
[24] A Hybridized Deep Learning Method for Bengali Image Captioning
Humaira, Mayeesha
Paul, Shimul
Jim, Md Abidur Rahman Khan
Ami, Amit Saha
Shah, Faisal Muhammad
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 698 - 707
[25] Deep learning-based solar image captioning
Baek, Ji-Hye
Kim, Sujin
Choi, Seonghwan
Park, Jongyeob
Kim, Dongil
ADVANCES IN SPACE RESEARCH, 2024, 73 (06) : 3270 - 3281
[26] Image Captioning Using Multimodal Deep Learning Approach
Farkh, Rihem
Oudinet, Ghislain
Foued, Yasser
Computers, Materials and Continua, 2024, 81 (03): : 3951 - 3968
[27] Learning Text-to-Video Retrieval from Image Captioning
Ventura, Lucas
Schmid, Cordelia
Varol, Gul
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 1834 - 1854
[28] RETRACTED: Medical Image Captioning Using Optimized Deep Learning Model (Retracted Article)
Singh, Arjun
Raguru, Jaya Krishna
Prasad, Gaurav
Chauhan, Surbhi
Tiwari, Pradeep Kumar
Zaguia, Atef
Ullah, Mohammad Aman
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[29] Automated Image Captioning Using Sparrow Search Algorithm With Improved Deep Learning Model
Arasi, Munya A.
Alshahrani, Haya Mesfer
Alruwais, Nuha
Motwakel, Abdelwahed
Ahmed, Noura Abdelaziz
Mohamed, Abdullah
IEEE ACCESS, 2023, 11 : 104633 - 104642
[30] Multimodal Deep Neural Network with Image Sequence Features for Video Captioning
Oura, Soichiro
Matsukawa, Tetsu
Suzuki, Einoshin
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,

← 1 2 3 4 5 →