Innovating Sustainability: VQA-Based AI for Carbon Neutrality Challenges

Cited by: 66
Authors
Chen, Yanyu [1]
Li, Qian [1]
Liu, JunYi [1]
Affiliations
[1] Zhejiang Normal Univ, Jinhua, Peoples R China
Keywords
ALBEF; Artificial Intelligence; Carbon Neutrality; CLIP; Environmental Governance Decision-making; Sustainable Development; VQA; ENERGY;
DOI
10.4018/JOEUC.337606
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Carbon neutrality has become a focal point of global concern. Greenhouse gas emissions and rising atmospheric temperatures are triggering extreme weather events, sea level rise, and ecological imbalances. These changes not only affect the stability and sustainable development of human society but also pose a serious threat to the Earth's ecosystems and biodiversity. Faced with this global challenge, finding effective solutions has become urgent. This article proposes a comprehensive artificial intelligence design approach to address issues related to carbon neutrality. The method integrates technologies from computer vision, natural language processing, and deep learning to achieve a comprehensive understanding of environmental conservation and to support innovative solutions. Specifically, the authors first use a visual module to extract features from images, capturing the important information they contain. Next, they employ the ALBEF model for cross-modal alignment, enabling better collaboration between image and textual information.
Pages: 22
Related papers
37 in total
[1]  
Akula AR, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P2148
[2]  
Alayrac JB, 2022, ADV NEUR IN
[3]   Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J].
Anderson, Peter ;
He, Xiaodong ;
Buehler, Chris ;
Teney, Damien ;
Johnson, Mark ;
Gould, Stephen ;
Zhang, Lei .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6077-6086
[4]   VQA: Visual Question Answering [J].
Antol, Stanislaw ;
Agrawal, Aishwarya ;
Lu, Jiasen ;
Mitchell, Margaret ;
Batra, Dhruv ;
Zitnick, C. Lawrence ;
Parikh, Devi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2425-2433
[5]   Scalable Discovery of Hybrid Process Models in a Cloud Computing Environment [J].
Cheng, Long ;
van Dongen, Boudewijn F. ;
van der Aalst, Wil M. P. .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (02) :368-380
[6]  
Firat M., 2023, Journal of Applied Learning and Teaching, V6, DOI DOI 10.37074/JALT.2023.6.1.22
[7]   Question-Led object attention for visual question answering [J].
Gao, Lianli ;
Cao, Liangfu ;
Xu, Xing ;
Shao, Jie ;
Song, Jingkuan .
NEUROCOMPUTING, 2020, 391 :227-233
[8]   From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models [J].
Guo, Jiaxian ;
Li, Junnan ;
Li, Dongxu ;
Tiong, Anthony Meng Huat ;
Li, Boyang ;
Tao, Dacheng ;
Hoi, Steven .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10867-10877
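[9]   Short-Text Semantic Similarity Measurement Algorithm Based on a Hybrid Machine Learning Model [J].
Han, Kaixu ;
Yuan, Shufang .
Journal of Jilin University (Science Edition), 2023, 61 (04) :909-914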
[10]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110