Many facts come with an expiration date, from the name of the President to the basketball team Lebron James plays for. However, most language models (LMs) are trained on snapshots of data collected at a specific moment in time. This can limit their utility, especially in the closed-book setting where the pretraining corpus must contain the facts the model should memorize. We introduce a diagnostic dataset aimed at probing LMs for factual knowledge that changes over time and highlight problems with LMs at either end of the spectrum-those trained on specific slices of temporal data, as well as those trained on a wide range of temporal data. To mitigate these problems, we propose a simple technique for jointly modeling text with its timestamp. This improves memorization of seen facts from the training time period, as well as calibration on predictions about unseen facts from future time periods. We also show that models trained with temporal context can be efficiently "refreshed" as new data arrives, without the need for retraining from scratch.
机构:
Department of Computer Science and Engineering, Jeonbuk National University, Korea, Republic ofDepartment of Computer Science and Engineering, Jeonbuk National University, Korea, Republic of
Lee, Jong-Whi
Jung, Jinhong
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, Jeonbuk National University, Korea, Republic ofDepartment of Computer Science and Engineering, Jeonbuk National University, Korea, Republic of
机构:
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, ChinaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Hu, Yuelin
Xu, Yuanwu
论文数: 0引用数: 0
h-index: 0
机构:
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, ChinaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Xu, Yuanwu
Zhang, Yuejie
论文数: 0引用数: 0
h-index: 0
机构:
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, ChinaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Zhang, Yuejie
Feng, Rui
论文数: 0引用数: 0
h-index: 0
机构:
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, ChinaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Feng, Rui
Zhang, Tao
论文数: 0引用数: 0
h-index: 0
机构:
School of Information Management and Engineering, Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai,200433, ChinaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Zhang, Tao
Lu, Xuequan
论文数: 0引用数: 0
h-index: 0
机构:
School of Information Technology, Deakin University, Waurn Ponds, VIC,3216, AustraliaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
Lu, Xuequan
Gao, Shang
论文数: 0引用数: 0
h-index: 0
机构:
School of Information Technology, Deakin University, Waurn Ponds, VIC,3216, AustraliaSchool of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Collaborative Innovation Center of Intelligent Visual Computing, Fudan University, Shanghai,200433, China
机构:
Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Rao, Jun
Liu, Xuebo
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Liu, Xuebo
Lian, Lian
论文数: 0引用数: 0
h-index: 0
机构:
Huawei Cloud Computing Technologies Co., Ltd., ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Lian, Lian
Cheng, Shengjun
论文数: 0引用数: 0
h-index: 0
机构:
Huawei Cloud Computing Technologies Co., Ltd., ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Cheng, Shengjun
Liao, Yunjie
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Liao, Yunjie
Zhang, Min
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, ChinaInstitute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen, China
Zhang, Min
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference,
2024,
: 10064
-
10083