共 115 条
- [1] Chang T.A., Bergen B.K., Language model behavior: A comprehensive survey, (2023)
- [2] Dev S., Sheng E., Zhao J., Amstutz A., Sun J., Hou Y., Sanseverino M., Kim J., Nishi A., Peng N., Et al., On measures of biases and harms in NLP, (2021)
- [3] Ganguli D., Lovitt L., Kernion J., Askell A., Bai Y., Kadavath S., Mann B., Perez E., Schiefer N., Ndousse K., Et al., Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned, (2022)
- [4] Hassan S., Huenerfauth M., Alm C.O., Unpacking the interdependent systems of discrimination: Ableist bias in NLP systems through an intersectional lens, (2021)
- [5] Ousidhoum N., Zhao X., Fang T., Song Y., Yeung D.-Y., Probing toxic content in large pre-trained language models, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 4262-4274, (2021)
- [6] Nozza D., Bianchi F., Lauscher A., Hovy D., Et al., Measuring harmful sentence completion in language models for LGBTQIA+ individuals, Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion, Association for Computational Linguistics, (2022)
- [7] Gehman S., Gururangan S., Sap M., Choi Y., Smith N.A., RealToxicityPrompts: Evaluating neural toxic degeneration in language models, (2020)
- [8] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser L., Polosukhin I., Attention is all you need, Advances in Neural Information Processing Systems, 30, (2017)
- [9] Brown T., Mann B., Ryder N., Subbiah M., Kaplan J.D., Dhariwal P., Neelakantan A., Shyam P., Sastry G., Askell A., Et al., Language models are few-shot learners, Advances in Neural Information Processing Systems, 33, pp. 1877-1901, (2020)
- [10] Chowdhery A., Narang S., Devlin J., Bosma M., Mishra G., Roberts A., Barham P., Chung H.W., Sutton C., Gehrmann S., Et al., Palm: Scaling language modeling with pathways, (2022)