Evaluating and mitigating discrimination in language model decisions

Published:

Recommended citation: Alex Tamkin, Amanda Askell, Liane Lovitt, Esin Durmus, Nicholas Joseph, Shauna Kravec, Karina Nguyen, Jared Kaplan, Deep Ganguli. Evaluating and mitigating discrimination in language model decisions. arXiv preprint arXiv:2312.03689 (2023) https://arxiv.org/abs/2312.03689

Summary:

Leave a Comment