Our paper “Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination” has been accepted to ACL 2024. This Work was done during an internship at LG AI Research (Sep 2023 - Dec 2023).

Updated: