Our paper “Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination” has been accepted to ACL 2024. This work was done during an internship at LG AI Research (Sep 2023 - Dec 2023).
