Perplexity-aware Correction for Robust Alignment with Noisy Preferences

This is the source code for the NeurIPS 2024 paper "Perplexity-aware Correction for Robust Alignment with Noisy Preferences" by Keyi Kong* (SDU), Xilie Xu* (NUS), Di Wang (KAUST), Jingfeng Zhang (University of Auckland/RIKEN-AIP), and Mohan Kankanhalli (NUS).

Let's align LLMs via PerpCorrect

PerpCorrect corrects noisy preferences using PPLDiff, which is calculated through an iteratively trained surrogate LLM.
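As a rough illustration of the idea, the sketch below computes a perplexity difference for one preference pair and swaps the pair when that difference is large. It is not the code in `src/`: it assumes PPLDiff is the log-perplexity of the chosen response minus that of the rejected response under the surrogate LLM, and the function names, the dictionary keys, and the fixed `threshold` are all hypothetical. The per-token log-probabilities are taken as given rather than computed from a model.

```python
def log_perplexity(token_logprobs):
    """Return log(PPL): the mean negative log-likelihood per response token."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return -sum(token_logprobs) / len(token_logprobs)


def ppl_diff(chosen_logprobs, rejected_logprobs):
    """Assumed PPLDiff: log PPL(chosen) - log PPL(rejected)."""
    return log_perplexity(chosen_logprobs) - log_perplexity(rejected_logprobs)


def correct_pair(pair, threshold=0.0):
    """Swap chosen/rejected when PPLDiff exceeds a hypothetical threshold.

    A large positive PPLDiff means the surrogate model finds the labelled
    "chosen" response less likely than the "rejected" one, which this
    sketch treats as a sign that the label was flipped.
    """
    diff = ppl_diff(pair["chosen_logprobs"], pair["rejected_logprobs"])
    if diff <= threshold:
        return dict(pair)  # keep the label as is
    return {
        "chosen": pair["rejected"],
        "rejected": pair["chosen"],
        "chosen_logprobs": pair["rejected_logprobs"],
        "rejected_logprobs": pair["chosen_logprobs"],
    }
```

In the actual method the surrogate LLM is trained iteratively, so a real pipeline would recompute the log-probabilities after each round rather than use fixed values.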

Python Environment

This code mainly uses Hugging Face's trl library. Use the following command to set up the environment.

pip install -r requirements.txt

Preprocessing and SFT

# preprocess the preference dataset first
python src/preprocessing.py
# then run supervised fine-tuning
bash bash/sft.sh

PerpCorrect and Robust Alignment

For the DPO series of experiments, use the following script.

bash bash/dpo.sh

For the PPO series of experiments, use the following script.

bash bash/ppo.sh

To run rDPO experiments, modify the loss computation in the trl library as follows:

if self.loss_type == "sigmoid":
    # cDPO: a non-negative label_smoothing is used as the label-flip probability
    if self.label_smoothing >= 0:
        losses = (
            - F.logsigmoid(self.beta * logits) * (1 - self.label_smoothing)
            - F.logsigmoid(-self.beta * logits) * self.label_smoothing
        )
    # rDPO: a negative label_smoothing selects this branch,
    # with flip probability eps = -self.label_smoothing
    else:
        losses = (
            - F.logsigmoid(self.beta * logits) * (1 + self.label_smoothing)
            - F.logsigmoid(-self.beta * logits) * self.label_smoothing
        ) / (1 + 2 * self.label_smoothing)
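To make the two branches easier to check by hand, here is a scalar, dependency-free restatement of the snippet above. It is only a sketch: the function names are hypothetical, `eps` stands for the flip probability (equal to `label_smoothing` in the cDPO branch and to `-label_smoothing` in the rDPO branch), and `z` stands for `self.beta * logits`.

```python
import math


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x)) for a single float."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def cdpo_loss(z, eps):
    """cDPO branch: label-smoothed sigmoid loss, with 0 <= eps."""
    return -log_sigmoid(z) * (1 - eps) - log_sigmoid(-z) * eps


def rdpo_loss(z, eps):
    """rDPO branch rewritten with eps = -label_smoothing, 0 < eps < 0.5."""
    return (-log_sigmoid(z) * (1 - eps) + log_sigmoid(-z) * eps) / (1 - 2 * eps)


if __name__ == "__main__":
    z, eps = 0.7, 0.2
    print("cDPO:", cdpo_loss(z, eps))
    print("rDPO:", rdpo_loss(z, eps))
```

One property that is easy to verify with this form: averaging the rDPO loss over a label that is flipped with probability `eps`, that is `(1 - eps) * rdpo_loss(z, eps) + eps * rdpo_loss(-z, eps)`, gives back the plain loss `-log_sigmoid(z)`.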

Acknowledgement

This project is built upon trl.

BibTeX

@inproceedings{
    kong2024perplexityaware,
    title={Perplexity-aware Correction for Robust Alignment with Noisy Preferences},
    author={Keyi Kong and Xilie Xu and Di Wang and Jingfeng Zhang and Mohan Kankanhalli},
    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
    year={2024},
    url={https://openreview.net/forum?id=OUXnnPJzXJ}
}

Contact

If you have any questions, please e-mail luxinyayaya@mail.sdu.edu.cn.
